Gene GM21_3768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3768 
Symbol 
ID8139142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4342609 
End bp4344039 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content62% 
IMG OID644871387 
ProductCarbohydrate-selective porin OprB 
Protein accessionYP_003023545 
Protein GI253702356 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3659] Carbohydrate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones112 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCA AAAAGATTCC GGTAGCAACT GCTTTCTTGT TGGGGAGCCT TTTGCTTTCT 
TCAGGAGCGG CGTTCGCGCT GCATCCGGAG CTGGTGGTTC CGGAAAGGGT GGAGCTGAAG
CACAAGGCCT GCCAGGAGAT TGTTCGGCTC GGAGCGAAGT ACAAGGTGGA GGGGCTCTTT
ACTCCGGAGT TCCTGGAGGG AAAGCAGCAC GACTGCAGCC GGATCGATGT TGCGCTGGCG
GTGCAGCTGC TGACCGAGAA AATGGCGGAA AAAGCGGTGA AGGAGGGGAA CCAGGCCGTG
GACAGGGAAG ACCTGCTGCT GCTCGCGGAC CTGAAGGAGG AACTGCGCGC CGAGATGCTT
CTGGTAGGTA CCCGGACCTT CCAGTCCCGC TACCAGGACC TGGGCACCAG GTTCACCGCC
CTCACAAAGA ACATCTCCCT CAGCGGCGGC ATGGTCGGCG TGCTCCAGGG AACCGTCGGC
CACAGCCCTA AGAATCACGC GGACACCGTG GGACGCGCCG ATCTGGTCTT CAACTTCAAG
GTGGGGGAGA ATACCATCGC CGTATTCGAC CTCGAAGCGA CCGGTGGCGA AGGGATCGAC
AACACCGCCG GCATCAACTC CTTCTCGGGC CTGAACGGCC TGGCCGGATC CACCGGCGAC
CGGGTCAGAT TTCGCGAGGC TTGGGTCGAG CATTCCGCGT TCGACGACCG GATGGTGCTG
ACCGCCGGCA AGGTCGACCT TTCCAACTAC TTCGACTCCA ACGCGGTGGC AAACGACGAG
ACTGGGCAGT TCCTGGCCGG CGCCTTCGTC CACTCCGCCG TGCTCCCCTT CCCCGCTAAC
GGGCCGGGTG CGAGGGTGGC CGCGAAACTG ACCGATTCCC TCGTCGTAGG CCTTGGCTAT
GGCAGCGGCG ATGCCGACAG CGAAGACAGC TCCGACTCAG CCGACATCTT CAGCCACGGC
TTTGGCATCG CGGAACTCGA CTATAAAGTC AAGGCCGGGA ATCTGGAAGG CGACTACCGT
CTTTACGCGG CCTTGGACGG AGCAGTCGCA GGCAAGCTGG AGCCGAAAAA CGCCTGGAAC
TTCGGCGTGA GCCTCGACCA GCAGCTGACC GACAAGCTGA CCCTCTTCGC CCGCTACGGT
CAGCGCGACA AGGATGTCTA CGAGGTCCAA AAGGCCTGGA GCGCAGGCGG ACAGTACACA
GGGCTTTTCC CTTCCAGGAA GGACGACGTT CTCGGCGTGG CCTACGGCCA GATCAAGGCG
CACGCATCCA TCGCCGACAC CCAGGAGAAA CTGACCGAGC TCTACTACAA CTTCAAGATA
AACGAGCAGA TCGAGATCGC ACCGGTGGCG CAGTACCTGG TCCACCCGGC CGGGATGCGC
GGCAACGACG ACGTGCTGGC GCTGGCGCTG CGTACCCGGA TCAGCTTCTG A
 
Protein sequence
MNFKKIPVAT AFLLGSLLLS SGAAFALHPE LVVPERVELK HKACQEIVRL GAKYKVEGLF 
TPEFLEGKQH DCSRIDVALA VQLLTEKMAE KAVKEGNQAV DREDLLLLAD LKEELRAEML
LVGTRTFQSR YQDLGTRFTA LTKNISLSGG MVGVLQGTVG HSPKNHADTV GRADLVFNFK
VGENTIAVFD LEATGGEGID NTAGINSFSG LNGLAGSTGD RVRFREAWVE HSAFDDRMVL
TAGKVDLSNY FDSNAVANDE TGQFLAGAFV HSAVLPFPAN GPGARVAAKL TDSLVVGLGY
GSGDADSEDS SDSADIFSHG FGIAELDYKV KAGNLEGDYR LYAALDGAVA GKLEPKNAWN
FGVSLDQQLT DKLTLFARYG QRDKDVYEVQ KAWSAGGQYT GLFPSRKDDV LGVAYGQIKA
HASIADTQEK LTELYYNFKI NEQIEIAPVA QYLVHPAGMR GNDDVLALAL RTRISF