Gene GM21_1830 details


Gene Information

Locus tag: GM21_1830
Symbol:
ID: 8137161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2129663
End bp: 2131513
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 59%
IMG OID: 644869441
Product: TonB-dependent receptor plug
Protein accession: YP_003021641
Protein GI: 253700452
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
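
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(2129663, 2131513))  # prints (1851, 616)
```

Under these assumptions the result agrees with the listed gene length of 1851 bp and protein length of 616 aa.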


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.000821839
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCAAAGAA CCATTGCCAT TGTTCAGTCT GCTTTTGCCA GAGCTAAGGC CCCTCTGATA 
GTGCAGATGT TGTTTTTGAT ATTTGCCGCC TCTCAGGCGC TGGGGGACGA GGAGCTTCTC
TCTTTGGAAC TCTACAACGG CGACGAGGCC GAGATAGTGT CGGGGAGCAG GTCCCCAAGG
CCCGCCTCTC AGACCGCCGA GAACATAACC GTGGTCACTT CCGAGGATAT ATTGGCGTTG
AACGCGCACA CACTGGCGGA CATACTCTAT GCCGTAACCG GCGTTCAGAT GGAGATGACC
CGCACCCCGG GGGCCTCAAC CAGCTTCAAG CTGCAGGGGG CCAACTTCAA CCACGTGCTG
GTGTTGATAG ACAACGTCCC CCTCAATACG CTCTCCGAGA ACTACCCCGA CATCGCAGCG
GTCCCGGTGC AGATGATCGA GAGGGTGGAA ATAGTGAAGG GCGCCGCGTC GTCGGCATGG
GGCAACGCCT TGGGCGGGGT CATCAACGTC ATGACCAAGA TGCCCGACCA GGAACGTCCC
GTGGGGGGGA TGGTATCCGG TTCTTTTGGC AAACGAGAGA CCGCTGACCT CAGGGGGGAG
TTGAGCGGCA CGGCGAACAG CAAAGGCTAC TACATAACCG GCGGGAAGCT TAAGTCGGAC
GGTCTTCTGG CCAACAACAT GGTCGACAAG GAAAACTTCT ACGGCAAGTT GCTGTACGAC
CTGCCAGGCC GTGGTTCGCT CACCTTGACC ACCTGGCTTA GCCAGGGCTT CAACGGGATG
TACGAGGAAG GTCCAGTCTC CACCAACCAG GAAATGCGGT ATCTCATCAG CACCCTGGCC
GCGAGCTATC AGCTGTCCGA CCACCTGCGT CTCGAAGCCG CCGCGGTAGC CTCGGAAACT
GAGGGAACCC TCTTTCTACG CCAGATGGAC ATGCTGGACG CTTCCAGGAC ATCAACTGTT
GTTCTGGACG AGTCTAGGGT AGGAGGGAGC TTCAAGCTCT CCTGGCTGGA CGAGTTTCAG
AGGGTGGTGG CCGGGGTCGA TTACGAACAT GTGGCGGCGC AGGTGAGCAG CTCGCAGATC
ACGGCCGACC TCCTGAACCA AGGCGCTGAT CGGGTCGGCG TCTACCTGAG CGACACGCTG
ACGCTCGGGC GCTTTGCCGT CACCCCGAGT GCGCGCTTTG ACAGGACAGG CTCAGGCGGC
AACCACTTCA GCCCAAGCTT CGGCGTCACC TACGCGCTCA CCGACAACAG CGTCCTGCGC
GGATACACCG CCACCGGATA CAGCCTTACC TCGCTCAACC GCTCGGACTC CACGGAAAAG
GTCTGGACCT CGCAGTTCGG CTTCGAGTCC GGCGACCTTC CCTACCTCTG GGTTAAGGGA
ACCCTATTCA GAAACGATAC CTGGGACGTC ATCGCGCGGA GCCCGACCGG TATTGTGAAG
GAGCGGCAGC TGAAGCAGGG GGTTGAGCTT GAGGCGAAGA CCCTTCCTCT TTTTAACACC
TCGCTTTCTG TCGGGTACAC CTTCATCACC GCCACCCTGG GAGTCAACGG CCCCGTCATC
ATGGGGGTGC CCAGGCACAC CGTCAACATC GGACTCAGAT ACCAGGACGC AAGTAACTTG
CGCGGAGAGC TAACCGGGCA CTACCTGGAC TGGAACGAGG CTAGGCAGGG AGGAAGGTAC
AGTGACGTCA TCTGGGACCT GCACCTGAGA AAGGAGTTCA ACCGCTGGGA GCATGTGTCC
CTCGAGCCGT TTCTTTCGGT CAGGAACCTG CTGAACGGAC GTCAGTACCC AAACAGCTTG
TACGAGAACC CTGGGCGCTG GGTGGAAGCG GGGGTCAGAT GCAACTTCTG A
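
The GC content reported for this gene could be recomputed from the sequence block with a few lines of Python. This is a minimal sketch: the helper name `gc_percent` is hypothetical, and how the source database rounds its figure is not stated, so the sketch leaves rounding to the caller.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence, as a small demonstration.
print(round(gc_percent("TTGCAAAGAA CCATTGCCAT")))  # prints 40
```

To check the whole gene, pass the full sequence block to the function.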
 
Protein sequence
MQRTIAIVQS AFARAKAPLI VQMLFLIFAA SQALGDEELL SLELYNGDEA EIVSGSRSPR 
PASQTAENIT VVTSEDILAL NAHTLADILY AVTGVQMEMT RTPGASTSFK LQGANFNHVL
VLIDNVPLNT LSENYPDIAA VPVQMIERVE IVKGAASSAW GNALGGVINV MTKMPDQERP
VGGMVSGSFG KRETADLRGE LSGTANSKGY YITGGKLKSD GLLANNMVDK ENFYGKLLYD
LPGRGSLTLT TWLSQGFNGM YEEGPVSTNQ EMRYLISTLA ASYQLSDHLR LEAAAVASET
EGTLFLRQMD MLDASRTSTV VLDESRVGGS FKLSWLDEFQ RVVAGVDYEH VAAQVSSSQI
TADLLNQGAD RVGVYLSDTL TLGRFAVTPS ARFDRTGSGG NHFSPSFGVT YALTDNSVLR
GYTATGYSLT SLNRSDSTEK VWTSQFGFES GDLPYLWVKG TLFRNDTWDV IARSPTGIVK
ERQLKQGVEL EAKTLPLFNT SLSVGYTFIT ATLGVNGPVI MGVPRHTVNI GLRYQDASNL
RGELTGHYLD WNEARQGGRY SDVIWDLHLR KEFNRWEHVS LEPFLSVRNL LNGRQYPNSL
YENPGRWVEA GVRCNF
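
The gene sequence begins with TTG, yet the protein begins with M. This is consistent with translation table 11, where TTG is an accepted initiation codon and the first residue is read as methionine. The sketch below illustrates this with a hand-built codon table. The function name `translate_cds` is hypothetical, and the set of start codons is the one commonly listed for table 11, stated here as an assumption rather than taken from the record.

```python
from itertools import product

# Standard codon assignments, which table 11 shares; codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a recognized start codon")
    protein = ["M"]  # the initiator codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCAAAGAA CCATTGCCAT TGTTCAGTCT"))  # prints MQRTIAIVQS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function gives a way to compare the whole translation against the listed protein.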