Gene GM21_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3872 
Symbol 
ID8139246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4455577 
End bp4459143 
Gene Length3567 bp 
Protein Length1188 aa 
Translation table11 
GC content66% 
IMG OID644871489 
Producthypothetical protein 
Protein accessionYP_003023647 
Protein GI253702458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC CCGAGAAGAA AGAGAAGGAT CCAACAAAAA TCGCCGTCAT CGCCTCTCTC 
GTTGCTGCAT CGGCATTGGG GCTGGTGCTG CTTTGCCTCG TCGCCTTCCG GATCTACCTC
GCCACTTCCT TACCCGCCTC ACAGCTCTCC CGCCTTGTCA CCGACCGGCT GCAGCAGGAT
TTCACCGTCA AGGAGATCGA CCTTTCCGGC AAGGCCCTCA TCCTCAAAGG GGTGCGCCTG
AAGAACCCGG CGGGGTTTGC TTCCGGCGAA CTCTTCGCGG CCGACCGGGT GGTTCTCGCC
CCCCGGTGGA ATCAACTTCT GCGGGGACGG CAGCGCTTCG AGCTGATAGA CGTCGGGGGG
GGGAAGCTCG CCTTGGAAAA AAATGGCAGC GGCACCTGGA ACTTCGAGAA CCTCCAGCGC
CGGCTCGCGG CGAGAAAGCC TGCCGCGGAA AGACCAGCCC CGGAAACGGT GATCGGGAAG
CTCCTGGTGA AAAACGGCAG CATCACCATC CAGGGGAAAG GGGTGCACGG AATCAACCTG
CAGGTCTTCA ACCTGGCCAG CGGCGGGTCC CGTCGCGCCC AAGTGGAACT TGCCTTCGAG
GACGCCGTCG GCAACCGCTA CCTTTTGAAG GGGACCGCCC GCCCCGGAGC GGACGCCGCT
GTCGACCTCT CCCTCACGGC GCCTACCATG TCCCTGAAGA ACCTAGCGCA GCTTTTCAAG
CTGAAGGATC CCGAACCCTT TCAGGATGCC CAAGGGGCGC TGCGGGCGAG CGCGGTGCTG
GCGAAGGGGG AGCTGAAAAC CACCGGGAGC TTCTCCTTTC GCAGGGTCCT GATCCCCGCC
GCCCGGGGCG ACTACCCCAT CGCAGGTCTT TTGCAATTCA ACGGCGTCTA CGACCTCTCC
GAGGACACGG CCCACTTGGA CGAAGCGACG CTCACCATAG ACAAGATGGC GCGGCTGCAG
GCAGGCGGAA CCGTCAGCGG CGTGAAGAAG GAAAGAGTGT TCGATCTCCT TCTTTCCCTG
GACGCGGTGG ACCTCGCGCT GATAAACGTG CTGCTGCCGG AAGAATCGCG CCACGACCTG
GATTTCGGCG GCAGGCTCCG CTGCGAGTCG TTGCGCCTGG AAGGAAACGG CAGCGCGGGT
ATCGAGAGCG TGGTGGGGAA CCTGCAACTG CAGGAAGGCT CGCTGGCCCG GGAACGGGAA
TTGATCGCGG CGGGGGTGGC GGGGAATCTC GCCCTCTCCC GGCACGGTGC GACCGTGTCC
GCGCGCGGCA AGTTCTCCCT CCCCCATCCG GAGCCGAAGG CGCTGGTCCA GGCGCTGGAA
CTCCCGGTGG AACTCACCCT TTCCGCTGGC CTGAAGCCGC TTCGCGCCAA AAGCGATGCG
TTTTCCGCCA GGATCCTGGG GATTGCCGTG TCAGGGCGTG CCGCTTACGA CGCTGGGAAC
TCCGAGCCGG TGCGAGCCGA TCTCGCCTTT GCGACCAGGG AACCGGAGCG GCTCAACCCG
TGGCTCTCCC GTTACGGCAT CGCGGCCTTC TCCGGAACCG GTTCCGGCAG CGTCCTGTTG
GCGGGTAAGG GGGCCCAGGA GATGAACGCC TCGGCCCGGC TCACGGTCGC GAACCTCAAG
GGGAAGCAGG GGAAGGACAA TATCGGCGTG AAGGCGGGGA CGGCGACGGC CGCGGCGACG
AAGCGGGGAG AGAAACTAGA GGTTCGCGGG GACGCCCGTC TCGCCGCCAT CTCCTTCCAG
GGGAAGAGCG GCGGCGCGCG TTTCGCCTAC CGGGTGGCGG ACCGCTACCT CTACTTGAAC
GGGGTGCAAG CCTCGTGGGG GGAAACCCGC TTCTCCGCGT CGAGCCTGAG CGGAGAACTC
CCGGCCGCGA CGGTCACCGG GCAGCTTACC CGCCGTCCGC TGCGCTTCGA CCTTGAGGGG
GGGGCGCTCG GGCAGGGGGA TTTCCAGCTC TCCGGCATCG CGGGGAGGGT AAGGGGGAGC
CTCGCCGGCG AAGGGAAGGA GAAGTGGCTG GAAGGAAGCG CCGATCTCGC CTCCCGCGCC
CTCTCCTGGC GCCAGGGGGC GATGGCGGCG CCTGCCCTGC ATGCCGCTTT CACGAGGCAG
GGGGGGCGGG CCGAACTGAA CGGACAACTG CTGGGGGGTA AGGTGGCGGC AAAGGCGGCC
TTTCGCCCCT TCGCTCCGGG AGCGCCGTCC AGCTTCAGCG TCAGCATGAC CGGCGCCGCC
GCGAAGGAGA TTGCGCGCCT TGCCGCCGGC GAAGCCGGCA TCCGCCCCAG CGCCGGTACC
GTTGACTTTA GACTGGAAGG GACCCACGCC GGGAAAAGCG GTCTCTCCTG CCGCTTCGAC
GCCAGGGGGC GGGAGATCTC CCTCGCCAAC GCCACGGGGA AGAGCCTCGT GTCGGGTGCA
GCCGCTTCGG TCAGGGGGCG CCTGTCCGGA GGCACCCTGA CCATCGAAGA GGCCAATGCT
TCCCCGGGGC GGGGCGTTGT CCTTACCGCG CGGGGGGAGA TAGCGGATGC GCAGTCGGAG
AAGCGGCGCG GGACCTTGCT GCTAACCCTG CCGGAGACCT CGCTGAACGA CCTGGTGGAG
TCGCTCATCA ACCTCGCCCC CCCGCTGATC CAGGAGGCGA CGCTGAAGGG GAGCGTGGCG
GCGGAGGGGA GGATGGAGCT GCGCGAAGGA CGCAAGGTTC TCAACGGCGG CGTGACGGTG
AGGGGGGGAA GGGTGGAGGC GTCAGCCCAG AAATTCCTGG TGGCGGATCT GAACGGAAGG
ATTCCTATAT CTCTCGACCT CGGAGGCAAG GGGAGCGCCG CCCCGCGCGC CGCCGGGGAG
TTCAGCCGGG AGAATTATCC CCGCGTCCTG GCGCAACTGC GCGCAACCCA GCAGGGGGGG
GACCTGGTCA CCGTCGAGAG GATCGGCTTC GGCTCCGTCC AGACCGGGAA GCTCAGCGCG
CAGCTTCGTT CCCAAAACGG GCTCACGGAG ATCACCTCCC TGCGCACCAC CCTTTACGAC
GGGACCGTGT TGGGGAGAGG CCATCTTCTA TTTTCCGGCC GGGCGACCTA CCGGGGCGAC
CTCCTGGTGA ACGGTCTCAG CATGAAGGCC CTGTGCAAGA GCCTCCCCAA CCTTGAGGGG
TACATCTCCG GACGCGTGGA CGGCGTTGTC AGCGTGCACG GCATCGGCGG AGACCAAAAG
CGCTTGACCG GATTCGTAGA TCTCTGGGCG CGGGAAGGGG GGGGCGAAAA GATGGTGGTG
AGCAAGCAGT TCCTGCAGCG CCTGGCCAAG CAGAAGCTCT CAGGCTTCTT CCTGAGCCGC
GACCGCCCCT ACGACAGGGC GGAGATCAAG GCTACGCTGG AGCGGGGAGA GCTCACCTTC
AACGAGCTGC AGATCCTGAA CACCAACGCC CTCGGGGTCA AGGACCTGAA CGTCAACATC
GCCCCGACGC AGAACCGGAT CGCCCTGGAC CATCTGCTTG AGTCGATAAA GGAGGCGGCG
GTGCGCGGCG TGCCGGCCTC TGGCGGAGAG GCCCCGGCGC AGAAGCCGCA GCAGGAACCG
GCGCCCGAGT TCAAGTGGGA GGAGTGA
 
Protein sequence
MSEPEKKEKD PTKIAVIASL VAASALGLVL LCLVAFRIYL ATSLPASQLS RLVTDRLQQD 
FTVKEIDLSG KALILKGVRL KNPAGFASGE LFAADRVVLA PRWNQLLRGR QRFELIDVGG
GKLALEKNGS GTWNFENLQR RLAARKPAAE RPAPETVIGK LLVKNGSITI QGKGVHGINL
QVFNLASGGS RRAQVELAFE DAVGNRYLLK GTARPGADAA VDLSLTAPTM SLKNLAQLFK
LKDPEPFQDA QGALRASAVL AKGELKTTGS FSFRRVLIPA ARGDYPIAGL LQFNGVYDLS
EDTAHLDEAT LTIDKMARLQ AGGTVSGVKK ERVFDLLLSL DAVDLALINV LLPEESRHDL
DFGGRLRCES LRLEGNGSAG IESVVGNLQL QEGSLARERE LIAAGVAGNL ALSRHGATVS
ARGKFSLPHP EPKALVQALE LPVELTLSAG LKPLRAKSDA FSARILGIAV SGRAAYDAGN
SEPVRADLAF ATREPERLNP WLSRYGIAAF SGTGSGSVLL AGKGAQEMNA SARLTVANLK
GKQGKDNIGV KAGTATAAAT KRGEKLEVRG DARLAAISFQ GKSGGARFAY RVADRYLYLN
GVQASWGETR FSASSLSGEL PAATVTGQLT RRPLRFDLEG GALGQGDFQL SGIAGRVRGS
LAGEGKEKWL EGSADLASRA LSWRQGAMAA PALHAAFTRQ GGRAELNGQL LGGKVAAKAA
FRPFAPGAPS SFSVSMTGAA AKEIARLAAG EAGIRPSAGT VDFRLEGTHA GKSGLSCRFD
ARGREISLAN ATGKSLVSGA AASVRGRLSG GTLTIEEANA SPGRGVVLTA RGEIADAQSE
KRRGTLLLTL PETSLNDLVE SLINLAPPLI QEATLKGSVA AEGRMELREG RKVLNGGVTV
RGGRVEASAQ KFLVADLNGR IPISLDLGGK GSAAPRAAGE FSRENYPRVL AQLRATQQGG
DLVTVERIGF GSVQTGKLSA QLRSQNGLTE ITSLRTTLYD GTVLGRGHLL FSGRATYRGD
LLVNGLSMKA LCKSLPNLEG YISGRVDGVV SVHGIGGDQK RLTGFVDLWA REGGGEKMVV
SKQFLQRLAK QKLSGFFLSR DRPYDRAEIK ATLERGELTF NELQILNTNA LGVKDLNVNI
APTQNRIALD HLLESIKEAA VRGVPASGGE APAQKPQQEP APEFKWEE