Gene GM21_2791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2791 
Symbol 
ID8138134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3242032 
End bp3244020 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content64% 
IMG OID644870394 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003022583 
Protein GI253701394 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.589243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCTA CGCGCGAACT TTATTGGAAC ATAGGCCATG GGGCGGCGGT GCTGGTACCG 
ATGTACCTGG TCACCTTCGC CGCCTTTGGG CTCCTCGCCT ACTACGGCTA CCAGCGGCTC
GCCATCTACC GGCAGGGAAA GCCCTTGGAC AGGACCGACA ACCGCGCCGA GCGCGTCAAG
ATGCTCCTGC GAAACGTCCT GATGCAGACC AAGGTGCTCA AGGTCAAGGC CCCCGGCTTC
GCCCACGGCT TCTTCTTCTG GTCCTTCGCC GTCCTCTTCG TCGGGACGGC ACTCGTGGCG
CTGCAGGCCG ACGGCACCGA TCTCCTCTTC AACGTCCGCT TCCTCACCGG CGGCTTCTAC
AAGATCTTCT CGCTGGCGCT GGACCTGGCC GGCCTTGCCG CCACTCTGAT GCTGGCAGGC
TTCGCGGTGC GCCGCTACCT GGTGCGCCCC GCGGGCCTTG AGAGCAACCG CGACGACGCC
ATCATGCACG CCCTTCTCTT CGCCATCCTG ATCACGGGCT TTCTCATCGA GGGGGCGCGC
ATGGCGGTTA CCGAGCTGGT CCACCAGCCA GAGCTCGCCT ACTGGTCCCC GGTGGGGCTA
CTGGTGGCGA ACCTCCTTTC GGGGCTTTCC GCGGAATCTC TCCGTTCGCT GCACGTGGGT
CTTTGGTGGG CGCACTTCGC CCTGGTCGCC GGCTTCATTT GTTCCATCCC CTTCACCAAG
TTCCGCCACA TCTTCACCAC CTCGGCCAAC TACTTCCTGG CGCCGCTGGG CCCCAAGGGG
GCGCTGGTGA CGCTGGACAT GGAAGGGGAC GCGGAGAGCT TCGGCGCCGC CAAGCTCGGG
GATCTCACCT GGAAGGACAT CTTCGACACC GACGCCTGCA CCAAGTGCAA GCGCTGCCAG
GACCGCTGTC CCGCCCACGG CACCGACAAG CCGCTCTCGC CGATGAAGGT CATCGACCAG
GTGGGGGAGA TCGCGCGCGG AAAGGCCGAG GACAACCTGG TCGAGACCCT CGGCAAGGAC
GCCGTCTGGT CCTGCACCAC CTGCCGTGCC TGTCAGGAGA TCTGCCCGGC AGCGGTGGAG
CATGTCAACA AGGTGGTCGA CGTCCGGCGC AACCTGGTCC TCATGGAAGG GGAATTCCCC
GGTGAAGAGG TGACCACCGC CATGGGCAAC GTCGAGGTCA ACGGCAATCC CCTGGGGCTC
GCCTTCGCCT CCCGCGGCGA CTGGGCGAAG GACCTGCCGG TCTCGCTCAT GTCCGAGTGC
GCCGACGTCG ACATACTCTA TTTCGTCGGC TGCTACGCCT CCTTTGACAA AAGGAACATC
AAGGTCGCCA CCGCCTTCGT CAAGCTCTGC GCCGCCGCCG GGATCAAGGT CGGCATCCTC
GGCAAGGAAG AGAAGTGCTG CGGCGAGCCG ATGCGCAAGA TGGGCAACGA GTACCTCTAC
CAGACGCTCG CGGCCGAAAA CATCGCCATG ATCAAGGAGT ACGGGGTGAA GAAGGTGGTC
ACCGCCTGCC CGCACTGCTT CAACACGCTA GCCAAGGATT ACCGGGATCT CGGCTTCGAC
GTCGAGACCG AGCACTACAC CACCTTCCTG AACCGTCTCC TCAAGGAGGG CGCGTTGAAG
CTGGCCCCGG GCACCTTCGA GTGCACCTAC CACGATTCCT GCTACTTGGG GCGCTACAAC
GACACCTACG AGGCTCCCCG CGAGCTGGTT CAGGCGGCCG GAGGCACCAT CCGCGAGATG
GATAGAAGCC AGGCGGAAAG CTTTTGCTGC GGAGCCGGCG GCGGCCGCAT CATGGCCGAG
GAAAAGCTCG GAAGCCGCAT CAGCGTCAAG CGTTCCCAGA TGGCCTCGGC GACCGGCGCA
GGCATGCTCG TTTCCAACTG CCCCTTCTGT CTGACCATGT TCGAGGACGG CATCAAGGGG
GCCGAGCTGG ATGGGCTGCT GGTCCCCAGA GACATCGCGG AGATCCTGGT GGAAAAGGTG
GTCTCGTAG
 
Protein sequence
MEATRELYWN IGHGAAVLVP MYLVTFAAFG LLAYYGYQRL AIYRQGKPLD RTDNRAERVK 
MLLRNVLMQT KVLKVKAPGF AHGFFFWSFA VLFVGTALVA LQADGTDLLF NVRFLTGGFY
KIFSLALDLA GLAATLMLAG FAVRRYLVRP AGLESNRDDA IMHALLFAIL ITGFLIEGAR
MAVTELVHQP ELAYWSPVGL LVANLLSGLS AESLRSLHVG LWWAHFALVA GFICSIPFTK
FRHIFTTSAN YFLAPLGPKG ALVTLDMEGD AESFGAAKLG DLTWKDIFDT DACTKCKRCQ
DRCPAHGTDK PLSPMKVIDQ VGEIARGKAE DNLVETLGKD AVWSCTTCRA CQEICPAAVE
HVNKVVDVRR NLVLMEGEFP GEEVTTAMGN VEVNGNPLGL AFASRGDWAK DLPVSLMSEC
ADVDILYFVG CYASFDKRNI KVATAFVKLC AAAGIKVGIL GKEEKCCGEP MRKMGNEYLY
QTLAAENIAM IKEYGVKKVV TACPHCFNTL AKDYRDLGFD VETEHYTTFL NRLLKEGALK
LAPGTFECTY HDSCYLGRYN DTYEAPRELV QAAGGTIREM DRSQAESFCC GAGGGRIMAE
EKLGSRISVK RSQMASATGA GMLVSNCPFC LTMFEDGIKG AELDGLLVPR DIAEILVEKV
VS