Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2791 |
Symbol | |
ID | 8138134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3242032 |
End bp | 3244020 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870394 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003022583 |
Protein GI | 253701394 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.589243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCTA CGCGCGAACT TTATTGGAAC ATAGGCCATG GGGCGGCGGT GCTGGTACCG ATGTACCTGG TCACCTTCGC CGCCTTTGGG CTCCTCGCCT ACTACGGCTA CCAGCGGCTC GCCATCTACC GGCAGGGAAA GCCCTTGGAC AGGACCGACA ACCGCGCCGA GCGCGTCAAG ATGCTCCTGC GAAACGTCCT GATGCAGACC AAGGTGCTCA AGGTCAAGGC CCCCGGCTTC GCCCACGGCT TCTTCTTCTG GTCCTTCGCC GTCCTCTTCG TCGGGACGGC ACTCGTGGCG CTGCAGGCCG ACGGCACCGA TCTCCTCTTC AACGTCCGCT TCCTCACCGG CGGCTTCTAC AAGATCTTCT CGCTGGCGCT GGACCTGGCC GGCCTTGCCG CCACTCTGAT GCTGGCAGGC TTCGCGGTGC GCCGCTACCT GGTGCGCCCC GCGGGCCTTG AGAGCAACCG CGACGACGCC ATCATGCACG CCCTTCTCTT CGCCATCCTG ATCACGGGCT TTCTCATCGA GGGGGCGCGC ATGGCGGTTA CCGAGCTGGT CCACCAGCCA GAGCTCGCCT ACTGGTCCCC GGTGGGGCTA CTGGTGGCGA ACCTCCTTTC GGGGCTTTCC GCGGAATCTC TCCGTTCGCT GCACGTGGGT CTTTGGTGGG CGCACTTCGC CCTGGTCGCC GGCTTCATTT GTTCCATCCC CTTCACCAAG TTCCGCCACA TCTTCACCAC CTCGGCCAAC TACTTCCTGG CGCCGCTGGG CCCCAAGGGG GCGCTGGTGA CGCTGGACAT GGAAGGGGAC GCGGAGAGCT TCGGCGCCGC CAAGCTCGGG GATCTCACCT GGAAGGACAT CTTCGACACC GACGCCTGCA CCAAGTGCAA GCGCTGCCAG GACCGCTGTC CCGCCCACGG CACCGACAAG CCGCTCTCGC CGATGAAGGT CATCGACCAG GTGGGGGAGA TCGCGCGCGG AAAGGCCGAG GACAACCTGG TCGAGACCCT CGGCAAGGAC GCCGTCTGGT CCTGCACCAC CTGCCGTGCC TGTCAGGAGA TCTGCCCGGC AGCGGTGGAG CATGTCAACA AGGTGGTCGA CGTCCGGCGC AACCTGGTCC TCATGGAAGG GGAATTCCCC GGTGAAGAGG TGACCACCGC CATGGGCAAC GTCGAGGTCA ACGGCAATCC CCTGGGGCTC GCCTTCGCCT CCCGCGGCGA CTGGGCGAAG GACCTGCCGG TCTCGCTCAT GTCCGAGTGC GCCGACGTCG ACATACTCTA TTTCGTCGGC TGCTACGCCT CCTTTGACAA AAGGAACATC AAGGTCGCCA CCGCCTTCGT CAAGCTCTGC GCCGCCGCCG GGATCAAGGT CGGCATCCTC GGCAAGGAAG AGAAGTGCTG CGGCGAGCCG ATGCGCAAGA TGGGCAACGA GTACCTCTAC CAGACGCTCG CGGCCGAAAA CATCGCCATG ATCAAGGAGT ACGGGGTGAA GAAGGTGGTC ACCGCCTGCC CGCACTGCTT CAACACGCTA GCCAAGGATT ACCGGGATCT CGGCTTCGAC GTCGAGACCG AGCACTACAC CACCTTCCTG AACCGTCTCC TCAAGGAGGG CGCGTTGAAG CTGGCCCCGG GCACCTTCGA GTGCACCTAC CACGATTCCT GCTACTTGGG GCGCTACAAC GACACCTACG AGGCTCCCCG CGAGCTGGTT CAGGCGGCCG GAGGCACCAT CCGCGAGATG GATAGAAGCC AGGCGGAAAG CTTTTGCTGC GGAGCCGGCG GCGGCCGCAT CATGGCCGAG GAAAAGCTCG GAAGCCGCAT CAGCGTCAAG CGTTCCCAGA TGGCCTCGGC GACCGGCGCA GGCATGCTCG TTTCCAACTG CCCCTTCTGT CTGACCATGT TCGAGGACGG CATCAAGGGG GCCGAGCTGG ATGGGCTGCT GGTCCCCAGA GACATCGCGG AGATCCTGGT GGAAAAGGTG GTCTCGTAG
|
Protein sequence | MEATRELYWN IGHGAAVLVP MYLVTFAAFG LLAYYGYQRL AIYRQGKPLD RTDNRAERVK MLLRNVLMQT KVLKVKAPGF AHGFFFWSFA VLFVGTALVA LQADGTDLLF NVRFLTGGFY KIFSLALDLA GLAATLMLAG FAVRRYLVRP AGLESNRDDA IMHALLFAIL ITGFLIEGAR MAVTELVHQP ELAYWSPVGL LVANLLSGLS AESLRSLHVG LWWAHFALVA GFICSIPFTK FRHIFTTSAN YFLAPLGPKG ALVTLDMEGD AESFGAAKLG DLTWKDIFDT DACTKCKRCQ DRCPAHGTDK PLSPMKVIDQ VGEIARGKAE DNLVETLGKD AVWSCTTCRA CQEICPAAVE HVNKVVDVRR NLVLMEGEFP GEEVTTAMGN VEVNGNPLGL AFASRGDWAK DLPVSLMSEC ADVDILYFVG CYASFDKRNI KVATAFVKLC AAAGIKVGIL GKEEKCCGEP MRKMGNEYLY QTLAAENIAM IKEYGVKKVV TACPHCFNTL AKDYRDLGFD VETEHYTTFL NRLLKEGALK LAPGTFECTY HDSCYLGRYN DTYEAPRELV QAAGGTIREM DRSQAESFCC GAGGGRIMAE EKLGSRISVK RSQMASATGA GMLVSNCPFC LTMFEDGIKG AELDGLLVPR DIAEILVEKV VS
|
| |