Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2131 |
Symbol | |
ID | 8137467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2487249 |
End bp | 2489240 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869746 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003021941 |
Protein GI | 253700752 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.000000140494 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACATA CCTTTTTCGT TGTCCTCTTG TCCCTTTCCA TGGCCGTTTT CGGCTTCAGC TGTTACCGCC GGCTGGCGTT GGTGGCGATC GGGAAGTACG AATATCGCTT CGACCAGCCG ACGGCCCGCC TCAAGGAGAT GCTCGTCTAC GCCCTCGGGC AGAAAAGGGT GGTGAGCCGC CCCTTCGGCC TCAACCACGG GGTCATCTTC TGGGCCTTCC TGGTGCTAGC CGTGGCGAAC CTGGAATTCC TGGTATCCGG GATCCTGCCG GGCCTCTCCT TCGCACTTTT GCCCGAACCT TTGCACGGCG CTCTGGCGTT TGTCTTCGAT ATCTGCTCGC TGGCCGTACT TATCGCGGTG GCTGTCGCCG CCGTGCGCCG CACAGTAAAG CCGCCGTTTG CCGGCGCGCG TACCTTCGAG GCCTACTTCA TCCTCTCCAT GATCGCCATC CTGATGACGG CTTACTTCGG GATGAACGGA GCCCTTATCG CTTCCGGCGC GCGCGCGGCC ACCTCCCTCA CTCCTGTATC CAACTTCTGC GCGAGCCTGC TCTCTTCGAC ACCGGCCACT GACCATCTGG ATCTCGCCGC GAAGGTGTTT TGGTGGATCC ACGCCGTGGT GTTGCTGGGG TTCATGAACT TCCTCCCCTA CAGCAAGCAC ATGCACATCC TCACCGCCAT CCCCAACGTC TTCTTGCGCA CCATGGGCAA GAGCAACACC CAGCCCCGCG AGGAGTTCAG CGAGGGGAAG AGCTTCGGAG CCGCAACGGT GGACAAGCTC ACCTGGAAGG ACCTGCTCGA TTCCTTCTCC TGTACCGAAT GCGGCCGATG CCAGGACTCC TGCCCGGCAG CCGCCACCGG GAAGGCGCTC AACCCGCGGC AGGTGATCCA CGCCATCAAG GAAAACCTGC TGCAAAACGG GATCGTGCTG GAGAAGCTTC ACGGGGACGC TGAGGCCGAG CGTTGCGTCT CGCTGATAGG AGAGGGGAAG AAAGGAACGA CTCAGGAATC CGCCCTTTGG AGCTGCACTT CCTGCGGCGC CTGCATGGAG GCATGCCCCG TTTTCATCGA GCACCTCCCG AAGATCGTCA AGATGCGGCG GCACCTGGTG CAGATGGAGG CGAAGTTTCC GGAGGAGCTT TTGAACCTCT TCGAGAATAT GGAGCAGCGC TCCAACCCTT GGGGGATGGC CCCGTCGGAG CGGAGCAAGT GGTGTTCCCA ACTGGATCTG CGCCCCTTTC AGGCTGGAGA AACCGAATAC CTCCTCTTCG TCGGCTGCTC GGGCGCCTTT GCTGCACGGA TCAAGCAGGT GAGCGTCGCC CTGACCCGGG TGCTGGACGC TGCGGGCGTC TCCTATGGCG TCCTGGGCAA GGACGAAAAA TGCTGCGGCG AGAGCGTGCG CCGGCTGGGC AACGAGTACC TCTTCGACCT GATGGCGCGG GAAAGCGTGG CGCAGTTCCG GGAAAAGGGA GTCGTCAAGG TCATCACCCA GTGTCCTCAC TGCTACAACA CGCTCAAGAA CGATTTCCGG CAGTTCGGGC TCGAACTGGA GGTGCTGCAT CACAGCGAGC TGATCGCGAC ACTTATGGCT TCGGGAAAGC TGCGGATGGA GGGGAAGATG GGGGATCTGG GCAATCTGGT CCTGCACGAC TCCTGTTATC TGGGGCGTCA TAACGACGTG TACCAGCCGC CGCGCTCGGT GATACAGGCG GTAACCGGTA CGGCGGCCGG CGAGATGGGG CGCAGCCTCG ACAAAGCCTT TTGCTGCGGT GCCGGCGGCG GGCGGATGTG GCTCGAGGAG CACGAGGGAA CGCCGATCAA CCGGGCGCGA GTGGCGGAGG CGCTGGCGTT GAAGCCGGAC ACCATCTGCG TAAGCTGCCC TTTCTGCATG ACCATGTTCG AAGACGGGGT GAAGGAGGTG CCTGGCGCCG GGGTTCAGGT GAAGGACTTG GCCGAGGTGG TGGCCCTGGC CCTCCCCATG AGGTCAACGT AA
|
Protein sequence | MEHTFFVVLL SLSMAVFGFS CYRRLALVAI GKYEYRFDQP TARLKEMLVY ALGQKRVVSR PFGLNHGVIF WAFLVLAVAN LEFLVSGILP GLSFALLPEP LHGALAFVFD ICSLAVLIAV AVAAVRRTVK PPFAGARTFE AYFILSMIAI LMTAYFGMNG ALIASGARAA TSLTPVSNFC ASLLSSTPAT DHLDLAAKVF WWIHAVVLLG FMNFLPYSKH MHILTAIPNV FLRTMGKSNT QPREEFSEGK SFGAATVDKL TWKDLLDSFS CTECGRCQDS CPAAATGKAL NPRQVIHAIK ENLLQNGIVL EKLHGDAEAE RCVSLIGEGK KGTTQESALW SCTSCGACME ACPVFIEHLP KIVKMRRHLV QMEAKFPEEL LNLFENMEQR SNPWGMAPSE RSKWCSQLDL RPFQAGETEY LLFVGCSGAF AARIKQVSVA LTRVLDAAGV SYGVLGKDEK CCGESVRRLG NEYLFDLMAR ESVAQFREKG VVKVITQCPH CYNTLKNDFR QFGLELEVLH HSELIATLMA SGKLRMEGKM GDLGNLVLHD SCYLGRHNDV YQPPRSVIQA VTGTAAGEMG RSLDKAFCCG AGGGRMWLEE HEGTPINRAR VAEALALKPD TICVSCPFCM TMFEDGVKEV PGAGVQVKDL AEVVALALPM RST
|
| |