Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2925 |
Symbol | |
ID | 8138268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3402883 |
End bp | 3404859 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870523 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003022712 |
Protein GI | 253701523 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTG ATCCTGTTAT CTTCACGCCG CTTCTGGCTG CGGCCTGCAC CATCTTTCTT TGGAGCTGCT ACCGCCGTTT CAGCCTGGTC ACCGTTGGCG CCGCCGAAGA CCGCTTCTCC AACGCCATGG ACCGGCTTCA GGCGATGTTC GTCTACGCGT TTTTGCAAAA GAGGGTGGTG AAGCGTCCCT TCGGCGTCAA CCACGTCTTC ATCTTCTGGT CCTTCCTGAT CCTGGCCGTG GCCAACACGG AATTCCTCGT CTCCGGCGTT TTCCCTGCGG CGAAGCTTTC CCTCGTGCTC CCGGCGGCAC TCTACCAGCC GCTTCTGGCG GCGTTCGACC TGGTCTCGCT CTTCGCCCTC GCCGCCGTCG TCATCGCCCT CGTCAGGCGC GCGGTCGCTC CGCCGTACAA AGGGGCGCGC ACCGGCGAGG CGTTCGCCAT CCTCTCCATG ATCGGGGCGC TGATGCTGGC CTATTTCGGC TTGCATGCGG CGGAGATCGG TCTGGGGCAC GAGCCCGCTG CTCACGCCAT GCCGCTTTCC TCGGCGCTCG CCGCCGCTCT CTCCGGCCTG AGCCCGGAAA ACCTCGAAAC CTTCGGGACG GTCTCCTGGT GGCTCCACGC CGGGGTGCTT CTTTTCTTCC TCAACTACCT CCCCTACAGC AAGCACATGC ACATCCTCGC CGCCATCCCC AACTGCTACT TCAAGGGGCT TTCGGCCCCC AATACCCAGC AGCGCGAGGA GTTCGCCGAA GGAAACGTCT ACGGCGCAGG GAGCATCGAG AACTTCACCT GGAAGGACCT CTTCGATTCG TTCTCCTGCA CCGAGTGCGG GCGCTGCGAA AAGGCCTGCC CCGCTGCCGC TACCGGAAAG CCGCTCAACC CGCGCCTTGT AATGCACGAC ATCAAGGTGA ACCTCCTGGC CAACGGCACG CAGCTGCAGC GCGGCGGCGT GGCGACGGTC CCGGTGATAG GGGAAGGGGA GGGGAGCGTG GAAGCGGACG CCCTCTGGTC CTGCACCACC TGCGGCGCCT GCTTGAACGC CTGCCCGGTC TTCATTGAGC AGATGCCAAA GATCAACAAG ATGCGCCGGC ACCTGGTCCA GATGGAGGCG GATTTCCCCG AGGAACTTTT GAACCTCTTC GAGAACATGG AACAGCGCTC CAACCCCTGG GGGATAGCCC CCGGCGACCG CGGCAAATGG GTCGGCGGCC AGGAAGTAAA ACCGTTCGAG GCCGGCAAGA CCGAGTACCT CTTCTACGTC GGCTGCGCCG GCTCCTTCGA TTCCCGCGCC AAGCAGGTCA CCCTTTCCGT GGCGCGAATC CTCGACGCGG CGGGCGTTTC CTGGGGGATC CTCGGCAAGG AAGAGAAGTG CTGCGGCGAC AGCGTGCGCA GGCTGGGCAA CGAATATGTC TTCGACAAGA TGGCGCGCGA GAACGTGGCG ATGTTCCAGG AAAAGAAGGT CACCAAGGTG ATCACCCAGT GCCCGCACTG CTACAGCACC CTCAAAAACG ATTACCGGCA GTACGGCCTG GAACTGGAGG TCATTCCGCA CGCGGAATTG ATCGAGCAGT TGCTGGGGGA AGGGAAGTTG CAACTCGACA TGCACGAGGC CAAGGGGGGG AACATCGTGT TCCACGACTC CTGCTACCTC GGACGTCATA ACGGCATCTA CGACGCGCCG CGCAAGGTAA TCGCGCAGGC GACCGGCACA CTTCCAGCGG AAATGCCGAG AAACCGCGAG AACTCTTTCT GCTGCGGCGC CGGCGGCGGA CGCATGTGGC TCGAAGAGCA CCTGGGGGAG AGGATCAACC TGAACCGGGT CAACGAGGCG CTGGCGGGTT CCCCCGGCAC CATCTGCGTC ACCTGCCCGT ACTGCATGAC CATGATGGAA GACGGCCTCA AGGACCGCGC CAGCGGCGAA ACCAAGGTGA AGGACATCGC GGAAATCGTC GCAGAGGGAT TGAAAGGGAG GGCTTAG
|
Protein sequence | MQPDPVIFTP LLAAACTIFL WSCYRRFSLV TVGAAEDRFS NAMDRLQAMF VYAFLQKRVV KRPFGVNHVF IFWSFLILAV ANTEFLVSGV FPAAKLSLVL PAALYQPLLA AFDLVSLFAL AAVVIALVRR AVAPPYKGAR TGEAFAILSM IGALMLAYFG LHAAEIGLGH EPAAHAMPLS SALAAALSGL SPENLETFGT VSWWLHAGVL LFFLNYLPYS KHMHILAAIP NCYFKGLSAP NTQQREEFAE GNVYGAGSIE NFTWKDLFDS FSCTECGRCE KACPAAATGK PLNPRLVMHD IKVNLLANGT QLQRGGVATV PVIGEGEGSV EADALWSCTT CGACLNACPV FIEQMPKINK MRRHLVQMEA DFPEELLNLF ENMEQRSNPW GIAPGDRGKW VGGQEVKPFE AGKTEYLFYV GCAGSFDSRA KQVTLSVARI LDAAGVSWGI LGKEEKCCGD SVRRLGNEYV FDKMARENVA MFQEKKVTKV ITQCPHCYST LKNDYRQYGL ELEVIPHAEL IEQLLGEGKL QLDMHEAKGG NIVFHDSCYL GRHNGIYDAP RKVIAQATGT LPAEMPRNRE NSFCCGAGGG RMWLEEHLGE RINLNRVNEA LAGSPGTICV TCPYCMTMME DGLKDRASGE TKVKDIAEIV AEGLKGRA
|
| |