Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2785 |
Symbol | |
ID | 8138128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3234101 |
End bp | 3236071 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870388 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003022577 |
Protein GI | 253701388 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 89 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCA ATCAAATGTT CTTCATGCCG CTTTTCGTCA TCGCCCTCGT GGCCTTTTGC TTCAGCTGCT ATCAGCGCCT GCAACTGGTT GCCGTCGGCA CCCCCGAGGA CCGCTTCGAC AGGCCCGGCG AGCGGCTGGC CGGCATGTTC CGGTACGCCT TCGGTCAAGA GAGGGTCCTC GCCAGACCCT ACGGCCTGAA CCACTTCGCG CTCTTCTGGG CCTTTATGCT GCTCCTGGTC GCGAACGTCT CCTTCCTGGC CGAGGGGCTC TTTCCGGGGT TCACCCTCTC CATTCTCCCG GCCCCTCTGC ACCACGCCCT GGCGCTTTCC TTCGACCTGG TCTCGGTGGT AGCGCTCGTC AGCGTCGCGG TAGCGCTCGC GCGCCGGCTC TTCTTCGCGC CGTCTTACCT CGGTAACGAT TACACCAAGG CCTGCAGCGG AGAGGCGCTC CTGATCCTGG CCCTGATCGC CACCCTTATG GTCGCCTTCT TCCTGCTGAA CGCAGCCCAG ATCGCTCTCG GCGCCGACCA GGCGTTGAGG CCCGTCTCCG GTGCGCTAGC CACGCTTTTG CAGGGGATGC CGCAAGCGTC GCTCGAAGGT ATTGCTTCCG TCTCCTGGTG GGTGCATGCC GTGGTGCTCC TTCTTTTCAT CAACCTTTTG CCCCGCAGCA AGCACATGCA CATACTCACC GCCATTCCCA ACTGCTACTT CCGCAACCTG GAGAAGCCCA ACGTGCAGCC GCGCGAGAGC TTCGAGCTCG GCAAGCGTTT CGGCGTGAGC GAGGTGGCGC AGTTCTCCTG GAAGGATCTC CTCGATTCCT TCTCCTGCAC CGAATGCGGG CGCTGTCAGG ACCTCTGCCC GGCCCACAAC ACCGGAAAGC CCCTGAACCC GCGCCGGATC ATCCACGACA TCAAGGTGAA CCTCTTGGAG AACGGCGTCG CCAACGCCGG GAAGGAGCAG CTCCCGCTCA TCGGCGAGAA AGGAGAGGGG ACGAGCTGCG AGGACGCCCT TTGGTCCTGC ACCACCTGCG GCGCCTGCCT GTCGGTCTGC CCGGTCCTCA TCGAGCACAT GCCCAAGATC GTCAAGATGC GCCGCCACCT GGTCCAGGAA AAGGCCCGGT TCCCCGAGGA GCTTTTGAAC CTCTTCGAGA ACATGGAGCA GCGTTCCAAC CCCTGGGGCA TCGCCCCCTC CGAGCGCGGC AAGTGGGCGA ACCTCCTGGG GGACAGGGAG TTCACAGCAG GCAAGACCGA ATACCTCTTC TTCGTAGGGT GCGCCGGTTC CTTCGACAGC CGCGCCAAGC AGACTACCGT GGCTCTCGCC ACCGTCCTCG ACAAGGCCGG CGTCACCTGG GGCATCCTCG GCAGGGACGA GCTCTGCTGC GGCGACAGCG TGAGGCGCCT GGGGAACGAA TTCGTCTTCG ATAAGATGGC GCGGGAGAAC GTGGCCAAGT TCAAGGAGAA AGGGGTCACC AAGATCGTCA CCCAGTGCCC GCACTGCTTC AGCACGCTCA AGAACGACTA CCGGCAGTAC GGCCTGGAGC TGGAGGTGCT GCACCACAGC GAGCTGATCG CCGGCCTGGT GCAGGAGGGG AAACTGAGCA CCGCCAAAGG GGTCAACCTG GGCAAGACCG TCTTCCACGA CTCCTGCTAC CTGGGGCGCC ACAACGACAC GTACGCAGCA CCCCGCCAGG TGATCGAGGC CGCGACCGGT GTCGCTCCCG GTGAGTTCGA GCGCCGGAAA GAGAACGGAT TCTGCTGCGG AGCAGGCGGC GGGCGCATGT GGATGGAAGA GCAGATCGGC ACGAGGATCA ACCACGACCG TGTCAACGAG GCCCTGAAGC AGCAGCCCGA CACCATCTGC GTCAGCTGTC CCTACTGCAT GACCATGCTG GAGGACGGAC TTAAGGACCA GGGCGCGGAA AAGGTGAGGG TGAAGGATAT AGCAGAGGTA ATGGCCGAGG CAATCAACTA G
|
Protein sequence | MPTNQMFFMP LFVIALVAFC FSCYQRLQLV AVGTPEDRFD RPGERLAGMF RYAFGQERVL ARPYGLNHFA LFWAFMLLLV ANVSFLAEGL FPGFTLSILP APLHHALALS FDLVSVVALV SVAVALARRL FFAPSYLGND YTKACSGEAL LILALIATLM VAFFLLNAAQ IALGADQALR PVSGALATLL QGMPQASLEG IASVSWWVHA VVLLLFINLL PRSKHMHILT AIPNCYFRNL EKPNVQPRES FELGKRFGVS EVAQFSWKDL LDSFSCTECG RCQDLCPAHN TGKPLNPRRI IHDIKVNLLE NGVANAGKEQ LPLIGEKGEG TSCEDALWSC TTCGACLSVC PVLIEHMPKI VKMRRHLVQE KARFPEELLN LFENMEQRSN PWGIAPSERG KWANLLGDRE FTAGKTEYLF FVGCAGSFDS RAKQTTVALA TVLDKAGVTW GILGRDELCC GDSVRRLGNE FVFDKMAREN VAKFKEKGVT KIVTQCPHCF STLKNDYRQY GLELEVLHHS ELIAGLVQEG KLSTAKGVNL GKTVFHDSCY LGRHNDTYAA PRQVIEAATG VAPGEFERRK ENGFCCGAGG GRMWMEEQIG TRINHDRVNE ALKQQPDTIC VSCPYCMTML EDGLKDQGAE KVRVKDIAEV MAEAIN
|
| |