Gene Information |
Locus tag | GM21_2380 |
Symbol | |
ID | 8137721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2777367 |
End bp | 2779115 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644869995 |
Product | CHAP domain containing protein |
Protein accession | YP_003022186 |
Protein GI | 253700997 |
COG category | [R] General function prediction only |
COG ID | [COG3942] Surface antigen |
TIGRFAM ID | |
|
|
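The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record above (GM21_2380).
length = gene_length(2777367, 2779115)
print(length, expected_protein_length(length))
```

Under these assumptions the record is internally consistent: 1749 bp corresponds to 583 codons, one of which is the stop codon, leaving 582 amino acids.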
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 88 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGT TCATACTATG GTTAACTCTC ATGGTTGCAG TCCCTTCACT GGCGCTCGCA ACAAAAATTA AAGATATTGA TATATCTCCA AAGCATCCTA ATGTCGGCGA TAATGTCAAA ATCTCTGTGA AAACAAGTGG TGATGTTGCC AGGGCGGTTA TTGAATTCCC AGATTTGGGA ATCCAGAAAG ATCTTGGGGT CATTGGGAAG AACACTTGGG GGATATTGAT TCCATTGAAC TATTCGGGCG AAAAAAAATT CAAAGTTGTT TTGTATCAGA AAAAAAAGGA CAAAAATCCT AAGGACACAG AAAAAGACAA AATTATCGTG TCCGCAAAAA ATGAGGACCG AAAGCCCAAG CCAAGTGCTG AACCCAAGCC TGGAGCTGAA CCCAAGCCTG GCCCTGAACC CGACAAGCAA AGACCGGTAG TTGCAGGTTT TAATATAAAT AAGGAGTCTT CGAAGGTAGG AGAAGTAATC ACTATATCTT ATGTGGTGAG CGACTCAGGT GGTTCTGGGC TAAAGCAGGT TGAACTGTGG CGTTCCCCGA ATAATAGTGA TTGGGCAGAG ATCAAAGAGA AGAGACGAAA CCTTTCAGGC GGAAGCGCCT CAGGTTCATT TACTGATACA CCCTCGGCAC CTGGTACCTA CTATTATGGG ATTCACGTAG TAGACAATAA TAATAACTGG AGTGGCGAGA GTGGTCCGAA GCGATTAATG GTTGAACAGG CGGCCAGGCA GCAGCATGGT TCAGTGTCTG GGAGAATTCA TAAGAACTTC GCTTCAGGCC CGGTGCTTTC CGGGGTCTCT GTGAATTGCG GAGGAAAGAG CAGTATAACG GACGGAAACG GATATTTCAA GATCGATGGC ATAACGGCGG GCAACCATGG GATATCATAC TCAAAGTCCG GTTATGACGG ATTTAATAGT CAGATTGCTG TCAAGGCTGG GTCGAATATT AGCGAAGGTG AACGGTGGTT GACTGAAAAA AGCCCTGCCA CCCAGTTGCC GAATGTGACC GGCGTCAACG TCAACCCCGC CAGCATTACG GCAGGTCAGG CAGCATTATT CACGGCGACT ACCGATAGTG CAGCTTCAAT GGTAACTCTG CGCTTCACCG ATGCTGGCAT CGATGTGAAG ATGTCCGGCA ATGGCACGAC TTGGAAGGCT TATCCGCAAA TCCACAATGC CGGCAATAGA CCTTTCACAG TTACTGCATT TGACAAAAAT AATAAGTATG GCAGCGGAAG AAAAGGCACG ATCCAAGTTA ACAAGGTGAA GGAGCCTGTT GCCTCCAATT TGCGTCCTGA TTTCACTTTG CCAGCTTATC GAGAGAATAA TCCTTTTTGG AATAGCGGCT ATGCCCCGAA AGAAGTTGCT CCCCCCAAAC CTAAACTCGA TAATGCCAAA GGAAATTGTA CATGGTATGC AAATGGCCGC CTACGTGAAC TTGGTTATAA TGTGCCAAAT AATAGTTTTA CACATCATGC AAAAACATGG GTTAGCGATG CGAAAAATAA TCATTTTGTC GTAGACCAAA CACCTCAAGT TGGTTCAATT GCGCAATCAG ATACTATGAA CAAAAATTAT GGACATGTTG CTGTAGTAGA GGTCGTGCAT AATGATGGAA CTATAGTGAT TTCTGAATCG AGTTACGCGC CAGGCACTAA AGATTGGGAT TTTTTGTATA ACACTAGGAC TATCCATAAA TCCATATTTA CGAGCTATAT TCATGTGCCT CGAAAATAG
|
Protein sequence | MKAFILWLTL MVAVPSLALA TKIKDIDISP KHPNVGDNVK ISVKTSGDVA RAVIEFPDLG IQKDLGVIGK NTWGILIPLN YSGEKKFKVV LYQKKKDKNP KDTEKDKIIV SAKNEDRKPK PSAEPKPGAE PKPGPEPDKQ RPVVAGFNIN KESSKVGEVI TISYVVSDSG GSGLKQVELW RSPNNSDWAE IKEKRRNLSG GSASGSFTDT PSAPGTYYYG IHVVDNNNNW SGESGPKRLM VEQAARQQHG SVSGRIHKNF ASGPVLSGVS VNCGGKSSIT DGNGYFKIDG ITAGNHGISY SKSGYDGFNS QIAVKAGSNI SEGERWLTEK SPATQLPNVT GVNVNPASIT AGQAALFTAT TDSAASMVTL RFTDAGIDVK MSGNGTTWKA YPQIHNAGNR PFTVTAFDKN NKYGSGRKGT IQVNKVKEPV ASNLRPDFTL PAYRENNPFW NSGYAPKEVA PPKPKLDNAK GNCTWYANGR LRELGYNVPN NSFTHHAKTW VSDAKNNHFV VDQTPQVGSI AQSDTMNKNY GHVAVVEVVH NDGTIVISES SYAPGTKDWD FLYNTRTIHK SIFTSYIHVP RK
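The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the listed protein. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not handle). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
head = clean("ATGAAAGCGT TCATACTATG GTTAACTCTC")
print(translate(head))  # first ten residues of the protein
```

Passing the full gene sequence through clean and translate, and the full protein sequence through clean, gives two strings that can be compared directly; gc_percent on the cleaned gene sequence can be compared with the GC content field.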