Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1298 |
Symbol | |
ID | 8136625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1522297 |
End bp | 1523532 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868912 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003021116 |
Protein GI | 253699927 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000000000000177669 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGA GCACGCCGGT CAGATCCCCA CTCCAGAGTC TTTGCCAAAC GAACATCGAG GGGTGCACCA ACTGCGGGAA GTGCGTCCGC GAGTGCGCCT TCCTGCGCAA ATACGGCACC CCCAAGAAGA TCGCCGCGGA GTTCGACCCG GCCGACTCCA TGTCCTTGCA CCGCGCCTTC GAGTGCAACC TCTGCGGGCT CTGTTCCGCG GTCTGCCCGG AGAAGCTCGA CGTGGACGGC ATGTTCCTGG AGATGCGGCG GGAAGCGGTG GACCGCGACT TGGGCGCCTA CCCGGAACAC AAGCCCCTGC TCAATTACGA GAAGGTCGGG ACCTCGCGGC GGTTCAGCCT CTACCGGCTC CCCGAGGGAT GCAGGACCAT CTTTTTCCCC GGCTGCTCGC TCCCGGGAAC GCGCCCGGAC GCGGTGCACA ACCTCTTGGC GCTCATGCAC CAGGCCGACC CGACTGTGGG GGTGGTGTTC GACTGCTGCC TCAAGCCCTC CTATTCGCTG GGGCGCGAGC AGTACGTGAA TTCGATGTTC GAGGAGATGA ACGACTGGCT CCTGCGGCAC GGGGTGCGGG AGGTGCTGGT TGCCTGCCCC AACTGCCAGG TGATGTTCGA GCGCCTGGGG CACGGGATGC GGGTGCGCAC GGTATGGGAG GCCTTGGCCG AGGCTGGGCT TCAGCCGGAA CGGGCGGCGG GGACGGTCAC GGTGCACGAC CCTTGCGTCA TCCGCAACTC TGAGCCGGTG CACCAGGCGG TGCGCACCCT TTTGGAGCGG CAGGGACTGG TGGTCGAAGA GATGAAGCAT GCGGGGAAGA AGACGGTCTG CTGCGGCAAG GGGGGCGGGG TGAACCTTTT GAACCCATCG TTGGCGGGGG AGTGGGGGGA GCTGCGCAAA AAGGAGGCCG CCGGCAGGAG GGTGATCACC TACTGCGCCG GGTGCGTCCA GGCGCTGGAA CAGCACACCC CGACCAACCA CCTGGTGGAC CTGCTCTTCG CGCCGGCACA GACCCTGGCG GGCAAGAAGA AGGGGGCCAA AGCCCCCATT ACTTACCTGA ACCGGCTGCG TCTCAAGATG TCGTTCAAAA AGAAGAAGGG GAATGCGGTG TTGAGGGAGC GGAGCTTCGT CGCGCAGCAG GCACTGCGGA AAAAACGCAG GTGGAAGATC CCTTTCACGC AGATCCTTTG CGGGATAGCC GCGGCCGCAG CCGGGATGCA TTTGCTATCC CTCTGA
|
Protein sequence | MKQSTPVRSP LQSLCQTNIE GCTNCGKCVR ECAFLRKYGT PKKIAAEFDP ADSMSLHRAF ECNLCGLCSA VCPEKLDVDG MFLEMRREAV DRDLGAYPEH KPLLNYEKVG TSRRFSLYRL PEGCRTIFFP GCSLPGTRPD AVHNLLALMH QADPTVGVVF DCCLKPSYSL GREQYVNSMF EEMNDWLLRH GVREVLVACP NCQVMFERLG HGMRVRTVWE ALAEAGLQPE RAAGTVTVHD PCVIRNSEPV HQAVRTLLER QGLVVEEMKH AGKKTVCCGK GGGVNLLNPS LAGEWGELRK KEAAGRRVIT YCAGCVQALE QHTPTNHLVD LLFAPAQTLA GKKKGAKAPI TYLNRLRLKM SFKKKKGNAV LRERSFVAQQ ALRKKRRWKI PFTQILCGIA AAAAGMHLLS L
|
| |