Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2026 |
Symbol | |
ID | 8137362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2349930 |
End bp | 2351159 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869641 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003021836 |
Protein GI | 253700647 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000000000157118 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTCCC TCTACCAGAG TTTCCTCATC GCGGTGCGCG CGCTTAGGGT GAACAAGATG CGGGCGCTGT TAACCATGCT GGGGATCATC ATCGGCATCG CCGCGGTCAT CGCCATGGTC GCCATCGGCG CGGGTGCCAG CAAGATGATC TCCGACCAGA TCTCCAGCAT CGGGTCGAAC CTGCTCCTGG TGCTTCCCGG CTCCACCACC AGCGGGGGGT TGCGCTCCGG CGCCGGGTCC CACCAGACGC TCACCTACGA CGACGCCATG GCCATCAAGG CGGAATGCCC GTCGGTGGGG GCTGTGGCGC CGCAGGTGCG CGGCTCGGGG CAGGTGGTTT ACGGCAACCA GAACTGGTCC ACCGTCGTCT ACGGCGCCAC GCCGGACGTG ATCCAGGTGC GCGACTGGAC CATCGTGGCC GGGCGCAACA TCACCCAGTC CGACGTCGAC GGCGCCACCA AGAACTGCCT GATCGGGCAG ACCGTCGCCG ACAACCTCTT CGGCGCGGCC GATCCCATCG GGAAGATCAT CAGGATCAAG AAGATACCCT TCACCGTGGT AGGGCTTTTG GGCGAAAAGG GGCAGTCCCC CCAGGGGCAG GACCAGGACG ACGTCATCTA CGTGCCGCTT CGGACGGCGC AGCGAAAGCT TCTGGGGAGT CAGTTCCCGA ACGTGGTCGG CTCCATCATG GTGCAGGCCA AAAGCGGCGA GGTGCTGGAC CAGGCGGAGG AGGAGGTGAC GGCGCTTTTG AACCAGAGGC ACCGCATCGG CCCCAGCCGC GAGGTCGACT TCACCATCAG GAACCTCTCC GAACTCCTGG CGGTCACCGC CCAGTCCTCG AAGGTGATGT CGATCCTCCT GGGGGCGGTC GCCTCCATCT CGCTGGTGGT TGGCGGGATC GGCATCATGA ACATCATGCT CGTCTCGGTC ACCGAGAGGA CCCGCGAGAT CGGGATCAGG ATCGCCATCG GCGCCAAGAG GCGCGACATA CTGCTGCAGT TTCTCACCGA GGCGGTGCTC CTCACCACCT GCGGCGGCAT CATCGGCATG CTGCTAGGCG TTGCGGGGGC GCGGCTGGTC GCCTCGCTGG TGGGGTGGCC CACGCTGGTA TCGGTGAACA CCATCGTCGT CGCCTTTGCC TTTTCCGCAG GTGTCGGGGT CTTCTTCGGG TTCTATCCGG CCCGCAAGGC CTCCTCTTTG AACCCAATAG AAGCGCTGAG ATACGAATAA
|
Protein sequence | MSSLYQSFLI AVRALRVNKM RALLTMLGII IGIAAVIAMV AIGAGASKMI SDQISSIGSN LLLVLPGSTT SGGLRSGAGS HQTLTYDDAM AIKAECPSVG AVAPQVRGSG QVVYGNQNWS TVVYGATPDV IQVRDWTIVA GRNITQSDVD GATKNCLIGQ TVADNLFGAA DPIGKIIRIK KIPFTVVGLL GEKGQSPQGQ DQDDVIYVPL RTAQRKLLGS QFPNVVGSIM VQAKSGEVLD QAEEEVTALL NQRHRIGPSR EVDFTIRNLS ELLAVTAQSS KVMSILLGAV ASISLVVGGI GIMNIMLVSV TERTREIGIR IAIGAKRRDI LLQFLTEAVL LTTCGGIIGM LLGVAGARLV ASLVGWPTLV SVNTIVVAFA FSAGVGVFFG FYPARKASSL NPIEALRYE
|
| |