Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4131 |
Symbol | |
ID | 8139505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4721765 |
End bp | 4722955 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644871746 |
Product | domain of unknown function DUF1745 |
Protein accession | YP_003023904 |
Protein GI | 253702715 |
COG category | [S] Function unknown |
COG ID | [COG3287] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACTT TCGCAGGCGT TGGCTTCAGC CTTGACAAGA ACCCGGCTGA CGCGGGTAAG GAAGCGGCTC TGCAGGCGAT GCAGCGGGGC AGGATGGCCA AACCGGATTT CCTCTTCGTC TTCGCCACGG TGGGGTACGA CCAGGAGCTT CTGCTACGCT CGGTCCGCGA CGCCACCGCC GGCGCCCCCT TGAGCGGCTG CTCGGGGGAG GGTGTGATCA CGGCTGGAGC CGCCGCCGAA ACCAACTTCG GGGTCTGCGT CCTCGCCATA GCCTCGGATG AGGTCCGCTT CGCCAACGCC TACGTGCAGG GGCTAGATGC CGGGTACGCC CGGGCCGGGG AGTTGCTGGG GGAGCAGGTC CGCCCGCTGC TGGATGACGA CGCCGTCGCC TGCTTCCTCT TCGCCGACGG CCTCCTCTTC GACTTCGATC CCTTCCGGGA CGCCTTCGAA ACCGCTCTTG GTTGCGGAAG GGCGCTCCCC ATGTTCGGAG GGCTTGCCGC CAACAACCTC TCCACCCGCA GGACCTACCA GTACCACGAC GACCAGGTGA TCTCCGAAGG GATCTGCTGC GTGGTCATGT CCGGCAACGC CAGGCTCGCC TGGGGGGTGA ACCACGGCTG CGTGCCGGTG GGGACGCGGC GCACCATCAC CCGCTGCCAG GGCAACATCA TCTACGAGAT CGACGGCATC CCTGCGCTGG AAGCGTTGAA GGAGTACATC GAGAACGGCT CCGGCAGCGA CTGGAACAAG GTAACCCTCA ACCTCTGCCT CGGCTTCAAG ACCCCCGAAC ACCTGAGGAA GGAGTACGGC GAGTACATCA TCCGCTACAT GATGGACAAA AACGACCAAG AGGGGTGGGT AAGCATCCAG TCGGATGTGA CCCAGGGAAG CGCTCTCTGG ATCATGAGGC GGGACAAGGA GCTGATGCGT GAAGGGCTGC AGACGATCTC GCGGCATATA CGGGAGCAGT TGGGAGAGAG TCGGCCCAAG CTGGTGATGC AGTTCGAATG CATGGGGCGC GGACGCGTGG TCTACCGCGA GCAGGAGAAG CTCGAGCTGC TCAAGTCCCT GCAAGATGAC CTGGGGGCGG AACTACCCTG GATCGGATTC TACTCCTACG CGGAAATCGG CCCCGTCTCC GGCTACAACT GCATCCACAA TTTCACGTCC GTGCTGTTGG CCGTGTACTG A
|
Protein sequence | MGTFAGVGFS LDKNPADAGK EAALQAMQRG RMAKPDFLFV FATVGYDQEL LLRSVRDATA GAPLSGCSGE GVITAGAAAE TNFGVCVLAI ASDEVRFANA YVQGLDAGYA RAGELLGEQV RPLLDDDAVA CFLFADGLLF DFDPFRDAFE TALGCGRALP MFGGLAANNL STRRTYQYHD DQVISEGICC VVMSGNARLA WGVNHGCVPV GTRRTITRCQ GNIIYEIDGI PALEALKEYI ENGSGSDWNK VTLNLCLGFK TPEHLRKEYG EYIIRYMMDK NDQEGWVSIQ SDVTQGSALW IMRRDKELMR EGLQTISRHI REQLGESRPK LVMQFECMGR GRVVYREQEK LELLKSLQDD LGAELPWIGF YSYAEIGPVS GYNCIHNFTS VLLAVY
|
| |