Gene Information |
Locus tag | GM21_2030 |
Symbol | |
ID | 8137366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2354224 |
End bp | 2355180 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869645 |
Product | protein of unknown function DUF72 |
Protein accession | YP_003021840 |
Protein GI | 253700651 |
COG category | [S] Function unknown |
COG ID | [COG1801] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
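The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2354224, 2355180)
print(length, protein_length_aa(length))  # 957 318
```

Both values agree with the Gene Length and Protein Length fields of this record.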
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000000000273114 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCCCAG GCCGCATTCG CATAGGGACC TGTTCCTGGA CGGAGAAAAG CCTCGTCGAA GGGGGGCTCT TTTACCCCCA CGGCGCCGGC ACGCCCAAGG CACGCCTCAA GTTCTACGCC AGCCGGTTCG ACACGGTCGA GATAGAGAGC AGCTACTACC AGATCCCCAC ACTGGAGATG GCCCGGGCCT GGGCCGAGCG GACCCCGGAC CGGTTCCTGT TCCACGTAAA GGCCTACGGC GCCCTCACCG GACACAACGT GGATCCCGGC AGGCTCTCTC TGGAGCTGCG CCGGATGCTT CCCGCCGAGG ATCGGGAAGA GGAGGATGTG CACGTCTCCG ACCCGGCGGC GCTAAGGGCG ATGGCAAAGG CCATGACCGA GGCGCTCGTC CCGCTCAAGG AGGCGGGGAA GCTCGGCTTC ATCATCTTCC AATTCCCCCC CTGGTACGGC TACAAGAAGG AGAACAGGGA GTATCTCCTC TACTGCAAGG AATTAATGGC GGGGCACCCG ATAGCGGTGG AGTTCCGGCA CGGCAGTTGG CTTACCAGCC GCAACCGGGA GGAACTCTTC GCGTTCCTGC AGGAGCACAA GATCACCTAC ATCACCTGCG ACGAGCCGCA GTTGGGGACG CTGGCGACAG CCCCCTTCCG TCCCGAGGCC ACCACCTCGG TCGCCTACCT GAGGCTGCAC GGGCGAAGCG CCGAGGATTG GCAGGCTCGC GCCACGACCG CCGACGAATA TCTTTATACG GAGCCCGAGC TGAGGATGAT AGCGGCGGAA GCCCGGCGTC TGAGCGAGAA AGCGAGGCTC ACCTTCGTCA TGTTCAACAA CTGCCGCTGC GGCTATTCGG TGAAAAACGC GCTGAGGATG AAGGAGTTGT GCGGCTACCC TCCACCAGAG GGCAGGGGTT CTGACAGAAA TCCCCCCTAT CCCCCCTTCG CAAAGGGGGG GACCTGA
|
Protein sequence | MSPGRIRIGT CSWTEKSLVE GGLFYPHGAG TPKARLKFYA SRFDTVEIES SYYQIPTLEM ARAWAERTPD RFLFHVKAYG ALTGHNVDPG RLSLELRRML PAEDREEEDV HVSDPAALRA MAKAMTEALV PLKEAGKLGF IIFQFPPWYG YKKENREYLL YCKELMAGHP IAVEFRHGSW LTSRNREELF AFLQEHKITY ITCDEPQLGT LATAPFRPEA TTSVAYLRLH GRSAEDWQAR ATTADEYLYT EPELRMIAAE ARRLSEKARL TFVMFNNCRC GYSVKNALRM KELCGYPPPE GRGSDRNPPY PPFAKGGT
|
| |
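The gene and protein sequences can be checked against each other by translating the CDS with translation table 11, as listed in the record. The following is a minimal sketch using only the standard library. It assumes the gene sequence is given in coding orientation (it begins with GTG and ends with TGA), that an alternative start codon such as GTG is read as methionine, and that translation stops at the first stop codon. The helper names `clean`, `translate_cds`, and `gc_content` are hypothetical.

```python
# Standard genetic code, which table 11 shares for all non-start codons.
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGTCCCCAG GCCGCATTCG CATAGGGACC"))  # MSPGRIRIGT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.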