Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2703 |
Symbol | |
ID | 8138045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3148040 |
End bp | 3149314 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644870307 |
Product | protein of unknown function DUF294 nucleotidyltransferase putative |
Protein accession | YP_003022497 |
Protein GI | 253701308 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0000000043685 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGATGC TTTTGGGAAC CAAGGGAGAC GACATCCTCA AGTGGCGCGA GAGCGCCCGG CTGGTGGAGG GGCTTAAGTC GCAGCTTAGC CGCAAGTGGG GGCGGAGCTC CGCGCAGGAG GCGCAGTCCT TCCTGGAAGG TATGACCGAG GCGCTTGAGG CGGAGCTCGG CTACGAGTCG CAGCTGGACG CGGAGCTTGC CGCCTTGCAC CGGGCGCTAA GCGAAGCGGT CGCCCCGGCC GGGCTCTGCG CGCTTGCCGC GCGCCACCGC GAACTTCTCT CCGCCCACTT CAGCAGGCGC GGCTCGGTTC TCGCGCTGAC CGGCGCAGGC AACGACTGGC ACGACCTCCT GGTGGCGCGG GCGGCGCTCC TGGCGGAGGA GCGGATGCTC TCCCTGGGAC AGGGACCCCC ACCCGTCTAC GCACTGCTGG TGACCGGGGA CCGGGGGCGC GAGGAGCAGA CCCTCTACGG CAAAAACCGC TACCTCCTTA TGCACCAGCT CGATTCGGAG CGCTTTTTCC TCTTCACCCG CCAGCTCGCC ACCGCCCTCA AGGAGGCGGG GGTGATCGCG GGGGAGGAGG GGCTTTGGCA CGGGTCGCTG GCAGAGTGGC GGGCGCTTTT GAAAGGTTCG GGCACGGCCC GCAAGAGGGA CCCGCGCGAG GAGATGGAAA ATCCGCTTCC CCCCTTCGCC GCGCCGATGA AGGGGGGAAG CCCCTCGATG CCCGACTGGC AGTGGCGCCT GGAGGCGATG GCGGACCTTT GTTTCGTCAC CGGGTACGAA CCGTTAGCCG ACGAGGCGAT AAACGCCGCG GCGTCGTCGC TCAAGGAGCA GAGAAGCCGC GAGGCGTTCT TCCAACTGGC GCGCAAGGCG ATCCACCTCC CGCTGGCCCT CGGGCACTTC GGGCGCTGGA GGCTTGAGAA AAGCGGGGAA CACAAGGGGG AGATCGACGT GGAAGGGCTC GGCCTCACCC CGCTGGTGAG CGCGGTCCGG GTGCTGGCCA TCCACATGGG GGTGCAGGGG GGGGGGACGC TGCACCGGGT GAGGGAGCTT CTCTACCGCG GCCTTTTCAG CGTGGAGCTG GCCGAAAGGG TGCTCGAGGC CCTGCAGTGC CTGATGCAGT TGCGGATACT GAGCGAGATA CGCGGCGAGG CGGCAGGGGC GTACGCGAAC CCCGAGGAGT TCACCCTGGA GCAGGACGAG AGGATCAGGG CCGCCTTCGA GGCGGTGCTC GACCTGCAGA AGATGGCCTA CCAGCGCATG GTGGGACAAG GATAG
|
Protein sequence | MAMLLGTKGD DILKWRESAR LVEGLKSQLS RKWGRSSAQE AQSFLEGMTE ALEAELGYES QLDAELAALH RALSEAVAPA GLCALAARHR ELLSAHFSRR GSVLALTGAG NDWHDLLVAR AALLAEERML SLGQGPPPVY ALLVTGDRGR EEQTLYGKNR YLLMHQLDSE RFFLFTRQLA TALKEAGVIA GEEGLWHGSL AEWRALLKGS GTARKRDPRE EMENPLPPFA APMKGGSPSM PDWQWRLEAM ADLCFVTGYE PLADEAINAA ASSLKEQRSR EAFFQLARKA IHLPLALGHF GRWRLEKSGE HKGEIDVEGL GLTPLVSAVR VLAIHMGVQG GGTLHRVREL LYRGLFSVEL AERVLEALQC LMQLRILSEI RGEAAGAYAN PEEFTLEQDE RIRAAFEAVL DLQKMAYQRM VGQG
|
| |