Gene Information |
Locus tag | GM21_1037 |
Symbol | |
ID | 8136359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1218779 |
End bp | 1220605 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868648 |
Product | histidine kinase |
Protein accession | YP_003020856 |
Protein GI | 253699667 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
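The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1218779
end_bp = 1220605
reported_gene_length_bp = 1827
reported_protein_length_aa = 608

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)        # 1827
print(gene_length_bp % 3)    # 0, so the CDS is a whole number of codons
print(protein_length_aa)     # 608
```

Under these assumptions the computed values agree with the reported Gene Length (1827 bp) and Protein Length (608 aa).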
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCACG GCGCCTTCAC TGGCACACTA AAACCCGCAA CGAGGTTTCG TTTCTTCCCT CCTTTCAGCA CGAATCCTTC CCACGCCGGC ACCCGGGCGC TTTTCTCACT CCTTTTGCTG GTCCTGCTGC TGTCCCTGAC CCCGCCGGCT TACGCGCAGC AATCCGCCCC GGTACACTAC GACCCCGCGG CCACCATCGT GGTCGGCGGC GACCGCTCCT ACCCCCCATA TGAATTCATC GACAAAGACG GCAGCCCCGC CGGCTACAAC GTGGATCTCA CCAGGGCGAT CGCCGAGGTG ATGGGGATGA AGGTGGAGTT CCGCTTCGGA AGCTGGGCGG AAATGCGGGC AGGGCTCCAG CAGGGAAAAA TCGACATCCT GCAGGGGCTC TCTTACTCCG ATGAGCGCTC GCGGAGCGTC GATTTCTCAC CCCCCCACGC CATGGTGCAC CACGCCATCT TCGCCCGCCG GGACTCGAAA CGGGTCCGTA CTCTCGAGGA GCTGAAGGGG AAGAAGGTGA TCGTCTTCCA GGACGGCATC ATGCATGAGC GCCTGAAGCT TTTAGGCTTC GAGAAGGACC TGGTGCTCAC CCCGACCCCG GCTGAGGCGC TCCGGATTCT AGCCTCGGGG CAGCACGATT ACGCGGTGGT GGCGCAACTT CCGGGGATGT ACCTGATCCG CGAACTCCAC TTGACCAACC TGGTTCCGGT GGCGAAAGCC GTGGTGAGCG AGCAGTACGG CTACGGTGTC GCCGAGGGGA ACAGGGAGCT TTTGACCCGC TTCAACGAGG GGCTCGCCAT CGTGATCAAG ACCGGGCAGT ACGCCCAGAT CTACAACAGG TGGCTCGGCG TGCACGAGCC TCCCCGGGTC ACCAGGGAGA TGGCGCTCAA GTACGGCGCC ATGATCCTGG TGCCGCTTTT GCTGGTGCTG GCGGGGACCG CGCTTTGGAA CAAGACGCTG CAAAAAAGGG TCGCCGAGCG CACCACCGAG CTGGCGCAGG AGGTCTCCGA GCGAAACAAG GCGCTGGAGG AGTTAAGGCG CCACCAGGAC AAGCTGATTC AGGCCGACAA GATGGCCTCT CTCGGGACGC TGGTCTCCGG CGTCGCCCAC GAGATCAACA ACCCCAACGG CCTCTTGCTG CTCGATATCC CGATCCTGCG GCGCGTGCAC GAGGACGCGG AGGAGATCCT CGAAGCGCGC TACCTGCAGG AGGGGGATTT CATGCTGGGG GGAGTACCCT ACTCCGAGAT GCGCGAGGAG ATCCCGCGCA TCCTGGAGGA GATGCTGGAC GGGGCGCAGC GTATCAAGAG GATAGTGAAC GACCTGAAGG ACTTCGCGCG GCGCGACGAC GCAGGCCACA TGGAGTCGAT CGACCTGGAG GCGGCCGCGA AGAGGGCCGT GCGCCTGGTC GAGCCGACGA TACGTTCCGC GACGGGCAGG TTCGAGGCTT TCTATGAGGG GAACCTCCCG CCTGTCATGG GCAACGCCCA GCGCATAGAG CAGGTCATCG TCAACCTGGT GCTCAACGCC TGCCAGTCCC TCACCGGCCG GGACCAGGGG GTGACGCTTG CCACCTCGCT GGACAGCGAA AGCGATAGCG TGCTGATCGA GGTGCGGGAC GAAGGGGTGG GGATAGCGCA GGAGCACCTG CCGCATCTCG TCGATCCCTT CTTCACCACC AAGCGGGAGA CCGGGGGGAC CGGGCTCGGC CTCTCCGTCT CCGCGGGAAT CGTCAAGGAG CACGCAGGCA CGCTCCGCTT CGCCTCGACG CCGGGGGAGG GTACCACGGT CACCCTTTCC 
CTTCCCGTTA CTTCCAGGAG GTCATGA
|
Protein sequence | MSHGAFTGTL KPATRFRFFP PFSTNPSHAG TRALFSLLLL VLLLSLTPPA YAQQSAPVHY DPAATIVVGG DRSYPPYEFI DKDGSPAGYN VDLTRAIAEV MGMKVEFRFG SWAEMRAGLQ QGKIDILQGL SYSDERSRSV DFSPPHAMVH HAIFARRDSK RVRTLEELKG KKVIVFQDGI MHERLKLLGF EKDLVLTPTP AEALRILASG QHDYAVVAQL PGMYLIRELH LTNLVPVAKA VVSEQYGYGV AEGNRELLTR FNEGLAIVIK TGQYAQIYNR WLGVHEPPRV TREMALKYGA MILVPLLLVL AGTALWNKTL QKRVAERTTE LAQEVSERNK ALEELRRHQD KLIQADKMAS LGTLVSGVAH EINNPNGLLL LDIPILRRVH EDAEEILEAR YLQEGDFMLG GVPYSEMREE IPRILEEMLD GAQRIKRIVN DLKDFARRDD AGHMESIDLE AAAKRAVRLV EPTIRSATGR FEAFYEGNLP PVMGNAQRIE QVIVNLVLNA CQSLTGRDQG VTLATSLDSE SDSVLIEVRD EGVGIAQEHL PHLVDPFFTT KRETGGTGLG LSVSAGIVKE HAGTLRFAST PGEGTTVTLS LPVTSRRS
|
| |
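The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11 (identical to the standard code for elongation codons) and assumes that the first codon of a complete CDS, here GTG, is read as methionine. The helper names `translate` and `gc_percent` are hypothetical, and only the first 30 and last 27 bases of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spaces used for 10-base grouping and upper-case the sequence."""
    return dna.replace(" ", "").upper()


def translate(dna, first_codon_is_start=True):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: when first_codon_is_start is True, the first codon is an
    initiator and is emitted as M regardless of its usual amino acid.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0 and first_codon_is_start:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 27 bases of the gene sequence above.
head = "GTGTCCCACG GCGCCTTCAC TGGCACACTA"
tail = "CTTCCCGTTA CTTCCAGGAG GTCATGA"

print(translate(head))                              # MSHGAFTGTL
print(translate(tail, first_codon_is_start=False))  # LPVTSRRS
```

The two outputs match the start and end of the protein sequence listed above. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the reported GC content of 64%.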