Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4084 |
Symbol | |
ID | 8139458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4663966 |
End bp | 4665291 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871699 |
Product | GAF sensor signal transduction histidine kinase |
Protein accession | YP_003023857 |
Protein GI | 253702668 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 138 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGTA GCGATATTTC AAGCCACGTT TCCGATCCTG CCCGGCTTGC CGCGTTGCGG GCAGTGGCGC TGCTGGATAC GCCGACCGAG GAGGCTTTCG ATCGCCTGAC CAAGCTCGCC TCGCGTTTTG CCTCTGCACC CGTCGCCCTC GTCACGCTGG TGGACAGCGA CCGGCAGTTC TTCAAAAGCT GCGTGGGGCT GCCCGAGCCG TGGCTCTCCA GCCGCCAAAC CCCTTTGTCG CATTCGTTCT GCCAATACAA CCGCGTCGCG AAGCAGCCCC TGATCATCGA GGACGCACGC GTTCATCCGC TGTTCAAGGA AAACCTCGCC ATCAGGGATC TGAAGGTGAT CGCCTACCTG GGGATCCCGC TGGTCACTTC CGACGGCTAC GTTCTCGGCT CCTTCTGCGT CATCGATAAC AAGCCGAGGC ACTGGAGCGG CGAGGATGTC GAGGTGGTCG AGAACCTCGC CGCCGCGGTG ATGACCGAAA TCCAGTTGCG CACGGAGATA GCGGTACGGG CCCGCGGCGA GAAAGATCTG CGCCGGCAGC ACGAGGAACT GGGGCGGGCA TATCGGGATC TTGAACGGGA GACGGCAGAG CGGGTGAAGA CTGCGGAGCA GTTGCGGCAA AGGGACCAGA TGCTGATCCA GCAAAGCCGC CTGGCCGCGA TGGGGGAGAT GATCAATAAC ATCGCCCACC AGTGGCGGCA ACCGCTCAAC CTGTTGGGGT TACTGGCCCA GGAACTGCCG ATAACCTACG TGACAGAAGA GTTTTCGCAA CAATACCTGG AGTCGCGGGT GCGAAAGATG ATGGAGGCGA TCGGACACAT GTCAGCCACC ATCGACAATT TCCGCAACTT CTTCTGCCCC GAGAAAGACA AGGTGGAATT CAGCATCTTC GACGTCGTGG ACCAGACCCT ATCATTGATG GGCTTGACCC TGAACCAGGT GCAGGTAAGG ATCGAGGTGG TAGAAAGGAT CAACCCCGTC ATCACAGGGT ACCCCAACCA GTATGCGCAG GTTTTGCTCA ACATCCTCAA CAACGCGAGG GACGCCTTCG CCGAGCGCAA CGTCCCAAGC CCCAAAGTAG AGATACGGAT AGCCGCCGAG GAAGGGCGAT CGGTGGTGAC GGTCAGCGAC AACGCCGGCG GCATCCCCCC CGAGGTCATC GACAAGGTTT TCGACCCCTA TTTCACCACC AAAGATCCCG ACAAGGGGAC CGGCATCGGT CTCTACATGT CCAAGATGAT CATCGAGAAG AACATGGGCG GGTCGCTGAC CGCCTGCAAC ACCGAAGAGG GCGCCCGCTT CCGGATAGAG GTATGA
|
Protein sequence | MNGSDISSHV SDPARLAALR AVALLDTPTE EAFDRLTKLA SRFASAPVAL VTLVDSDRQF FKSCVGLPEP WLSSRQTPLS HSFCQYNRVA KQPLIIEDAR VHPLFKENLA IRDLKVIAYL GIPLVTSDGY VLGSFCVIDN KPRHWSGEDV EVVENLAAAV MTEIQLRTEI AVRARGEKDL RRQHEELGRA YRDLERETAE RVKTAEQLRQ RDQMLIQQSR LAAMGEMINN IAHQWRQPLN LLGLLAQELP ITYVTEEFSQ QYLESRVRKM MEAIGHMSAT IDNFRNFFCP EKDKVEFSIF DVVDQTLSLM GLTLNQVQVR IEVVERINPV ITGYPNQYAQ VLLNILNNAR DAFAERNVPS PKVEIRIAAE EGRSVVTVSD NAGGIPPEVI DKVFDPYFTT KDPDKGTGIG LYMSKMIIEK NMGGSLTACN TEEGARFRIE V
|
| |