Gene GM21_3450 details


Gene Information

Locus tag: GM21_3450
Symbol:
ID: 8138817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3986477
End bp: 3988633
Gene Length: 2157 bp
Protein Length: 718 aa
Translation table: 11
GC content: 61%
IMG OID: 644871066
Product: Hpt sensor hybrid histidine kinase
Protein accession: YP_003023231
Protein GI: 253702042
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
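
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no amino acid. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3986477
end_bp = 3988633
gene_length_bp = 2157
protein_length_aa = 718

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.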


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.0000325842
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATCTCA ATGAAAGCGC ACAGACGCAG GTCCGTAAGC TTCAGGAGAG GGTGAACTTC 
CTTGAGGAAA CCAACCTGAA CTACCTGAAG ACGCTCGACG TGCTCACCGC ATGCAGCGAT
TTCCAGTCGG ACATCTACCG TCAGAAGGAA CCCTCCTTCG TGATCAAGGC GGTCTTCGGC
CAGTTGAAGC GGCTGATACC GTTCACCGCT CTCGGCATGC TCGGCATCGA GGCGGACGCT
TCCTTCAGCC TTACCGTCTG CGACCCCGAA TTCTCCTGCG GCGAGATCAT GAACGAGGTC
GATGCGCGGG TGCAGGACGG CACCTTCGCC TGGGCCGTCA ACCAGAACCA CCCCGTTGTG
GTGCCGACGC TCACCGGCGC CGATACGCTC GTGCTCCACG TTCTCGCCAC CAACAGCCGC
ATCAGGGGGA TGTTCGTGGG GATCCTGCCC GGCAGCCACC TAAGCGCCGA GGTCTCCACC
CTCAACGCCC TGAGCAGCAT ATTGATCAAC ACGGCCTATG CGGTTGAGAA CTCCGAGCTC
TACGACATGC TCCAGGAACA CATGCAGAAT CTGGAAATGA AGGTCGCCCA GCGCACTACT
GAACTTGAAG AAGCGCTGGT CAAGGCCGAG GCCGCCACCG CGGCAAAGAG CGTCTTTCTG
GCCAACATGA GCCACGAGAT CAGGACGCCG ATGAACGGGG TGATCGGCCT TGCCAAGCTG
CTTATGGAGA CACCGCTCGA CGAGGTGCAG CAAGGTTACA TGGAGTCGCT TTCCGATTGC
GCCGAAAACC TCCTCACCAT CATCAACGAG ATCCTCGACG TCTCCAAGGT CGAAGCCGGC
ATGATAACGC TGGAGGCTGT CGTCTTCGAC CTGAGGCGTT TCCTGGACCG CTCGCTGCAA
CCGTTCGTGC TCCGCGGCCA GAAAAAAGGA GTCCGGGTCC ATTTGGAGGC GGACTCGGGA
CTTCCGGAAC TGATGGTGGG GGATCCGGTT CGACTCCGGC AGGTACTTGC CAACCTCCTG
GGAAACGCTC TCAAATTCAC CCAGCAGGGG AGTATCACCC TGACTGCCGC CTTGACGGGG
AGCGGAGAAA ACCGCGTCGC TTTGAAGTTC TCCGTCGCCG ACACCGGTAT CGGCATAGCC
GCGGAAGCAA TGGAAGTCAT CTTCGAGAAA TTTTCCCAGG CCGACAGCTC CACCACCCGC
CTCTACGGCG GGACCGGGTT GGGTCTGTCC ATCAGCAAAA GCCTGGTCGA GCTGATGGGG
GGGGAACTGA GCGTGCAAAG TACCCTGGGA GAAGGGAGCG TTTTCAGCTT CAGTATCGAG
ATGAGCCTGC CGAAAGCAGG CGAGCGTCCC GCCGAGGAAG AGGGCGAAGC GCCGGGCGAG
AAGGTCGAAC GCGAGCTGAA GATCCTCGTG GTGGACGACG TGCCGCTCAA TCAGCTGATT
TCCGCCAAAC TCATCGCCAA GGCAGGAAAC CACCATATCT CCACCGCGCA AAACGGACGA
GAGGCCGTGG AAAAGTGGGA ACAGGAGTAC TTCGACCTCA TCTTCATGGA CGTGCAGATG
CCGGTCATGG ACGGACTGGA AGCGACGCGG ATCATCAGGA GCCGCGAAGA GGGGACCGGG
TGGCGGGTCC ACATCTGCGC TATGACCGCC AATGCGATGA AGGAGGACGT CACCATCTGC
ACCGGCGCCG GCATGGACAG TTACCTGTCG AAGCCGGTGC GGGAGCGGGA AATCGCCTCC
ATGATCCGGA AGGTGGCCTG CTCCCATGCC TGCCAGCCAT CCCGCGCGGC CGCAACGCCA
GCGCCGGAGC CCGAAGCGGC ACCTCTACCG GCCTTCGACC GGGCCGACCT CCTGGAGCGC
CTGGGGGGGG AAGATGAAGC GGTGGGGATA TTCGTGGTGA AATTCATCGC GGCGGTAACC
GAGCACCTGG ATCGACTGAA AGAGGCGGTC TCGGACCGGG ATCTTGCCGC AGTCTATTAC
CGTGCCCACA CCATAGCCGG AACTTCCGCC AACATGGGGG CGCCCGTGAT GCGCAAACTC
GCAGCAAGGA TGGAATCCGC AGCCAAGACC GAGGATGCTG AGCCTCTCGA AGCCCTTTTC
CTGGAACTGC TGCAGGCTTT TTCCGCTTTC AAGTCCGAAG CGCAGACAGC GCCGTAA
 
Protein sequence
MHLNESAQTQ VRKLQERVNF LEETNLNYLK TLDVLTACSD FQSDIYRQKE PSFVIKAVFG 
QLKRLIPFTA LGMLGIEADA SFSLTVCDPE FSCGEIMNEV DARVQDGTFA WAVNQNHPVV
VPTLTGADTL VLHVLATNSR IRGMFVGILP GSHLSAEVST LNALSSILIN TAYAVENSEL
YDMLQEHMQN LEMKVAQRTT ELEEALVKAE AATAAKSVFL ANMSHEIRTP MNGVIGLAKL
LMETPLDEVQ QGYMESLSDC AENLLTIINE ILDVSKVEAG MITLEAVVFD LRRFLDRSLQ
PFVLRGQKKG VRVHLEADSG LPELMVGDPV RLRQVLANLL GNALKFTQQG SITLTAALTG
SGENRVALKF SVADTGIGIA AEAMEVIFEK FSQADSSTTR LYGGTGLGLS ISKSLVELMG
GELSVQSTLG EGSVFSFSIE MSLPKAGERP AEEEGEAPGE KVERELKILV VDDVPLNQLI
SAKLIAKAGN HHISTAQNGR EAVEKWEQEY FDLIFMDVQM PVMDGLEATR IIRSREEGTG
WRVHICAMTA NAMKEDVTIC TGAGMDSYLS KPVREREIAS MIRKVACSHA CQPSRAAATP
APEPEAAPLP AFDRADLLER LGGEDEAVGI FVVKFIAAVT EHLDRLKEAV SDRDLAAVYY
RAHTIAGTSA NMGAPVMRKL AARMESAAKT EDAEPLEALF LELLQAFSAF KSEAQTAP
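
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: `translate` is a hypothetical helper, and the codon table is written out in the conventional TCAG order. It relies on translation table 11 using the same amino-acid assignments as the standard genetic code (the two tables differ in which codons may serve as starts). Only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCATCTCA ATGAAAGCGC ACAGACGCAG GTCCGTAAGC TTCAGGAGAG GGTGAACTTC"
last_row = "CTGGAACTGC TGCAGGCTTT TTCCGCTTTC AAGTCCGAAG CGCAGACAGC GCCGTAA"

print(translate(first_row))
print(translate(last_row))
```

The last row starts at base 2101 of the gene, so it is in frame when translated on its own. The two results correspond to the first 20 and last 18 residues of the protein sequence listed above.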