Gene GM21_0538 details

Gene Information

Locus tag: GM21_0538
Symbol:
ID: 8135849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 658575
End bp: 660227
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 59%
IMG OID: 644868154
Product: histidine kinase
Protein accession: YP_003020373
Protein GI: 253699184
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
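
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(658575, 660227)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values from this record, the two functions give 1653 bp and 550 aa, matching the Gene Length and Protein Length fields.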


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 105
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCAG GAAGCCGAAA GGACATTGGC GATCAGAAGC ACCAGCAGTA TTACGCGCTC 
TATAAGGAGC TCATCTTCGC GGAACAGGTC AAGCAGCTCT ACCGTCTGGC TCCGCTGGGG
ATGGTGGCGA CATTAGTGAA CGCGCTGCTT GTGTTCTTCG TCATGAAGGA CGTCATGCCT
CGGCGGTTTC TCATCTTCTG GCTGTTCGGG ATCGTCTTGG TCACCCTCCT CAGAGGCTTG
CTCGGTTTCC AATATGCGAA GCATCAACCT GAACCGGCGC AGGCGCGGCT TTGGGCAAAC
AGGTTCCTGG TCGGCCTGCT GGCTATAGGC GCTGCGTGGG GAAGCATCGG AGTCTTCTCC
TTTGCCGAAG CATCCATGGA GCATCAAGTC TTCATCGCCT TCGTCTTGGG CGGGATGGCG
GCCGGAGCGT CAACGACCTT CGCGACGGTG CGCCATGCCT ACCTCGCATT CAGCATCCCG
GTCCTCGTGC CGCTGGCCGT ACACTTCGTT CTGATCCAGG ACATCTTCCA TTACCTCATG
GCTGCGATGA CCACTCTCTT TGGTTTTCTG CTTTGGCGCA TCTCGCTGCA CAATTACTCG
ATAAACCGCG ACTCGTTGCT GCTCAGCTAC GAGAACAGGG AGATGATCGA GACCCTGAAG
CAGGCGAAGG AGCGGGTCGA AGGTTTGAAC TCACAGTTAA TGGAGGAGAT TACCGCCAGG
CTCGAAGCTG AAGCGGCGTT AAGGGGTAAT CAGGAGCAGC TCGAAAATCT GGTGGAGGTC
CGGACCGCGG ATCTTGTGAG CAGCAACGAG CAGCTGAAAA AAGAGATCGA GGAGAGAAAG
CAGTACGAAC AGGCGCTGCT ACAGGCCGGT GAACGGCTGG CCGTCGCCCA GCGGCAGTCG
GAGGCGGCGA ACAGGGCGAA AACCGAGTTT CTCGCCAATA TGAGCCACGA GATGAGGACG
CCCCTGGCTG GGGCGCTCGG GATGATCAGG CTGGTCCTCG ACATGAATAT TGGTGCGGAG
GAGCGGCAAC TCCTTGAGAT GGCAAAACGG TCGGCGGACT CCCTGGTTAG GATCATCGCC
GATCTGCTCG ACTTCTCCCG GCTGGAGGCC GGGGTGATGA CCTTCGAAGA TAAGCCGTTC
TTATTGAAGG AGGTGGTCAG GTCGGCGGTG GAGGTGGTTT CCCTGGTTGC GGAAGAAAAG
GGGCTCAGCC TCTCCTGGGC GGTCAACGCC GCAGTTCCCG AGCAATTGAG GGGCGACGAG
GGAAGGCTTA GGCAGGTGCT GGTGAATCTC TTGGGGAACG CAGTGAAATT CACCGAGCGA
GGCGGGATAG AGGTCGGCAT CGGAACCTTC GAGCCTCTTG AGGCACAAGG GGAGCAGTAC
GTCGAGTTTT CCGTGAGGGA CACGGGAGTC GGTATTCCCG CCGATCAGTT GGAGAGGATA
TTCGACCGCT TTACCCAGGT GGACTCATCG CTTACCAGGA GGCATGGCGG CACCGGCCTG
GGCCTCGCCC TCACGCGCCA GATCGTCGAG AAGATGGGTG GGAGCATCTG GGCCGAGAGC
GTTGTAGGCT CGGGAAGCAC GTTCCATTTC ACCGTCCCCA TGGTGTCGAA CGCGGCAGCC
GGACCTGAGC GCGATTCGGA CCGTCTTTCG TAA
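
The gene sequence above is laid out in blocks of 10 bases, so it has to be stripped of whitespace before any analysis. The following is a minimal sketch of how the GC content field could be recomputed from such a block. It assumes GC content means the percentage of G and C bases over the full sequence length. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here only as sample input.
    sample = "ATGATGCCAG GAAGCCGAAA"
    print(len(clean_sequence(sample)), "bases")
    print(gc_percent(sample), "% GC")
```

Passing the full gene sequence block to `gc_percent` and rounding the result is one way to compare against the reported GC content value.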
 
Protein sequence
MMPGSRKDIG DQKHQQYYAL YKELIFAEQV KQLYRLAPLG MVATLVNALL VFFVMKDVMP 
RRFLIFWLFG IVLVTLLRGL LGFQYAKHQP EPAQARLWAN RFLVGLLAIG AAWGSIGVFS
FAEASMEHQV FIAFVLGGMA AGASTTFATV RHAYLAFSIP VLVPLAVHFV LIQDIFHYLM
AAMTTLFGFL LWRISLHNYS INRDSLLLSY ENREMIETLK QAKERVEGLN SQLMEEITAR
LEAEAALRGN QEQLENLVEV RTADLVSSNE QLKKEIEERK QYEQALLQAG ERLAVAQRQS
EAANRAKTEF LANMSHEMRT PLAGALGMIR LVLDMNIGAE ERQLLEMAKR SADSLVRIIA
DLLDFSRLEA GVMTFEDKPF LLKEVVRSAV EVVSLVAEEK GLSLSWAVNA AVPEQLRGDE
GRLRQVLVNL LGNAVKFTER GGIEVGIGTF EPLEAQGEQY VEFSVRDTGV GIPADQLERI
FDRFTQVDSS LTRRHGGTGL GLALTRQIVE KMGGSIWAES VVGSGSTFHF TVPMVSNAAA
GPERDSDRLS