Gene GM21_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1037 
Symbol 
ID8136359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1218779 
End bp1220605 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content64% 
IMG OID644868648 
Producthistidine kinase 
Protein accessionYP_003020856 
Protein GI253699667 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCACG GCGCCTTCAC TGGCACACTA AAACCCGCAA CGAGGTTTCG TTTCTTCCCT 
CCTTTCAGCA CGAATCCTTC CCACGCCGGC ACCCGGGCGC TTTTCTCACT CCTTTTGCTG
GTCCTGCTGC TGTCCCTGAC CCCGCCGGCT TACGCGCAGC AATCCGCCCC GGTACACTAC
GACCCCGCGG CCACCATCGT GGTCGGCGGC GACCGCTCCT ACCCCCCATA TGAATTCATC
GACAAAGACG GCAGCCCCGC CGGCTACAAC GTGGATCTCA CCAGGGCGAT CGCCGAGGTG
ATGGGGATGA AGGTGGAGTT CCGCTTCGGA AGCTGGGCGG AAATGCGGGC AGGGCTCCAG
CAGGGAAAAA TCGACATCCT GCAGGGGCTC TCTTACTCCG ATGAGCGCTC GCGGAGCGTC
GATTTCTCAC CCCCCCACGC CATGGTGCAC CACGCCATCT TCGCCCGCCG GGACTCGAAA
CGGGTCCGTA CTCTCGAGGA GCTGAAGGGG AAGAAGGTGA TCGTCTTCCA GGACGGCATC
ATGCATGAGC GCCTGAAGCT TTTAGGCTTC GAGAAGGACC TGGTGCTCAC CCCGACCCCG
GCTGAGGCGC TCCGGATTCT AGCCTCGGGG CAGCACGATT ACGCGGTGGT GGCGCAACTT
CCGGGGATGT ACCTGATCCG CGAACTCCAC TTGACCAACC TGGTTCCGGT GGCGAAAGCC
GTGGTGAGCG AGCAGTACGG CTACGGTGTC GCCGAGGGGA ACAGGGAGCT TTTGACCCGC
TTCAACGAGG GGCTCGCCAT CGTGATCAAG ACCGGGCAGT ACGCCCAGAT CTACAACAGG
TGGCTCGGCG TGCACGAGCC TCCCCGGGTC ACCAGGGAGA TGGCGCTCAA GTACGGCGCC
ATGATCCTGG TGCCGCTTTT GCTGGTGCTG GCGGGGACCG CGCTTTGGAA CAAGACGCTG
CAAAAAAGGG TCGCCGAGCG CACCACCGAG CTGGCGCAGG AGGTCTCCGA GCGAAACAAG
GCGCTGGAGG AGTTAAGGCG CCACCAGGAC AAGCTGATTC AGGCCGACAA GATGGCCTCT
CTCGGGACGC TGGTCTCCGG CGTCGCCCAC GAGATCAACA ACCCCAACGG CCTCTTGCTG
CTCGATATCC CGATCCTGCG GCGCGTGCAC GAGGACGCGG AGGAGATCCT CGAAGCGCGC
TACCTGCAGG AGGGGGATTT CATGCTGGGG GGAGTACCCT ACTCCGAGAT GCGCGAGGAG
ATCCCGCGCA TCCTGGAGGA GATGCTGGAC GGGGCGCAGC GTATCAAGAG GATAGTGAAC
GACCTGAAGG ACTTCGCGCG GCGCGACGAC GCAGGCCACA TGGAGTCGAT CGACCTGGAG
GCGGCCGCGA AGAGGGCCGT GCGCCTGGTC GAGCCGACGA TACGTTCCGC GACGGGCAGG
TTCGAGGCTT TCTATGAGGG GAACCTCCCG CCTGTCATGG GCAACGCCCA GCGCATAGAG
CAGGTCATCG TCAACCTGGT GCTCAACGCC TGCCAGTCCC TCACCGGCCG GGACCAGGGG
GTGACGCTTG CCACCTCGCT GGACAGCGAA AGCGATAGCG TGCTGATCGA GGTGCGGGAC
GAAGGGGTGG GGATAGCGCA GGAGCACCTG CCGCATCTCG TCGATCCCTT CTTCACCACC
AAGCGGGAGA CCGGGGGGAC CGGGCTCGGC CTCTCCGTCT CCGCGGGAAT CGTCAAGGAG
CACGCAGGCA CGCTCCGCTT CGCCTCGACG CCGGGGGAGG GTACCACGGT CACCCTTTCC
CTTCCCGTTA CTTCCAGGAG GTCATGA
 
Protein sequence
MSHGAFTGTL KPATRFRFFP PFSTNPSHAG TRALFSLLLL VLLLSLTPPA YAQQSAPVHY 
DPAATIVVGG DRSYPPYEFI DKDGSPAGYN VDLTRAIAEV MGMKVEFRFG SWAEMRAGLQ
QGKIDILQGL SYSDERSRSV DFSPPHAMVH HAIFARRDSK RVRTLEELKG KKVIVFQDGI
MHERLKLLGF EKDLVLTPTP AEALRILASG QHDYAVVAQL PGMYLIRELH LTNLVPVAKA
VVSEQYGYGV AEGNRELLTR FNEGLAIVIK TGQYAQIYNR WLGVHEPPRV TREMALKYGA
MILVPLLLVL AGTALWNKTL QKRVAERTTE LAQEVSERNK ALEELRRHQD KLIQADKMAS
LGTLVSGVAH EINNPNGLLL LDIPILRRVH EDAEEILEAR YLQEGDFMLG GVPYSEMREE
IPRILEEMLD GAQRIKRIVN DLKDFARRDD AGHMESIDLE AAAKRAVRLV EPTIRSATGR
FEAFYEGNLP PVMGNAQRIE QVIVNLVLNA CQSLTGRDQG VTLATSLDSE SDSVLIEVRD
EGVGIAQEHL PHLVDPFFTT KRETGGTGLG LSVSAGIVKE HAGTLRFAST PGEGTTVTLS
LPVTSRRS