Gene TM1040_2774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2774 
Symbol 
ID4076542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2930923 
End bp2933208 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content58% 
IMG OID638008099 
Producthistidine kinase 
Protein accessionYP_614768 
Protein GI99082614 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.622194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTG CTGAAAAATT GGCTCAGGAA CGCAGGGCGC GATTGGCAGC CGAGCGACTT 
CTGGAGCTGA AACAAGCTGA ACTGACCGAT GCGAACCGCA AGCTCGGCCG CCATGCGCTT
GCGCTAACCA AACGGATCGG CGCCACTCAG GCAGAAATCG CGACCTTCAA GAACGAAAAT
GAAAAGGTCA AAAGCGACCT CAACACAGCC AATGAAAAGG TCGAGCAGGC CGAACGGCGC
CTGTGGCACT CTATTCAGGC CTTTCAGGAC GGTTTTGCCT TCTTTGACAG TGACAGCAAG
TTGATCGGCG CCAACACTGC CTATCTCAAT ATCTTTGAGG GGGTAGAGGA GGTATCGCCG
GGCGTCTCCT ATGTGCGCAT CCTGCAAATC CTGACCGATG AGGGGCTGAT CGACACCGAG
GAGCTCTCGG CGGATTCGTG GCGCGCGATG ATGACGGAGC GCTGGATGTC GCCCGCGCCA
GAACCAACGG TGGTGCGGCT GTGGAATGAT CGCTACCTGA AGCTGATCGA CCAACGCGGC
CATGACGGTG ACATCGTGAG TCTCGCCCTC GATATCACCG CAACTGTCCA CTACGAGGAA
GAGCTTCGCA GCGCCCGAGA GCGCGCCGAG GCTGCGAATC GCGCGAAATC CGCCTTTCTG
GCCAATATGA GCCACGAGAT CCGTACGCCA ATGAACGGTG TCGTCGGCAT GGCCGAGCTC
CTGAGCGACA CCACGCTGAC CGAAGAGCAG CGCCTTTATG CCAATACAAT CAAGAACTCG
GGCGAGGCGC TTCTGGTCAT CATCAACGAC GTTCTCGACT ATTCCAAGAT CGAGGCGGAC
AAATTGGCGC TGCATCCCGA GGCCTTTGAT CTTGAAGCCT GCATGCATGA GGTCGTCATG
CTGCTGCAGG CCAATGCCCG CGACAAAGGG CTGACGCTGC TGGTTGACTA CGACCTCTTT
ATGCCGACGA ATTTCATTGG CGACCCAGGG CGCATCCGCC AAATCCTCAC CAATCTTATC
GGCAATGCGA TCAAATTCAC CAACGAAGGG CATGTTACGG CTCGCGTGAC CGGGGTGCCG
CATCCCGAAG ACAATACCGT GATGCTGCAT ATCTCGATCG AGGACAGCGG CATCGGCATT
CCTGACGACA AGATTCAGCA CATCTTTGGC GAATTCAATC AGGTCGAGGA TGAAACCAAC
CGGCAGTTTG AGGGCACGGG GCTTGGTCTC GCGATCACCG AACGCCTGAT CAAGTTGATG
GACGGCGAGA TCTGGGTCGA AAGCCAGTTG GGCAAAGGCT CGTGTTTTGG CTTCCGCCTG
CCGCTTGTGG TGGCGGATGG CTCACAGGTG AATGCGCCCG AACTGCCCGA AAACCTGAAC
TGCGTGATGA TCGTCGATGA TATGGATATC AACCGCGAGA TCCTGAACCG TCAACTTCAG
CGCCTGGACC TCAAGGTGGT GGCCGTAACC AGCGGTGCCG AGGCACTCGA CGTTCTCAAT
GCCGAAATCG ATTTGGTCAT CACCGATCGC AACCTACCGG GGATGGACGG GCTGAAACTG
GCGCGCAGCC TGCACCATGC TTCGCCACAG ACGCCGGTAC TGCTGCTTTC TTCTGATCCT
GCTGATGGGA GCGACCCTGC CCTCAGCGAT CTCTTTGCTG GCATTCTGCA GAAACCGATT
CCTCGCGGCG CCCTGTTCCG AGCCCTCTCC AAACTTGATG CGCCATCTGT ACATCCCGCC
CGACCGCAAG ATGTGCCACC GGCTGCTCCC GCCATCGAGC CGCCTTCAGA CGCCCCCGTG
CAAGTGCCTG CACCCGAAGC GGGCGTGGAC TTCGGATCCG GCGCGACGAT CGCTTCGGAT
CTGCGCCGGA TGCGGGTTCT GGCGGCGGAG GACAACCGCA CCAACCAGCT GGTCTTCCGC
AAGATGGTCA AGGATATGAA CATCGACTTG CGCTTTGCAG GCAACGGGTT GGAGGCGGTT
GAGCTCTACC AGAGTTTTAA ACCGGATATG ATTTTCATGG ATATTTCCAT GCCCAAAATG
GATGGCAAAG AGGCCACGCA GGCCATTCGC GAGCTTGAAA AAGAAAACGG TCGCAGAGTG
CAGATAGTGG CCTTGACGGC GCATGCGATG GATGGCGATT CCGAGGGCAT ACTGGCCGCA
GGCCTTGACG ACTACCTCAC CAAGCCCCTG CGGAAATCGG TGATCCACGA CCGGATCATG
AAAAACATGC CCACGGCCGT GGCTCCGCTG CAAGGCGAAG GCGATGATCT GCAAGCCACA
GGCTGA
 
Protein sequence
MSLAEKLAQE RRARLAAERL LELKQAELTD ANRKLGRHAL ALTKRIGATQ AEIATFKNEN 
EKVKSDLNTA NEKVEQAERR LWHSIQAFQD GFAFFDSDSK LIGANTAYLN IFEGVEEVSP
GVSYVRILQI LTDEGLIDTE ELSADSWRAM MTERWMSPAP EPTVVRLWND RYLKLIDQRG
HDGDIVSLAL DITATVHYEE ELRSARERAE AANRAKSAFL ANMSHEIRTP MNGVVGMAEL
LSDTTLTEEQ RLYANTIKNS GEALLVIIND VLDYSKIEAD KLALHPEAFD LEACMHEVVM
LLQANARDKG LTLLVDYDLF MPTNFIGDPG RIRQILTNLI GNAIKFTNEG HVTARVTGVP
HPEDNTVMLH ISIEDSGIGI PDDKIQHIFG EFNQVEDETN RQFEGTGLGL AITERLIKLM
DGEIWVESQL GKGSCFGFRL PLVVADGSQV NAPELPENLN CVMIVDDMDI NREILNRQLQ
RLDLKVVAVT SGAEALDVLN AEIDLVITDR NLPGMDGLKL ARSLHHASPQ TPVLLLSSDP
ADGSDPALSD LFAGILQKPI PRGALFRALS KLDAPSVHPA RPQDVPPAAP AIEPPSDAPV
QVPAPEAGVD FGSGATIASD LRRMRVLAAE DNRTNQLVFR KMVKDMNIDL RFAGNGLEAV
ELYQSFKPDM IFMDISMPKM DGKEATQAIR ELEKENGRRV QIVALTAHAM DGDSEGILAA
GLDDYLTKPL RKSVIHDRIM KNMPTAVAPL QGEGDDLQAT G