Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2774 |
Symbol | |
ID | 4076542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2930923 |
End bp | 2933208 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638008099 |
Product | histidine kinase |
Protein accession | YP_614768 |
Protein GI | 99082614 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.622194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTTG CTGAAAAATT GGCTCAGGAA CGCAGGGCGC GATTGGCAGC CGAGCGACTT CTGGAGCTGA AACAAGCTGA ACTGACCGAT GCGAACCGCA AGCTCGGCCG CCATGCGCTT GCGCTAACCA AACGGATCGG CGCCACTCAG GCAGAAATCG CGACCTTCAA GAACGAAAAT GAAAAGGTCA AAAGCGACCT CAACACAGCC AATGAAAAGG TCGAGCAGGC CGAACGGCGC CTGTGGCACT CTATTCAGGC CTTTCAGGAC GGTTTTGCCT TCTTTGACAG TGACAGCAAG TTGATCGGCG CCAACACTGC CTATCTCAAT ATCTTTGAGG GGGTAGAGGA GGTATCGCCG GGCGTCTCCT ATGTGCGCAT CCTGCAAATC CTGACCGATG AGGGGCTGAT CGACACCGAG GAGCTCTCGG CGGATTCGTG GCGCGCGATG ATGACGGAGC GCTGGATGTC GCCCGCGCCA GAACCAACGG TGGTGCGGCT GTGGAATGAT CGCTACCTGA AGCTGATCGA CCAACGCGGC CATGACGGTG ACATCGTGAG TCTCGCCCTC GATATCACCG CAACTGTCCA CTACGAGGAA GAGCTTCGCA GCGCCCGAGA GCGCGCCGAG GCTGCGAATC GCGCGAAATC CGCCTTTCTG GCCAATATGA GCCACGAGAT CCGTACGCCA ATGAACGGTG TCGTCGGCAT GGCCGAGCTC CTGAGCGACA CCACGCTGAC CGAAGAGCAG CGCCTTTATG CCAATACAAT CAAGAACTCG GGCGAGGCGC TTCTGGTCAT CATCAACGAC GTTCTCGACT ATTCCAAGAT CGAGGCGGAC AAATTGGCGC TGCATCCCGA GGCCTTTGAT CTTGAAGCCT GCATGCATGA GGTCGTCATG CTGCTGCAGG CCAATGCCCG CGACAAAGGG CTGACGCTGC TGGTTGACTA CGACCTCTTT ATGCCGACGA ATTTCATTGG CGACCCAGGG CGCATCCGCC AAATCCTCAC CAATCTTATC GGCAATGCGA TCAAATTCAC CAACGAAGGG CATGTTACGG CTCGCGTGAC CGGGGTGCCG CATCCCGAAG ACAATACCGT GATGCTGCAT ATCTCGATCG AGGACAGCGG CATCGGCATT CCTGACGACA AGATTCAGCA CATCTTTGGC GAATTCAATC AGGTCGAGGA TGAAACCAAC CGGCAGTTTG AGGGCACGGG GCTTGGTCTC GCGATCACCG AACGCCTGAT CAAGTTGATG GACGGCGAGA TCTGGGTCGA AAGCCAGTTG GGCAAAGGCT CGTGTTTTGG CTTCCGCCTG CCGCTTGTGG TGGCGGATGG CTCACAGGTG AATGCGCCCG AACTGCCCGA AAACCTGAAC TGCGTGATGA TCGTCGATGA TATGGATATC AACCGCGAGA TCCTGAACCG TCAACTTCAG CGCCTGGACC TCAAGGTGGT GGCCGTAACC AGCGGTGCCG AGGCACTCGA CGTTCTCAAT GCCGAAATCG ATTTGGTCAT CACCGATCGC AACCTACCGG GGATGGACGG GCTGAAACTG GCGCGCAGCC TGCACCATGC TTCGCCACAG ACGCCGGTAC TGCTGCTTTC TTCTGATCCT GCTGATGGGA GCGACCCTGC CCTCAGCGAT CTCTTTGCTG GCATTCTGCA GAAACCGATT CCTCGCGGCG CCCTGTTCCG AGCCCTCTCC AAACTTGATG CGCCATCTGT ACATCCCGCC CGACCGCAAG ATGTGCCACC GGCTGCTCCC GCCATCGAGC CGCCTTCAGA CGCCCCCGTG CAAGTGCCTG CACCCGAAGC GGGCGTGGAC TTCGGATCCG GCGCGACGAT CGCTTCGGAT CTGCGCCGGA TGCGGGTTCT GGCGGCGGAG GACAACCGCA CCAACCAGCT GGTCTTCCGC AAGATGGTCA AGGATATGAA CATCGACTTG CGCTTTGCAG GCAACGGGTT GGAGGCGGTT GAGCTCTACC AGAGTTTTAA ACCGGATATG ATTTTCATGG ATATTTCCAT GCCCAAAATG GATGGCAAAG AGGCCACGCA GGCCATTCGC GAGCTTGAAA AAGAAAACGG TCGCAGAGTG CAGATAGTGG CCTTGACGGC GCATGCGATG GATGGCGATT CCGAGGGCAT ACTGGCCGCA GGCCTTGACG ACTACCTCAC CAAGCCCCTG CGGAAATCGG TGATCCACGA CCGGATCATG AAAAACATGC CCACGGCCGT GGCTCCGCTG CAAGGCGAAG GCGATGATCT GCAAGCCACA GGCTGA
|
Protein sequence | MSLAEKLAQE RRARLAAERL LELKQAELTD ANRKLGRHAL ALTKRIGATQ AEIATFKNEN EKVKSDLNTA NEKVEQAERR LWHSIQAFQD GFAFFDSDSK LIGANTAYLN IFEGVEEVSP GVSYVRILQI LTDEGLIDTE ELSADSWRAM MTERWMSPAP EPTVVRLWND RYLKLIDQRG HDGDIVSLAL DITATVHYEE ELRSARERAE AANRAKSAFL ANMSHEIRTP MNGVVGMAEL LSDTTLTEEQ RLYANTIKNS GEALLVIIND VLDYSKIEAD KLALHPEAFD LEACMHEVVM LLQANARDKG LTLLVDYDLF MPTNFIGDPG RIRQILTNLI GNAIKFTNEG HVTARVTGVP HPEDNTVMLH ISIEDSGIGI PDDKIQHIFG EFNQVEDETN RQFEGTGLGL AITERLIKLM DGEIWVESQL GKGSCFGFRL PLVVADGSQV NAPELPENLN CVMIVDDMDI NREILNRQLQ RLDLKVVAVT SGAEALDVLN AEIDLVITDR NLPGMDGLKL ARSLHHASPQ TPVLLLSSDP ADGSDPALSD LFAGILQKPI PRGALFRALS KLDAPSVHPA RPQDVPPAAP AIEPPSDAPV QVPAPEAGVD FGSGATIASD LRRMRVLAAE DNRTNQLVFR KMVKDMNIDL RFAGNGLEAV ELYQSFKPDM IFMDISMPKM DGKEATQAIR ELEKENGRRV QIVALTAHAM DGDSEGILAA GLDDYLTKPL RKSVIHDRIM KNMPTAVAPL QGEGDDLQAT G
|
| |