Gene Information |
Locus tag | TM1040_0406 |
Symbol | |
ID | 4078800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 416325 |
End bp | 418301 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005701 |
Product | periplasmic sensor signal transduction histidine kinase |
Protein accession | YP_612401 |
Protein GI | 99080247 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
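The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The record does not state either convention; both are inferred from the numbers. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Values taken from the record above.
length_bp = gene_length(416325, 418301)
print(length_bp)                  # 1977
print(protein_length(length_bp))  # 658
```

Under those assumptions, 418301 - 416325 + 1 gives 1977 bp, and 1977 / 3 - 1 gives 658 aa, matching the Gene Length and Protein Length fields.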
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.135462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT CTGTCCGCTT GCGCCTGCTC ATTCTGGCGC TGCTACCGCT TATCGTGTTG ATGCCGCTTT TGATGGTGGT GGCGATGGCG CGTTGGAATG CGGATTACGA TAAAATATTG ATCGAAAAGG TCGAAAGCGA TCTGCGCATC GCCGAGCAAT ATCTCGGCCA CCTTCAGAGC AACTCGGCGC GTGCGCTGCA AGGACTTGCC CAATCAGCCG AGATCGCAGA CGTCCTCACG CGCGACCCAG GTTTAACGCC AGCCTTTCTG GAGCAAAGCC AGGATCGCCT CGGACTCGAC TTTCTCTATC TTCTACAAGG CGATGAGCTG ACTGACGCCA AAGAAAAATG GCCGGTCATC TTTCAAGCCG CCGCTGGCCG CATGGTCTCG GAGATCGATA TTCTCTCGGC CAGCGATCTG GCGGCCCTTT CCCCTGCGCT GGCGGCAGAA GCCCGGATTG AGCTGATCGA GACCGAAGCC GCCGCCCCCG CCGATCACAC CGTGGAGGAT CGGGGTATGG TGGTGCACAC CGCCGCCCCA GTGCGCGTGA ATGGCGAGAT CCGGGTGTTG GTGGGGGGGA TCTTGCTCAA CCGCAACCTT GATTTCATCG ATACGCTCAA CGAACTCGTC TATCTGCAAG GACAGTACAG CGATGGGCGA CAAGGCACCG CCACCCTGTT TTTATCAGAC ACCCGCGTCT CCACAAATGT GCGCCTCTTC GAGGACGTGC GTGCGCTTGG CACGCGGGTG TCTGCTGCCG TACGTGACGC GGTGCTCGGC AAGGGGCGCA CATGGCTAGA CAGCGCCTTT GTCGTGAATG ATTGGTACAT TTCGGGGTAT CTGCCGATCA GCGACAGCTT TGGCAAACGG GTCGGCATGC TCTACGTCGG CTTCCTGGAA GCCCCCTATG AGACCGCAAA ACGCGCGGCT TATTGGACGG TGGTTGGGGC ATTTGTGCTG GTGATCGCAC TCACCGCGCC GCTGTTCTTG TGGTTGGCCA AAGGGATCTT TGCCCCACTC GAGCAGATGA CGCGCACCAT GGCGAGGGTG GAGCGCGGCG ACCTCACCGC CCGCAATCAG ACCTCGGCGC GCGGTGACGA GATCGGTCAG GTGGCCAATC ACCTCAACAC ATTGCTTGAT CAGGTCCAGG AACGCGACCG GCGGCTGCGC AACTGGGCCG GAGAGCTGAA CCTGCGCGTC GAAGAGCGCA CGCGAGAGTT GCAAGTCGCA AATGCCAAAC TCGAAGAGAC CTTTCGCCAG CTGGTGATGA GTGAAAAACT GGCCTCGATC GGGGAAATCA CCGCAGGCGT CGCCCATGAG ATCAACAATC CCGTCGCGGT GATCCAGGGC AATGTGGATG TGATGCGTAT GACCCTTGGC GACGCAGCAG AGCCGGTGCG CACCGAGCTC GATCTGATCG ACAATCAAGT GATGCGCATC GGGGCCATCG TGGGCAAACT CTTGCAGTTC GCGCGCCCGT CGGAGTTTGG CACCTTTGCC GAGCATGTGG ATGTGGCCGC CGTGGTCCGC GACTGTCTTG TGTTGGTGGA TCACGTCATC TCGAAACAAG AGGTGCGGGT CCTCACACAG TTTGACGCCG CGCCACACGT CCGCATCGAC CCGGGCGAAT TGCAACAGGT GGTGATCAAT ATCATCATCA ACGCCAGCCA GGCGATGGAG GGCAAAGGCA CGCTTGCGAT CTCTGTTGCG GGCGAGGTAC GCGATGCACA GACAGGTGTT GCCCTGCGGA TCGCCGATAC CGGCCCCGGC ATCCCCATGG AGCGGATCGA TCACATATTC 
GACCCTTTCT TTACGACAAA ACAGGGCGAA GGCACGGGGC TTGGGCTGTC CATCAGCCAG ACCCTCATTC AGCGGGCTGG CGGGCGCATC ACGGTGCGCA ACAGACGCGC GCGGGGGGCG GAGTTCCTGA TCTGGCTGCC AGAGCCAACT GACGCGGATG ACCCGGATCA GAGATAA
|
Protein sequence | MLKSVRLRLL ILALLPLIVL MPLLMVVAMA RWNADYDKIL IEKVESDLRI AEQYLGHLQS NSARALQGLA QSAEIADVLT RDPGLTPAFL EQSQDRLGLD FLYLLQGDEL TDAKEKWPVI FQAAAGRMVS EIDILSASDL AALSPALAAE ARIELIETEA AAPADHTVED RGMVVHTAAP VRVNGEIRVL VGGILLNRNL DFIDTLNELV YLQGQYSDGR QGTATLFLSD TRVSTNVRLF EDVRALGTRV SAAVRDAVLG KGRTWLDSAF VVNDWYISGY LPISDSFGKR VGMLYVGFLE APYETAKRAA YWTVVGAFVL VIALTAPLFL WLAKGIFAPL EQMTRTMARV ERGDLTARNQ TSARGDEIGQ VANHLNTLLD QVQERDRRLR NWAGELNLRV EERTRELQVA NAKLEETFRQ LVMSEKLASI GEITAGVAHE INNPVAVIQG NVDVMRMTLG DAAEPVRTEL DLIDNQVMRI GAIVGKLLQF ARPSEFGTFA EHVDVAAVVR DCLVLVDHVI SKQEVRVLTQ FDAAPHVRID PGELQQVVIN IIINASQAME GKGTLAISVA GEVRDAQTGV ALRIADTGPG IPMERIDHIF DPFFTTKQGE GTGLGLSISQ TLIQRAGGRI TVRNRRARGA EFLIWLPEPT DADDPDQR
|
| |
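The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool used to produce the record: the helper names are hypothetical, the sequence is assumed to be listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon assignments are those of the standard genetic code, which translation table 11 shares for all internal codons. Table 11 also permits alternative start codons; the sketch does not handle them, since this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence above.
print(translate("ATGCTGAAAT CTGTCCGCTT GCGCCTGCTC"))  # MLKSVRLRLL
print(translate("GACGCGGATG ACCCGGATCA GAGATAA"))     # DADDPDQR
```

The two fragments translate to the first ten and last eight residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the value that the record rounds to a whole percentage in its GC content field.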