Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1018 |
Symbol | |
ID | 5134525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 996931 |
End bp | 998877 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640531340 |
Product | sensory box sensor histidine kinase |
Protein accession | YP_001215854 |
Protein GI | 147672043 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACC GTGACGCAGA AGAAATGGAC AGTAACCCGA TGTTCAGCCG TATTGGCCGA CGCATCATTC TCATCATGGT GATACTCAGT GGTGCCGTGA CTTTAGCCAT GACGATTACC CAAACCTTCA TCGACTACAA CCGTGAGTTC AATAATGTCC AAGCTAGGCA TGATGAAATC CAAACCATTC ACGCCGAGCT TCTCGCCAGC TCACTGTGGA ATTATGACTT AGTCGTGCTT ACCCAAAGGC TGGAAGGCTT GGTCAACTTA CCCAATGTCG ATTATATGAA GATCACCTCT GGCGACTACC ATTTCTCAGC CGGAGAACCC GTAACCAGCA TGGCATTAAA TAGCGAAATA GCACTGGAAT ACACCAACCC AGATACTCAG GTGACTGAGA ATATCGGCAC CTTATATGTC GAATCGGACG CCCAAGGTAT TTATAACTAT CTGATTCGCC AGTTTCTGCT CACCTTAGCC GTCAATGCGC TGAAAACCGC CATTGTGTGC TATTTAATCT TGCTGATTTT CCACGCCAGT GTGAATCAGC GGATTTTTGC GATTGCGCAA TTTCTACGTC GCTACAATCC CCGCCACCCT AAAAAACCAC TACAACTGCC TTATAACCCT TGGATTATGG AGAAAAATGA TGAGTTACAA TGGCTGGGAG ATGAAACTAA TCGGATTGCT AACAACGTAA CAACCCTTTA CCGCACCATC AAATCGGAGC AAGAACGGTT GGAAGATTTT GCACAAGCCG CCTCCGATTG GTTATGGGAA ACCAACTGCC ATGGCGAGCT GATTTATAGC TCAGAAGCCA TGTCTACTGC GTTAGCGATT GAGGAAGATT CCAAACCACT CATAGTAAGT ATTGCTCCGC TTCAATCCTC AACCGCGCTC ATGAACTGCT TACTCAAACA GCAAGATTTC TCAAATTGTG AAGTGGAATT GACACTCAGT GATGGCACTC AAGCCTATTT ACTGTTTCAA GGCATTGCTC GCTATGCCGA TGAGCAATTT CTCGGATTTC GTGGTACTGC AATCAACATT ACCTCGCTCA AACTGGCTCA ATTGAGTTTA GAAATCATGA ACCAAGATCT GGAGCAGCAA GTCGCGAATC GAACGCAAGA TTTAGCACTC AGTTTAACTC GTTTGCAAGA AACCCAAACC CAACTGATTG AATCTGAAAA GCTCGCCGCT CTCGGTGGCT TAGTGGCAGG CGTCGCACAC GAAGTGAATA CGCCGTTAGG TATTGCGGTG ACAGCCACTT CTGTGATTCA GGAAACACGA GAAAGCTTGC TCAACGCCTT TAATCAGCAA ACCCTCACCA GCCAACAGTT TGCAGAATTG ATGGAGAGGA TGACTCAAAG CACCCTGATG CTAGAAACCA ATCTTAACCG TGCCGCACGA CTGGTTCGAG ATTTTAAGCA GACCGCCGTC GACCAAGTTT CGGAAAGCCG TAGCCAATTT CACGTAAAAC AAGTCCTCGA CGCGCTGATG GCCAGCTTAC ACAGTGAAAC CCGAAAAATT CCGGTGACTC CGCAACTGCA TGGGGAGGAT TCTGTGATGA TGAACAGCTT ACCTGGTGTA CTGACACAGA TTATGACTAA CTTGGTCATG AACAGTGTGA ATCACGCTTT CGCAGAGACT GCTCAGCCAG AGATTGATAT CCACTTCTAT CAAAAAGATC AGCAGATCAT GATTGAATAT CGAGACAATG GATGCGGCGT AGCAAAAGAA CTGCATCAAA AAATCTTTGA ACCATTTTTT ACCACTAAGC GAGGTCAAGG TGGCTCAGGA TTAGGATTAA ATCTGGTGTT TAATTTGGTT AAGCAAAAGC TGCATGGCCA ACTGGCGTTT TCTTCCGAAC CGGGGCACGG CGTGCATTAC GTGATCACAT TACCCCAAGC GCTATCGATG CCTCAAGTAG CCGACTGTGC GACCTAG
|
Protein sequence | MKNRDAEEMD SNPMFSRIGR RIILIMVILS GAVTLAMTIT QTFIDYNREF NNVQARHDEI QTIHAELLAS SLWNYDLVVL TQRLEGLVNL PNVDYMKITS GDYHFSAGEP VTSMALNSEI ALEYTNPDTQ VTENIGTLYV ESDAQGIYNY LIRQFLLTLA VNALKTAIVC YLILLIFHAS VNQRIFAIAQ FLRRYNPRHP KKPLQLPYNP WIMEKNDELQ WLGDETNRIA NNVTTLYRTI KSEQERLEDF AQAASDWLWE TNCHGELIYS SEAMSTALAI EEDSKPLIVS IAPLQSSTAL MNCLLKQQDF SNCEVELTLS DGTQAYLLFQ GIARYADEQF LGFRGTAINI TSLKLAQLSL EIMNQDLEQQ VANRTQDLAL SLTRLQETQT QLIESEKLAA LGGLVAGVAH EVNTPLGIAV TATSVIQETR ESLLNAFNQQ TLTSQQFAEL MERMTQSTLM LETNLNRAAR LVRDFKQTAV DQVSESRSQF HVKQVLDALM ASLHSETRKI PVTPQLHGED SVMMNSLPGV LTQIMTNLVM NSVNHAFAET AQPEIDIHFY QKDQQIMIEY RDNGCGVAKE LHQKIFEPFF TTKRGQGGSG LGLNLVFNLV KQKLHGQLAF SSEPGHGVHY VITLPQALSM PQVADCAT
|
| |