Gene VC0395_A1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1856 
SymbolthiL 
ID5135771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1974586 
End bp1975590 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content50% 
IMG OID640533313 
Productthiamine monophosphate kinase 
Protein accessionYP_001217780 
Protein GI147673943 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0710389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT TCAACCAAAG AGTCTGGATG ATGTTTGGTG AATTTAATTT AATCGATAAA 
TACTTTTCCA ACCGACAAGC ACAACGCAAA GACGTTCACT TAGCATTGGG TGATGACTGT
GCCATTGTCA AAGTTCCTGA AAATTCGCGC GTGGCGATCA GCACGGATAC CTTAGTTGCT
GGAACCCATT TTCTGGCTCA GGCCAATCCG GCTTGGGTCG CGCACAAAGC CTTAGCGTCG
AACATCAGTG ATTTGGCGGC GATGGGCGCG ACTCCGGCTT GGGTTTCGCT TGCGTTGACT
CTGCCTGAGA TTGATGATGC GTGGCTAACG CCATTTTGCG ATGCTTTTTT CGAGTTGGCC
AAGTACTACA ATGTGCAGCT CATTGGTGGC GATACCACCA AAGGGCCACT CAGTATCACA
TTAACCGTGC AGGGCTTTTT ACCCAAAGAG CAGGCTATGC TGCGTAGCGG GGCGAAAGTT
GGCGATTGGC TCTATGTCAC GGGCGATCTA GGCGATAGCC AAGCCGGCTT AGACGTGATT
TTAGATCCGG AAAAGCGCCA TTTACCCTTC GCCGATATTC TTGAGCAGCG CCATTACTTG
TCGACACCAC GCATTGTGGC AGGGCAAGCC TTAGTGCATT TGGCCTCTTC GGCGATTGAT
ATTTCCGACG GCCTGATTGC CGATCTGCAA CACATTTTGC GGCGTTCCAA TGTTGGGGCG
AGTATTGATG TCAGCTTGTT GCCGCTGTCT AAAGAGTTAT TGCAGTTTGT AGACAGTGTC
ACTAGTGCAC AGCAGTATGC CTTGACCAGC GGTGAAGAGT ACGAACTCTG CTTCACCATT
CCAGAAGAAA ATCGAGGCTC GCTAGAAAAT GCGCTCGCAC ATTGCGGCAC GAAAGTGACC
TGTATTGGCC AAATTCGCCC GGCCGGTACG TTTGAGCTGC ACAATCAGGG GAAAAAGCTC
GATTGGCAAC TGGCTGGGTA TGATCATTTC AAGGTGAAAG CATGA
 
Protein sequence
MQRFNQRVWM MFGEFNLIDK YFSNRQAQRK DVHLALGDDC AIVKVPENSR VAISTDTLVA 
GTHFLAQANP AWVAHKALAS NISDLAAMGA TPAWVSLALT LPEIDDAWLT PFCDAFFELA
KYYNVQLIGG DTTKGPLSIT LTVQGFLPKE QAMLRSGAKV GDWLYVTGDL GDSQAGLDVI
LDPEKRHLPF ADILEQRHYL STPRIVAGQA LVHLASSAID ISDGLIADLQ HILRRSNVGA
SIDVSLLPLS KELLQFVDSV TSAQQYALTS GEEYELCFTI PEENRGSLEN ALAHCGTKVT
CIGQIRPAGT FELHNQGKKL DWQLAGYDHF KVKA