Gene VC0395_A0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0417 
SymbolthiI 
ID5135323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp443586 
End bp445133 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content48% 
IMG OID640531875 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001216372 
Protein GI147673207 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000383743 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCGTGT ATAATCGCGC CCCTTTAGCG GTAAGCGCCT TGCCAAGGTT TACCTGCTGC 
TCGCCAGATT GTCTAACTCA GATTGCGAAT AACTGTATGA AATTTATCGT TAAACCCCAT
CCGGAAATTT TTGTGAAAAG TGAATCGGTA CGTAAGCGTT TCACAAAGAT CCTTGAGAGC
AATATTCGAA TTATTGTGAA AGCCCGCACA CAAGGGGTGG CGGTATTCAA TCGTCGTGAT
CATATTGAAG TGACGTCAAA CAGCGATACT TATTACGCCG AAGTGTTGGA GATTCTGACG
ACCACACCGG GTATCCAGCA AGTGTTGGAA GTGCAGCAAT CAAGCTTTAC CGATCTGCAC
AACATCTACG AGCAAGTGCT GGAGCTAAAT CGCGCTAACC TCGAAAACAA AACCTTTGTT
GTGCGCGCGA AACGCCGTGG TAAGCATGAT TTTACCTCTA TTGAACTCGA ACGTTATGTT
GGGGGTGGCC TCAATCAAGC CATCGCCAGT GCCAAGGTAA AATTGATTAA CCCTGACGTG
ACCGTGCAAG TGGAAGTGGT CGATGAGCTG CTTAACCAAG TGATCGCGCG TCATAAAGGT
TTAGGTGGTT TCCCTCTGGG GACCCAAGAA GATGTATTGA GCCTGATTTC TGGTGGCTTC
GACTCCGGTG TGTCGAGCTA TCTGCACATT AAACGTGGTT CAAAAGTGCA TTACTGCTTC
TTTAATCTGG GGGGACCCGC CCACGAAATT GGTGTGAAGC AAACCGCTTA CTACCTGTGG
CAAAAATACG GTTCATCGGC CAAAGTGCGA TTTATCGCGA TCGATTTTGC TCCTGTGGTG
GCTGAGATCC TCGAGAAGAT CGATGATGGT CAAATGGGCG TGGTGCTCAA GCGTATGTTT
ATGCGCACCG CCGGTATGGT GGCTGAGAAG TTTGGCATTC AAGCTTTGGT TACGGGTGAA
GCGCTAGGCC AGGTTTCTAG CCAAACCCTG ACTAACCTGC GCCATATCGA TAACGTGACC
GATACTTTGA TTCTGCGTCC GCTCATCAAC TGGGATAAAG AAGACATCAT CCGTCTGGCG
CGTGAAATTG GTACGGAAGA TTTCGCCAAA ACCATGCCTG AATTCTGCGG GGTGATTTCA
AAAAGCCCAA CCGTAAAAGC GGTAAAAGAG AAATTGGAAG AAGAAGAAGC CAAATTCGAT
TTTGCTCTGC TTGATCAAGT GGTGTACAAC GCGCGCCAAA TCGACATTCG TGATATTGGT
AAAGAGTCGC TGGAAAAAGC CCCTGAAGTG GAGTTGGTTA ACAGCGCAGA AGAGGGTAAC
GCCGTCGTAC TGGATATTCG TAGCCCAGAC GAAGAGGACG AAAGCCCGCT AGAGATTGCG
GGTGTGGAAG TGAAGCATCT GCCTTTCTAT AAGCTTGCGA CTCAATTTTG TGACCTCGAT
CAGTCAAAAA CCTACTTGCT GTACTGCTCA CGTGGCGTGA TGAGCCGCTT ACAAGCGCTG
TACCTGCAAG AACAAGGGTT TAACAATGTG AAAGTTTATC GTCCATAG
 
Protein sequence
MIVYNRAPLA VSALPRFTCC SPDCLTQIAN NCMKFIVKPH PEIFVKSESV RKRFTKILES 
NIRIIVKART QGVAVFNRRD HIEVTSNSDT YYAEVLEILT TTPGIQQVLE VQQSSFTDLH
NIYEQVLELN RANLENKTFV VRAKRRGKHD FTSIELERYV GGGLNQAIAS AKVKLINPDV
TVQVEVVDEL LNQVIARHKG LGGFPLGTQE DVLSLISGGF DSGVSSYLHI KRGSKVHYCF
FNLGGPAHEI GVKQTAYYLW QKYGSSAKVR FIAIDFAPVV AEILEKIDDG QMGVVLKRMF
MRTAGMVAEK FGIQALVTGE ALGQVSSQTL TNLRHIDNVT DTLILRPLIN WDKEDIIRLA
REIGTEDFAK TMPEFCGVIS KSPTVKAVKE KLEEEEAKFD FALLDQVVYN ARQIDIRDIG
KESLEKAPEV ELVNSAEEGN AVVLDIRSPD EEDESPLEIA GVEVKHLPFY KLATQFCDLD
QSKTYLLYCS RGVMSRLQAL YLQEQGFNNV KVYRP