Gene VC0395_A2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2452 
SymbolthiC 
ID5136006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2607353 
End bp2609290 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content48% 
IMG OID640533904 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001218352 
Protein GI147675426 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAACC GTAAACAAGC AAGACTGGAA GCCAAGCGCT TTATTGATAC CCTTTCTGTT 
GAACCCTATC CTAACTCTCA AAAATCTTAC CTATTAGGCT CTCGCCCTGA TATTCGGGTG
CCTGTCAGAG AAATTACTCT CAGCGATACT TTGGTCGGTG GCAGTAAAGA TGCACCCATC
TTTGAGCCCA ATGAGCCTAT CTGTGTGTAT GACACATCTG GCGTCTATAC TGACCCTTCA
CATGATATTG ATCTCTACAA GGGGCTTCCT AAGCTCAGAG AGGAGTGGAT TGAAGAGCGT
CGAGATACGC ACATTCTGCC TAGTATGAGC TCTCATTTCG CCCGTGAACG CTTAGCGGAT
GAAACTCTAG ATGAACTGCG TTATGGCCAT TTACCGCGAA TTCGCCGAGC GATGGGCCAG
CATCGAGTCA CTCAGTTACA TTACGCACGG CAGGGAATCA TTACGCCGGA AATGGAGTTT
GTGGCGATCC GTGAAAACTC TCGTCGTCTT GCTCATCAAG ATCCAAGTCT ACTTCAGCAG
CATGCTGGGC AGAATTTTGG TGCTCATTTA CCCGATCTGA TTACTCCTGA GTTTGTGCGT
CGTGAGATTG CAGAAGGGCG CGCCATCATC CCATGCAATA TTAATCATCC TGAATCTGAA
CCCATGATTA TTGGCCGCAA TTTCTTGGTT AAGGTGAATG CCAATATCGG TAACTCTTCT
GTTAGCTCTT CTATTGAAGA AGAAGTCGAG AAGTTAGTTT GGGCGACCCG CTGGGGCGCA
GATACTGTGA TGGATCTGTC GACAGGGCGA AATATCCATG AAACCCGCGA GTGGATCTTA
CGTAATAGCC CCGTGCCAAT TGGTACTGTA CCTATGTATC AGGCGCTGGA AAAAGTGAAT
GGTGTAGCAG AAAACCTCAC ATGGGAAGTG ATGCGCGATA CCTTACTAGA GCAAGCTGAG
CAAGGTGTGG ACTATTTTAC AATTCATGCG GGCTTGCTAT TGCGTTATGT GCCGATGACA
GCCAAGCGCG TAACGGGCAT CGTTTCTCGT GGCGGCTCGA TTATTGCGAA ATGGTGTCTT
TCTCACCATC AAGAGAATTT CCTTTATACC CATTTTCGCG AAATCTGTGA GATTTGTGCG
CAATATGATG TGGCTCTATC TCTAGGGGAT GGACTTCGTC CTGGCTCGAT TGCTGATGCT
AACGATGAAG CGCAATTTGC TGAGTTACGT ACCCTTGGTG AGCTAACCCA AATAGCTTGG
GAATATGATG TGCAGGTCAT GATTGAAGGG CCGGGTCATG TACCTATGCA CTTAATTAAA
GCCAATATGG ATGAGCAGCT TAAGCATTGT CATGAGGCGC CATTCTATAC CCTTGGTCCA
TTAACTACCG ATATTGCCCC GGGTTATGAT CACATTACCT CCGGTATCGG TGCGGCCATG
ATTGGTTGGT TTGGCTGCGC CATGCTCTGC TATGTCACAC CTAAAGAACA TTTGGGGCTG
CCAAACAAAG AAGATGTCAA AACAGGATTG ATTACTTATA AGTTAGCGGC TCATGCAGCA
GATTTGGCGA AAGGGCATCC GGGAGCGCAA ATTCGTGATA ATGCATTATC AAAAGCACGT
TTTGAGTTTC GTTGGGAAGA CCAGTTCAAT CTTGCGCTTG ATCCCGTCAC AGCACGCGCT
TTCCATGATG AGACCCTACC GCAGGAATCG GGCAAAGTCG CGCACTTTTG CTCTATGTGT
GGCCCCAAAT TCTGCTCGAT GAAGATCTCG CAAGAGGTCA GGGATTACGC AAATAACCAG
ACATTAGACA CCACCGTCAT TGACTTGGTT ATGCCTGCAG AATCTATACA GCTGGCGATG
CAAGATAAGT CTCGTGAGTT TTTAGCCTCA GGTGCTGAAC TCTATCATCC TTTGGTGAAA
GAGCCGATCG AGGAGTAA
 
Protein sequence
MSNRKQARLE AKRFIDTLSV EPYPNSQKSY LLGSRPDIRV PVREITLSDT LVGGSKDAPI 
FEPNEPICVY DTSGVYTDPS HDIDLYKGLP KLREEWIEER RDTHILPSMS SHFARERLAD
ETLDELRYGH LPRIRRAMGQ HRVTQLHYAR QGIITPEMEF VAIRENSRRL AHQDPSLLQQ
HAGQNFGAHL PDLITPEFVR REIAEGRAII PCNINHPESE PMIIGRNFLV KVNANIGNSS
VSSSIEEEVE KLVWATRWGA DTVMDLSTGR NIHETREWIL RNSPVPIGTV PMYQALEKVN
GVAENLTWEV MRDTLLEQAE QGVDYFTIHA GLLLRYVPMT AKRVTGIVSR GGSIIAKWCL
SHHQENFLYT HFREICEICA QYDVALSLGD GLRPGSIADA NDEAQFAELR TLGELTQIAW
EYDVQVMIEG PGHVPMHLIK ANMDEQLKHC HEAPFYTLGP LTTDIAPGYD HITSGIGAAM
IGWFGCAMLC YVTPKEHLGL PNKEDVKTGL ITYKLAAHAA DLAKGHPGAQ IRDNALSKAR
FEFRWEDQFN LALDPVTARA FHDETLPQES GKVAHFCSMC GPKFCSMKIS QEVRDYANNQ
TLDTTVIDLV MPAESIQLAM QDKSREFLAS GAELYHPLVK EPIEE