Gene SeSA_A4374 details

Gene Information

Locus tag: SeSA_A4374
Symbol: thiC
ID: 6515514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 4246493
End bp: 4248388
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 59%
IMG OID: 642749326
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002117065
Protein GI: 194734581
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
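The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4246493
END_BP = 4248388


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases):
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1896
print(protein_length(length))  # 631
```

Both results agree with the Gene Length and Protein Length fields in the record.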


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.701948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT 
ACGCTGGAAG GCACCGCATT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT
GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCAA CGCTCATCGG CGGCAGCAAA
GACAACCCGC AGTTTGAAGA GAACGAAGCC GTGCCGGTGT ACGACACCTC CGGCCCCTAT
GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CGAAACTGCG CCAGCCATGG
ATTGACGCAC GTAACGATAG CGAAGAATTA GACGACCGTA GCTCGGCTTA TACCAGAGAA
CGTCTGGCCG ACGATGGCCT GGACGATCTG CGTTTTACCG GCCTGCTGAC GCCAAAACGC
GCTAAAGCGG GCAAGCGCGT CACCCAGTTA CACTACGCCC GCAACGGGAT CGTCACTCCC
GAAATGGAGT TCATCGCCAT CCGTGAAAAT ATGGGCCGCG AGCGCATTCG CAGTGAAGTA
CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCACGCC TGCCGGAAAA CATCACCCCG
GAATTCGTGC GTGATGAAGT CGCCGCCGGA CGCGCCATCA TCCCCGCCAA CATCAACCAC
CCGGAATCGG AGCCGATGAT TATTGGCCGC AACTTCCTGG TGAAGGTCAA CGCCAATATC
GGCAACTCGG CGGTGACCTC CTCTATCGAA GAAGAGGTGG AAAAACTGGT GTGGGCGACG
CGCTGGGGGG CGGATACGGT AATGGACCTT TCCACCGGGC GCTATATTCA CGAAACCCGT
GAGTGGATCC TGCGTAATAG CCCAGTACCA ATCGGCACGG TGCCGATCTA CCAGGCGCTG
GAAAAGGTCA ACGGGATCGC CGAGGATCTT ACCTGGGAAG CCTTCCGCGA CACGCTGCTG
GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATTCACG CGGGCGTGCT GCTGCGTTAC
GTGCCGATGA CCGCCAAGCG CCTGACCGGC ATTGTCTCGC GCGGCGGTTC GATCATGGCG
AAGTGGTGTC TCTCCCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATTTGC
GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGCTGGGCG ACGGCCTGCG ACCGGGATCG
ATTCAGGACG CGAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAGCTGACC
AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGCCA CGTGCCGATG
CATATGATTC AGCGCAACAT GACCGAAGAG CTGGAGCACT GCCATGAAGC GCCGTTCTAC
ACCTTAGGGC CGCTGACTAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGCATT
GGCGCGGCGA TGATCGGCTG GTTCGGCTGT GCGATGCTGT GTTACGTGAC GCCGAAAGAG
CATCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TGATCACCTA CAAGATCGCC
GCCCACGCCG CTGACTTAGC CAAAGGGCAT CCGGGCGCGC AGATTCGCGA TAACGCGATG
TCGAAAGCGC GCTTTGAATT CCGCTGGGAG GATCAGTTTA ACCTCGCGCT CGACCCGTTC
ACCGCCCGCG CCTGGCACGA TGAAACCCTG CCACAGGAGT CCGGCAAAGT GGCGCACTTT
TGCTCGATGT GCGGGCCGAA ATTCTGCTCA ATGAAAATCA GCCAGGAAGT TCGTGATTAC
GCCGCCGCGC AAACTATTGA AGTCGGGATG GCGGATATGT CGGAAAACTT CCGCGCCAAA
GGCGGCGAAA TCTATCTCAA GCGGGAGGAA GTCTGA
 
Protein sequence
MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK 
DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IDARNDSEEL DDRSSAYTRE
RLADDGLDDL RFTGLLTPKR AKAGKRVTQL HYARNGIVTP EMEFIAIREN MGRERIRSEV
LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWAT RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT
KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI
GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAWHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY
AAAQTIEVGM ADMSENFRAK GGEIYLKREE V
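The protein sequence is the codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence only. It assumes the sequence is read in frame from its first base and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT"
last_line = "GGCGGCGAAA TCTATCTCAA GCGGGAGGAA GTCTGA"

print(translate(first_line))  # MSTTTLTRREQRAKAQHFID
print(translate(last_line))   # GGEIYLKREEV*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 11 residues followed by the TGA stop codon.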