Gene SeHA_C4494 details


Gene Information

Locus tag: SeHA_C4494
Symbol: thiC
ID: 6492374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4375891
End bp: 4377786
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 58%
IMG OID: 642744569
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002048149
Protein GI: 194449579
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
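
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4375891
end_bp = 4377786
reported_gene_length = 1896      # bp
reported_protein_length = 631    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
```

Under these assumptions the derived gene length and protein length agree with the reported 1896 bp and 631 aa.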


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0642711
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0188033
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT 
ACGCTGGAAG GCACCGCGTT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT
GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCCA CGCTCATCGG CGGCAGTAAA
GACAACCCGC AGTTTGAAGA GAACGAAGCC GTACCGGTAT ACGACACCTC CGGCCCCTAT
GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CGAAACTGCG CCAGCCATGG
ATTGACGCAC GTAACGATAG CGAAGAATTA GACGACCGTA GCTCGGCTTA TACCAGAGAA
CGTCTGGCCG ACGATGGCCT GGACGATCTG CGCTTTACCG GCCTGCTGAC GCCAAAACGC
GCTAAAGCGG GCAAGCGCGT CACCCAGTTA CACTACGCCC GCAAGGGGAT CGTCACTCCC
GAAATGGAGT TCATCGCCAT CCGTGAAAAC ATGGGCCGCG AACGCATTCG TAGTGAAGTG
CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCGCGCC TGCCGGAAAA CATTACGCCG
GAATTCGTGC GTGATGAAGT CGCCGCGGGC CGCGCGATTA TTCCCGCCAA CATCAACCAC
CCGGAATCGG AGCCGATGAT TATCGGCCGC AACTTCCTGG TCAAAGTGAA TGCCAACATC
GGTAACTCGG CGGTGACCTC CTCTATCGAA GAAGAGGTGG AAAAACTGGT GTGGTCAACC
CGCTGGGGCG CGGATACGGT TATGGACCTC TCCACCGGCC GCTATATCCA CGAAACCCGC
GAATGGATCC TGCGTAACAG CCCGGTACCG ATCGGCACCG TCCCGATCTA CCAGGCGCTG
GAGAAGGTCA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG
GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATTCACG CGGGCGTGCT GTTGCGCTAC
GTGCCGATGA CCGCCAAACG CCTGACCGGG ATTGTCTCGC GCGGCGGTTC GATCATGGCG
AAGTGGTGCC TCTCGCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATCTGC
GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGTTAGGCG ACGGCCTGCG CCCCGGCTCC
ATTCAGGACG CCAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAGCTGACC
AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGTCA TGTACCCATG
CATATGATTC AGCGCAACAT GACCGAAGAG CTGGAGAGCT GCCATGAAGC ACCGTTCTAC
ACCTTAGGGC CATTGACCAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGCATT
GGCGCCGCGA TGATCGGCTG GTTCGGTTGT GCGATGCTGT GTTACGTCAC GCCGAAAGAG
CATCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TGATCACCTA CAAAATAGCC
GCCCACGCCG CTGACTTAGC GAAAGGCCAT CCGGGCGCGC AGATCCGCGA TAACGCCATG
TCGAAAGCGC GCTTCGAATT CCGCTGGGAA GATCAGTTTA ACCTCGCGCT CGACCCGTTC
ACCGCCCGCG CTTATCACGA TGAAACCCTA CCGCAGGAGT CCGGTAAGGT CGCTCACTTC
TGTTCCATGT GCGGGCCGAA GTTCTGCTCG ATGAAAATCA GCCAGGAAGT CCGCGACTAT
GCCGCCGCAC AAACCATCGA AGTCGGCATG GCGGATATGT CGGAAAACTT CCGCGCCAAA
GGCGGCGAAA TCTATCTCAA GCGGGAGGAA GCCTGA
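
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation such as the GC content field. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input, so the printed value describes that 60-base fragment and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above (sample input only, not the full gene).
first_row = "ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT"
fragment = clean_sequence(first_row)

print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full 1896-base sequence through the same two helpers would give the figure to compare with the reported GC content.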
 
Protein sequence
MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK 
DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IDARNDSEEL DDRSSAYTRE
RLADDGLDDL RFTGLLTPKR AKAGKRVTQL HYARKGIVTP EMEFIAIREN MGRERIRSEV
LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT
KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LESCHEAPFY TLGPLTTDIA PGYDHFTSGI
GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY
AAAQTIEVGM ADMSENFRAK GGEIYLKREE A
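
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch without external libraries; it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate DNA (display spaces allowed) up to the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT"
protein_start = "".join("MSTTTLTRRE QRAKAQHFID".split())

print(translate(gene_start) == protein_start)
```

The last nine bases of the gene, GAAGCCTGA, translate to E and A followed by the TGA stop codon, which matches the final residues of the protein sequence.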