Gene SeD_A4570 details

Gene Information

Locus tag: SeD_A4570
Symbol: thiC
ID: 6874685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4410927
End bp: 4412822
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 59%
IMG OID: 642787478
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002218080
Protein GI: 198244147
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
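
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4410927
END_BP = 4412822
GENE_LENGTH_BP = 1896
PROTEIN_LENGTH_AA = 631


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold: 4412822 - 4410927 + 1 = 1896 bp, and 1896 / 3 - 1 = 631 aa.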


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.0133622
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT 
ACGCTGGAAG GCACCGCGTT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT
GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCCA CGCTCATCGG CGGCAGTAAA
GACAACCCGC AGTTTGAAGA GAACGAAGCC GTACCGGTAT ACGACACCTC CGGCCCCTAT
GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CGAAACTGCG CCAGCCATGG
ATTGACGCAC GTAACGATAG CGAAGAATTA GACGACCGTA GCTCGGCTTA TACCAGAGAA
CGTCTGGCCG ACGATGGCCT GGACGATCTG CGTTTTACCG GCCTACTGAC GCCAAAACGC
GCTAAAGCGG GCAAGCGCGT CACCCAGTTA CACTACGCCC GCCAGGGGAT CGTCACTCCC
GAAATGGAGT TCATCGCCAT CCGTGAAAAT ATGGGCCGTG AGCGCATTCG CAGCGAAGTG
CTGCGCCACC AGCATCCGGG GATGAACTTT GGCGCGCGCC TGCCGGAAAA CATCACCCCG
GAATTCGTGC GTGATGAAGT CGCCGCGGGC CGCGCGATTA TTCCCGCCAA CATCAACCAC
CCGGAATCGG AGCCGATGAT TATCGGCCGC AACTTCCTGG TGAAGGTCAA CGCTAATATC
GGTAACTCGG CGGTCACCTC CTCCATCGAA GAAGAGGTGG AAAAACTGGT GTGGTCAACC
CGCTGGGGCG CGGATACGGT TATGGACCTC TCCACCGGCC GCTATATCCA CGAAACCCGC
GAATGGATCC TGCGTAACAG CCCGGTACCG ATCGGCACCG TCCCGATCTA CCAGGCGCTG
GAAAAGGTCA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG
GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATCCACG CCGGCGTGCT GCTGCGCTAC
GTGCCGATGA CCGCCAAACG CCTGACCGGT ATTGTCTCGC GCGGCGGTTC GATCATGGCG
AAGTGGTGTC TCTCCCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATCTGC
GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGTTGGGCG ACGGCCTGCG CCCCGGCTCC
ATTCAGGACG CCAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAGCTGACC
AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGTCA TGTACCCATG
CATATGATTC AGCGCAACAT GACCGAAGAG CTGGAGAGCT GCCATGAAGC ACCGTTCTAC
ACCTTAGGGC CATTGACTAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGGATC
GGTGCCGCGA TGATCGGCTG GTTTGGCTGC GCGATGCTGT GTTATGTGAC GCCGAAAGAG
CATCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TAATCACCTA CAAAATCGCC
GCCCACGCCG CGGATTTAGC CAAAGGACAT CCGGGCGCGC AGATCCGCGA TAACGCCATG
TCGAAAGCGC GCTTCGAATT CCGCTGGGAA GATCAGTTTA ACCTCGCGCT CGACCCGTTC
ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAGGAGT CCGGCAAAGT CGCCCACTTC
TGTTCCATGT GCGGGCCGAA ATTCTGCTCG ATGAAAATCA GCCAGGGGGT CCGCGACTAC
GCCGCCGCGC AAGCCATTGA AGTCGGCATG GCGGATATGT CGGAGAACTT CCGCGCCAAA
GGCGGCGAAA TTTATCTCAA ACGGGAGGAA GCCTGA
 
Protein sequence
MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK 
DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IDARNDSEEL DDRSSAYTRE
RLADDGLDDL RFTGLLTPKR AKAGKRVTQL HYARQGIVTP EMEFIAIREN MGRERIRSEV
LRHQHPGMNF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT
KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LESCHEAPFY TLGPLTTDIA PGYDHFTSGI
GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQGVRDY
AAAQAIEVGM ADMSENFRAK GGEIYLKREE A
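
The relationship between the two sequences can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: it uses the codon assignments of translation table 11 (which match the standard code; table 11 differs only in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence above.
print(translate("ATGTCTACAA CAACGTTAAC CCGCCGCGAG"))            # MSTTTLTRRE
print(translate("GGCGGCGAAA TTTATCTCAA ACGGGAGGAA GCCTGA"))     # GGEIYLKREEA
```

The two outputs match the first ten and last eleven residues of the protein sequence listed above. The last block can be translated in frame 0 because the 1860 bases preceding it are a multiple of 3.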