Gene SbBS512_E4485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4485 
SymbolthiC 
ID6270709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4195060 
End bp4196955 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content56% 
IMG OID641728278 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001882680 
Protein GI187730178 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCAA CAAAACTGAC CCGCCGCGAA CAACGCGCCC GGGCCCAACA TTTTATCGAC 
ACCCTGGAAG GCACCGCCTT TCCCAACTCA AAACGCATTT ACCTCACTGG CACACACTCT
GGCGTTCGCG TGCCGATGCG TGAGATCCAG CTTAGCCCGA CGCTAATCGG CGGCAGCAAA
GAACAGCCGC AGTTTGAAGA AAACGAAGCG ATTCCGGTCT ACGACACCTC CGGCCCATAT
GGCGATCCGC AGATCGCCAT TAACGTGCAG CAAGGGCTGG CAAAACTACG CCAGCCGTGG
ATCGATGCGC GCGGCGATAC CGAAGAACTT ACCGTGCGCA GTTCCGATTA CACTAAAGCG
CGGCTGGCAG ATGATGGCCT CGACGAACTG CGTTTTAGCG GCGTACTAAC ACCAAAACGC
GCCAAAGCAG GACGCCGTGT CACCCAACTG CACTACGCCC GCCAGGGCAT CATCACACCG
GAAATGGAAT TCATCGCCAT CCGCGAGAAT ATGGGCCGCG AGCGCATACG TAGCGAAGTT
TTACGCCACC AGCATCCGGG AATGAGCTTT GGCGCACGTC TGCCGGAAAA TATCACTGCG
GAATTTGTCC GTGATGAAGT TGCTGCCGGA CGTGCGATTA TCCCGGCCAA CATTAATCAT
CCGGAATCGG AGCCGATGAT TATTGGCCGT AACTTCCTGG TAAAAGTTAA CGCCAATATC
GGCAACTCGG CGGTCACCTC TTCCATCGAA GAAGAAGTGG AAAAGCTGGT ATGGTCCATG
CGCTGGGGAG CGGATACGGT GATGGATCTC TCCACCGGTC GCTATATTCA CGAAACCCGC
GAGTGGATTT TGCGTAACAG TCCGGTGCCG ATCGGTACAG TGCCGATCTA CCAGGCGCTG
GAGAAGGTTA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG
GAACAGGCCG AGCAAGGTGT GGATTACTTC ACTATCCATG CGGGCGTACT GTTGCGCTAT
GTGCCGATGA CCGCGAAACG CCTGACCGGT ATCGTCTCTC GCGGCGGTTC GATTATGGCG
AAATGGTGCC TCTCCCATCA TCAGGAAAAT TTCCTCTATC AACACTTCCG CGAAATTTGT
GAAATCTGTG CCGCTTATGA CGTTTCGCTG TCGCTGGGCG ACGGTCTGCG CCCCGGTTCT
ATTCAGGACG CCAACGATGA AGCGCAGTTT GCCGAGCTGC ATACGCTGGG CGAACTGACC
AAAATTGCCT GGGAATATGA CGTGCAGGTG ATGATTGAAG GCCCAGGCCA CGTGCCGATG
CAGATGATCC GCCGCAATAT GACCGAGGAG TTAGAGCACT GCCACGAAGC GCCGTTTTAC
ACTCTGGGGC CGCTAACTAC CGATATTGCG CCGGGCTATG ACCACTTCAC GTCGGGGATT
GGTGCGTCGA TGATTGGCTG GTTTGGCTGC GCGATGCTCT GTTACGTAAC GCCAAAAGAG
CATCTGGGTC TGCCTAATAA AGAAGATGTT AAGCAGGGGC TTATCACCTA TAAGATTGCC
GCCCACGCCG CTGACCTGGC GAAAGGGCAT CCGGGCGCGC AAATTCGCGA TAACGCCATG
TCGAAAGCCC GCTTCGAATT TCGCTGGGAA GACCAGTTTA ATCTGGCCCT CGACCCGTTT
ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAAGAGT CAGGTAAAGT CGCCCATTTT
TGCTCCATGT GTGGGCCGAA ATTCTGCTCG ATGAAAATCA GCCAGGAAGT GCGTGATTAC
GCCGCCACGC AAACTATTGA AATGGGAATG GCGGATATGT CGGAGAACTT CCGTGCCAGA
GGCGGAGAAA TCTACCTGCG TAAGGAGGAA GCGTGA
 
Protein sequence
MSATKLTRRE QRARAQHFID TLEGTAFPNS KRIYLTGTHS GVRVPMREIQ LSPTLIGGSK 
EQPQFEENEA IPVYDTSGPY GDPQIAINVQ QGLAKLRQPW IDARGDTEEL TVRSSDYTKA
RLADDGLDEL RFSGVLTPKR AKAGRRVTQL HYARQGIITP EMEFIAIREN MGRERIRSEV
LRHQHPGMSF GARLPENITA EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWSM RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHQEN FLYQHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF AELHTLGELT
KIAWEYDVQV MIEGPGHVPM QMIRRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI
GASMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY
AATQTIEMGM ADMSENFRAR GGEIYLRKEE A