Gene Bcer98_3762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_3762 
Symbol 
ID5346925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp3818478 
End bp3820238 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content42% 
IMG OID640841252 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001376948 
Protein GI152977431 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAT CTGTTTCAGC TGAGCAAATT GAATTAAAGT CGAGTTTACC AGGGAGTAAG 
AAAGTGTATG TGGAGGGGTC ACGTGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAATTA
AGTGACACAA ACGGAGTGCC AAATTCCCCG ATTCGTGTAT ATGATACGAG CGGTCCCTAT
ACGGATCCAG AATATAAAGT GGAACTCGAA AAAGGGATTC CAACTCCTCG TCGCAATTGG
ATTATAGAAC GCGGAGATGT AGAGGAGTAT GAGGGACGCG AAATCAAGCC GGAAGATGAT
GGTGTAAAAG CTGCTTCGAA CCATACACCT GTGTTCCCGC AAATGGATCG TAAACCTCTT
AGAGCAAAGA AAGGTGCGAA TGTCACGCAG ATGCATTATG CTCGTAAAGG GATTATTACA
TCTGAGATGG AATATGTTGC AATTCGTGAA GGGGTAGAAC CGGAGTTTGT TCGAAAAGAG
ATTGCCGAAG GACGGGCCAT TTTACCAGCG AATATTAACC ATCCAGAAGC AGAACCAATG
ATTATCGGCC GTAATTTCCA TGTGAAAGTA AATGCGAATA TCGGAAATTC TGCTGTTTCG
TCTTCGATTG CCGAAGAAGT AGAAAAGATG ACATGGGCAA CGCGCTGGGG TGCGGATACA
ATTATGGATT TATCAACGGG GAAAAATATT CATACAACGC GCGAATGGAT TATTCGTAAT
GCCCCTGTAC CAGTTGGGAC AGTGCCGATT TACCAAGCGT TAGAAAAGGT ACAGGGGATT
GCTGAAAACT TAACTTGGGA AGTATACCGC GATACATTAA TTGAACAAGC GGAACAAGGT
GTGGATTATT TTACAATTCA CGCGGGCGTA TTATTGCGCT ACATCCCGCT CACAGCAAAA
CGCATGACGG GAATTGTTTC GCGCGGCGGA TCAATTATGG CGCAGTGGTG TCTATACCAT
CATAAAGAAA ACTTCTTGTA TACTCATTTT GAAGAAATTT GTGAGATTAT GAAACAGTAT
GATGTTTCCT TCTCTTTAGG AGATGGGTTA CGCCCAGGAT CTATTGCAGA TGCAAATGAT
GAAGCTCAGT TTGCAGAGCT TGAAACGTTA GGGGAATTAA CGAAAATTGC TTGGAAGCAT
GATGTGCAAG TCATGATTGA GGGGCCAGGG CACGTACCGA TGCATTTAAT TAAAGAAAAT
ATGGAGAAAG AAATAGATAT TTGTCAAGGT GCGCCTTTCT ATACGCTGGG GCCTTTAACA
ACAGATATCG CACCAGGTTA TGATCATATT ACTTCTGCAA TTGGTGCAGC GATGATTGGA
TGGTTTGGGA CTGCGATGCT TTGTTATGTA ACACCGAAAG AGCATTTAGG ATTGCCAAAT
AAAGATGATG TTCGAGAAGG TGTGATTACT TACAAAATTG CAGCGCATGC AGCTGACCTT
GCAAAAGGAC ATAAAACAGC GCAACAAAGA GATGATGCGT TATCAAAAGC GCGCTTTGAA
TTCCGTTGGC GCGATCAATT CAATCTATCG TTAGATCCTG AAAGAGCAAT GGAATTTCAT
GATGAAACAT TGCCAGCAGA AGGAGCGAAA ACAGCTCATT TCTGTTCGAT GTGTGGCCCG
AAATTTTGTA GTATGAAAAT TTCACATGAT ATTCGGGAGT ATGCAAAAGA AAATAATTTA
GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAAAAAG CAAAAGAATT TAAAGAGGCT
GGTAGTCACT TATATCAATA A
 
Protein sequence
MEQSVSAEQI ELKSSLPGSK KVYVEGSREG MKVPMREIEL SDTNGVPNSP IRVYDTSGPY 
TDPEYKVELE KGIPTPRRNW IIERGDVEEY EGREIKPEDD GVKAASNHTP VFPQMDRKPL
RAKKGANVTQ MHYARKGIIT SEMEYVAIRE GVEPEFVRKE IAEGRAILPA NINHPEAEPM
IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN
APVPVGTVPI YQALEKVQGI AENLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPLTAK
RMTGIVSRGG SIMAQWCLYH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND
EAQFAELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKEIDICQG APFYTLGPLT
TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVREGVIT YKIAAHAADL
AKGHKTAQQR DDALSKARFE FRWRDQFNLS LDPERAMEFH DETLPAEGAK TAHFCSMCGP
KFCSMKISHD IREYAKENNL ETTEAIEKGM KEKAKEFKEA GSHLYQ