Gene BCG9842_B4541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4541 
Symbol 
ID7181711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp736484 
End bp737503 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content36% 
IMG OID643548529 
Productthiamine/molybdopterin biosynthesis ThiF/MoeB-like protein 
Protein accessionYP_002444200 
Protein GI218895789 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.43862 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATAATC GATATTCTCG CCAAGAACTA TTTTCTCCAA TTGGAGAAGA AGGACAGCAA 
AAGATAAGAG AAAAGCATGT GCTTATTATC GGTGCAGGTG CATTAGGTAG TGCAAATGCA
GAGATGTTTG TAAGAGCAGG TGTTGGCAAG ATAACAATTG TTGACCGTGA TTATGTAGAT
TGGAGTAATT TACAAAGACA ACAATTGTAT GCAGAGAGCG ACGTAAAGAA TAATCTTCCA
AAAGCTATAG CGGCTAAAAA ACGTTTAGAA GAGATCAATA GTGATGTAAC AATAGAAGCT
CTCGTTCAAG ATGTAACAGC TGAAGAGCTA GAAGAACTTG TTACAAATGT TGATGTAATT
ATTGATGCGA CTGATAATTT TGAAACACGC TTTATTGTGA ATGATATAGC ACAAAAATAT
TCTATTCCAT GGATTTACGG TGCATGTGTA GGTAGTTACG GTCTTTCTTA CACAATCCTT
CCTAGTAAAA CACCATGTTT ATCATGTTTA TTACAGTCGA TTCCGCTTGG CGGGGCAACA
TGTGATACAG CGGGTATTAT ATCACCTGCT GTATCTCTCG TCGTTTCTCA TCAAGTAACA
GAAGCTCTTA AACTGTTAGT AGAAGATTAC GAATCACTGC GAGATGGACT TGTGTCATTT
GATGTATGGA AGAATGAATA TTCATGTATG AATGTGCAAA AGCTTCGTAA ACACAATTGT
CCTTCGTGCG GAGAGAATGC GATATACCCG TATTTAAATA AAGAAAACAC ATCGAAAACA
GCAGTTTTAT GCGGGCGAAA TACAGTTCAA ATTAGACCAC CTCATAAAGA GGAAATGGAT
TTTGAGAGGT ACAAAAAGCT GCTGGATGAT CGTGTAAATG ATTTAAATGT AAATCCATAT
TTATTATCAT TTTCTGTGGA AGAAAAGAGA TTAGTTGCTT TTAAAGATGG TCGTGTACTC
GTACATGGAA CGAAAGATAT AAGCGAAGCA AAAACGATTT ATCATCGCTA TTTTGGATAG
 
Protein sequence
MNNRYSRQEL FSPIGEEGQQ KIREKHVLII GAGALGSANA EMFVRAGVGK ITIVDRDYVD 
WSNLQRQQLY AESDVKNNLP KAIAAKKRLE EINSDVTIEA LVQDVTAEEL EELVTNVDVI
IDATDNFETR FIVNDIAQKY SIPWIYGACV GSYGLSYTIL PSKTPCLSCL LQSIPLGGAT
CDTAGIISPA VSLVVSHQVT EALKLLVEDY ESLRDGLVSF DVWKNEYSCM NVQKLRKHNC
PSCGENAIYP YLNKENTSKT AVLCGRNTVQ IRPPHKEEMD FERYKKLLDD RVNDLNVNPY
LLSFSVEEKR LVAFKDGRVL VHGTKDISEA KTIYHRYFG