Gene BAS5049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5049 
Symbol 
ID2853025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4922039 
End bp4923427 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content36% 
IMG OID637508304 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_031288 
Protein GI49188035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAGGG AGAATTCTAT GTATTCGTAT ATTTTCTATT GTATATTCCT GTGCGAAAAG 
GAAATGAAAA TTACGATAAT AGGTACCGGA TATGTTGGAT TGATTACTGG GGTAGGATTG
GCAGTATTGG GACATTCGGT TACATGTTTT GATATAAATG ATGAGAAAAT TGAACGGATA
AAACAAGGGG ATCTGCCTAT TTATGAAGCT GGCTTATATG AATTAATACA TGATGCGTGT
GAGAATAATC GTTTAACTTT TACAACAAGT AAAGAGGAAG CATTTAAAGA TGCAGAATTT
ATATTTATTG CAGTTGGAAC GCCATCGTTA TTAGATGGGA CAGCGGATTT AACATATATT
CAAAACGCAT GTGTTGATAT TGGGACATAT GCGACTAAAG ACATTATTGT TGTTACAAAA
AGTACTGTTC CTGTAGGGAC GAATGGTGCT ATGAGGGGAT GGATTGAGGA GACATTACAA
AACAGACATG AATTACATAT TGTATCGAAT CCTGAATTTT TACGGGAGGG TTCAGGGATT
TACGATTTTT TTCAGGGAGA TCGTATTGTA ATTGGGGCCG ATAATGAGGA AGCGGCACGA
AAAGTGGAGA ATTTATATAG TGAATTACAC TTAAAAACAT ATGTTACAGA TATAAAAAGT
GCAGAGATGA TTAAATATGC ATCGAATGCT TTTTTAGCGA CAAAAATTAG TTTTATCAAT
GAAATTTCAA ATATATGTGA GAAGGTAGGA GCAAATGTAT TAGACGTCGC AAAAGGGATG
GGAATGGATA AAAGAATTGG AGCGTCTTTT TTAAATGCGG GCATTGGTTA CGGGGGATCG
TGTTTTCCGA AAGATACGAA AGCGCTTGTG CAAATTGCAG GGAATGTGGC ACATGATTTT
CGCTTGTTAA AAGCCGTCAT TGAAGTGAAT AATAAACAAC AGTTATTATT GATTGAGAAA
GCGAAGAAAG TAATAAATAT GAACAAGAAG CGGATTGCGG TGCTGGGAGC GGCATTTAAA
CCGAATACAG ATGATATAAG AGATGCACCG TCCCTTATTA TGATACAAGA GTTAATAAAT
CTAGGTGCGG ACATAGTTCT ATATGACCCG AAGGCGATTC AAAATATGAA AAATATCTTT
GGGGAGACTA TACAATATAG TGAATGTATA GATGAATCGA TTAGAGGAGC GAGTGCGGCT
TTTATCATGA CAGAATGGGA AGATATCCGG ACTTATCCGT TAGAAAATTA CGTACAACTT
ATGAGAGAAC CTATTTTATT CGATGGAAGA AATTGCTATA CTAATGAAGA TGTCAAGAAG
CAAGGAATAG ATTATTATTC TGTTGGAAGA GAAAGTATTT ATAAGAGGGA TTTTTCCGTT
ATTCGTTAA
 
Protein sequence
MHRENSMYSY IFYCIFLCEK EMKITIIGTG YVGLITGVGL AVLGHSVTCF DINDEKIERI 
KQGDLPIYEA GLYELIHDAC ENNRLTFTTS KEEAFKDAEF IFIAVGTPSL LDGTADLTYI
QNACVDIGTY ATKDIIVVTK STVPVGTNGA MRGWIEETLQ NRHELHIVSN PEFLREGSGI
YDFFQGDRIV IGADNEEAAR KVENLYSELH LKTYVTDIKS AEMIKYASNA FLATKISFIN
EISNICEKVG ANVLDVAKGM GMDKRIGASF LNAGIGYGGS CFPKDTKALV QIAGNVAHDF
RLLKAVIEVN NKQQLLLIEK AKKVINMNKK RIAVLGAAFK PNTDDIRDAP SLIMIQELIN
LGADIVLYDP KAIQNMKNIF GETIQYSECI DESIRGASAA FIMTEWEDIR TYPLENYVQL
MREPILFDGR NCYTNEDVKK QGIDYYSVGR ESIYKRDFSV IR