Gene BAS5056 details


Gene Information

Locus tag: BAS5056
Symbol:
ID: 2849017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4931485
End bp: 4932810
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 38%
IMG OID: 637508311
Product: 6-phospho-beta-glucosidase
Protein accession: YP_031295
Protein GI: 49188042
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
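
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The constant and function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 4931485
END_BP = 4932810
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.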


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGGAA TTAAAATTGC TACAATCGGC GGTGGATCTA GTTATACACC AGAGTTAATT 
GAAGGATTTA TTAAACGTTA TGATGAGCTT CCTGTTCGTG AAATTTGGTT AGTAGATATT
GAGGCAGGAA AAGAAAAGTT AGAAATCGTT GGTAACTTAG CGAAACGTAT GGTGAAAAAA
TCTGGTTTAC CAATTGAGGT ACATTTAACG CTTGATCGTC GCGAAGCATT AAAAGATGCT
GACTTCGTAA CGACGCAGCT TCGAGTAGGT TTATTAGAAG CTCGCGCAAA AGATGAAGCA
ATCCCGTTAA AATATGATGT AATCGGTCAG GAAACGAATG GTCCTGGTGG TTTATTCAAA
GCGTTGAGAA CGATTCCTGT TATTTTAGAT ATTTGTAAGG ACATGGAGGA GCTTTGTCCG
AATGCATGGT TAATTAACTT TGCAAACCCA GCTGGTATGG TAACAGAAGC TGTTCTTCGT
TATACAAATA TTCAAAGAGT AGTTGGTCTA TGTAACGTTC CAATTGGAAT CCGTATGGGT
CTTGCGAGAT TACTTGAAGT AGATGCAAGT CGTGTACACG TTGACTTCGC AGGTTTAAAC
CATATGGTAT ACGGACTAGA CGTGTATTTA GATGGCGTAA GTGTAATGGA TCGTGTGTTA
GAACTTGTAA CAGATCCAGA AAAGCAAATT ACGATGGAAA ATATCGCAGC GCTTAACTGG
GAACCAGACT TTATTCGTGG CCTTCGTGCA ATTCCATGTC CATATCACCG TTACTACTAC
AAAACACGTG AAATGTTAGA AGAAGAGAAA GAAGCTTCGG TTGAAAAAGG TACACGTGCA
GAAGTAGTAA AACAATTAGA AGATGATTTA TTCGAGTTAT ATAAAGACCC GAACTTAGAT
ATTAAACCAC CACAATTAGA AAAACGTGGA GGCGCTTATT ATAGTGATGC AGCATGTAGC
TTAATTACGT CTATTTACAA TAATAAAGGT GATATCCAGC CTGTTAATAC ACGAAACAAC
GGAACGATTG CAAGCTTACC ACATGATTCT GCTGTTGAAG TGAACTGTAT TATTACGAAA
GAAGGTCCAA AACCAATTGC AGTTGGAGAT CTTCCGGTAC CAGTTCGCGG TTTAGTACAA
CAAATTAAAT CATTTGAGCG CACAACAATT GAAGCTGCTG TTACAGGGGA TTATCATAAA
GCGCTGCTTG CTATGACAAT TAATCCACTT GTACCATCAG ATAAAGTTGC AAAACAAATT
TTAGATGAAA TGTTGGAAGC GCATAAAGAA TATCTTCCGC AGTTCTTCAA AAAGGTAGAG
AAATAA
 
Protein sequence
MTGIKIATIG GGSSYTPELI EGFIKRYDEL PVREIWLVDI EAGKEKLEIV GNLAKRMVKK 
SGLPIEVHLT LDRREALKDA DFVTTQLRVG LLEARAKDEA IPLKYDVIGQ ETNGPGGLFK
ALRTIPVILD ICKDMEELCP NAWLINFANP AGMVTEAVLR YTNIQRVVGL CNVPIGIRMG
LARLLEVDAS RVHVDFAGLN HMVYGLDVYL DGVSVMDRVL ELVTDPEKQI TMENIAALNW
EPDFIRGLRA IPCPYHRYYY KTREMLEEEK EASVEKGTRA EVVKQLEDDL FELYKDPNLD
IKPPQLEKRG GAYYSDAACS LITSIYNNKG DIQPVNTRNN GTIASLPHDS AVEVNCIITK
EGPKPIAVGD LPVPVRGLVQ QIKSFERTTI EAAVTGDYHK ALLAMTINPL VPSDKVAKQI
LDEMLEAHKE YLPQFFKKVE K
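
The relationship between the two listings can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 allows, since this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and exist only for this example.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence listing.
    first_line = ("ATGACTGGAA TTAAAATTGC TACAATCGGC "
                  "GGTGGATCTA GTTATACACC AGAGTTAATT")
    print(translate(first_line))  # MTGIKIATIGGGSSYTPELI
```

The printed residues correspond to the first two blocks of the protein listing, and passing the full gene sequence to `gc_fraction` gives a value to compare with the reported GC content.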