Gene BAS1079 details

Gene Information

Locus tag: BAS1079
Symbol:
ID: 2847988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 1132237
End bp: 1133538
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 33%
IMG OID: 637504337
Product: alpha-amylase family protein
Protein accession: YP_027351
Protein GI: 49184099
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
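
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1132237, 1133538)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both values agree with the Gene Length and Protein Length fields of this record.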


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000334423
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTGG GGAAAATACA TACTAGGAAA CTATTCATTT GTTTTTGTTT AGCTGTCGTT 
TTGTTTGTAC CAATACATAC ATTTGCAGAC GAAAAAAGAG AGTGGCGAGA TGAAGTTATA
TATTCCATTA TGATTGATCG TTTCAATAAT GGGGAACCGA AAAATGACAA ACAGTTAGAA
GTTGGTAATT TAGAAGGATA TCAGGGCGGA GATATAAGAG GCATTATAAA AAGACTGGAT
TACATAAAAG AAATAGGATT TACCACTGTT ATGCTTTCGC CGCTGTTTGA AAGTGTAAAA
TACGATGGAG TAGACGTGCG CAATTTTCAG AAGGTAAATG AACATTTCGG AACAGAAAAT
GATGTAAAAG AACTTGTGCA AGAAGCTCAT ACAAAAGGAA TGAAAGTTAT ACTTCAATTT
CCGCTTGGAG AAAACGAACA ACAAGTAATC GACTCGATGA AATGGTGGGT CAAAGAAGTT
GATTTAGATG CAAGTTATGT AATGCATAGT GAAAAAAAGT CTCCTGCTTT TTGGGATGAT
GTGCAAAAAG ATATGCAAGT GATAAAAAAA GATTTTCGAG TTATGACAAA AGAAGATAGT
GAATACAACG AAAAAATAGT AGAATCGTTT TCTAAAGCGG ACGTATCGGT AAAATCTTTA
TATGATGTGA GTAAAAAAGA CGAGGAATTC ATTACATTTT TAGATAATCA AGATACAAAA
AGATTTGCTC GTATTGCAAA GGAAAATATG AATTATCCGC CATCGCGTTT GAAACTAGCT
CTTACATATT TATTGACATC ACCAGGCATT CCGAATTTTT ATTACGGGAC TGAAATTGCA
TTAGATGGAG GGGATACTCC AGATAATAGA CGATTAATGG ATTTCAAATC GGATGAAAAG
TTTATGCAGC ATATAACAAA ACTTGGTGAA CTTAGACAAA TGAGACCATC TTTACGACGC
GGTACATTTG AACTCTTATA CGATAAAAAT GGAATGAGTG TACTAAAACG AAAGTATAAA
GGTGAAGTCA CATTAGTAGC GATTAATAAT ACGAAAGAGA CGCAAAAAGT TGCTTTACCT
GCAAGTACGA TTGGTGAAAA ACAAGAGTTA AGAGGATTGT TAGAAGATGA AATTATAAGA
GAAGAAAATG GAAAGTTTTA TCTCGTTTTA AAGCGTGAAG AATCAAATGT GTATAAAGTT
AATAGAGAAA CAGGTGTGAA TTGGTTATTT ATCTCCTTAA TAGTTGGTGT GAACGTATTA
TTTATTACTT TTTTAATTGC GGTTAAAAAG AGACGGAAAT GA
 
Protein sequence
MRVGKIHTRK LFICFCLAVV LFVPIHTFAD EKREWRDEVI YSIMIDRFNN GEPKNDKQLE 
VGNLEGYQGG DIRGIIKRLD YIKEIGFTTV MLSPLFESVK YDGVDVRNFQ KVNEHFGTEN
DVKELVQEAH TKGMKVILQF PLGENEQQVI DSMKWWVKEV DLDASYVMHS EKKSPAFWDD
VQKDMQVIKK DFRVMTKEDS EYNEKIVESF SKADVSVKSL YDVSKKDEEF ITFLDNQDTK
RFARIAKENM NYPPSRLKLA LTYLLTSPGI PNFYYGTEIA LDGGDTPDNR RLMDFKSDEK
FMQHITKLGE LRQMRPSLRR GTFELLYDKN GMSVLKRKYK GEVTLVAINN TKETQKVALP
ASTIGEKQEL RGLLEDEIIR EENGKFYLVL KREESNVYKV NRETGVNWLF ISLIVGVNVL
FITFLIAVKK RRK
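
The sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch with hypothetical helper names that strips the formatting and computes a GC fraction. It is demonstrated on the first line of the gene sequence only, not on the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGTGTGG GGAAAATACA TACTAGGAAA CTATTCATTT GTTTTTGTTT AGCTGTCGTT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of these 60 bases only
```

Applying the same two helpers to the whole gene sequence would give a value that can be compared with the GC content field of the record.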