Gene BAS3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3062 
Symbol 
ID2848293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3040292 
End bp3041461 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content38% 
IMG OID637506306 
Producttransporter 
Protein accessionYP_029319 
Protein GI49186067 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0534836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATT TTAAAAATAA AGTAATTGAT AAGGATATTT CATCAGGTTT AATCATTCTT 
TTAGCAACTG CATGTGGTAT TATTGTGGCT AATCTTTATT ATGCACAGCC TTTAATTGGG
GTAATTAGTA ATGAAATTGG GCTTTCTAAT AGTAGCGCTG GATTAATTGT AACGCTAACT
CAAATTGGAT ATGTTGTTGG CTTACTATTT CTTGTGCCTT TGGGGGATAT TGTTGAGAAT
AAAAAATTGA TACTTATATT GTTATTTTTA AGTGCATTTG CACTCATTTC CATGGTTTTT
GTAAAAAGCG CAACTTTGTT GTTAATTGCT TCATTCTTTA TCGGACTGGG TTCGGTCGCA
GCGCAAGTAC TCGTACCTCT TGTATCATAT CTTTCATCTG AGAATGCACG CGGTCGCGTA
GTTGGCAATG TCATGAGTGG TCTGTTATTA GGTATTATGC TTGCGCGACC GATATCTAGT
CTAGTAGCCG ATATGTGGGG ATGGAATGCA ATATTTGCTT TATCTGCTAC TGTAATTATT
GTCTTAGCGT TTGTATTATC GAAAGTACTC CCTACTAGGA AACCACAGGT AAAAACAAAT
TATATAGCCT TACTTAATTC AATGTGGCAA CTGCTACGAA CTACTCCAAT TTTACGCCGT
CGCGCCATTT ATCATGCTTG TGTATTTGGG GCTTTCAGCT TATTCTGGAC CACTGTTCCA
TTATTATTAT CTAGTCCTGC TTTTCATTTT TCTCAGACTG CCATAGCATT ATATGCACTT
GTCGGAATTA CAGGTGCAAT AGCCGCTCCA ATAGGTGGTC GTCTAGCTGA TCTTGGCTGG
ACACGATCCG CCACTGGGAT AGCTCTCACT GTTGTTATTA TTTCTTTATT ACTACCACTT
ATTATTCAAA GTAGTTCGCC CATCGGAATA GCTGTTTTAG TAATTGCTGC AATTCTGTTA
GACATGGGAG TATCTGCAAA CCTTGTGCTT AGCCAACGTT TAATTTTCTC GTTAAGTCCA
GAAATTCGTA GTCGATTAAA CGGACTATTT ATGGCTATTT TCTTTTTAGG AGGTGCTGTT
GGATCCTTTA TTGGAGGATG GAATCTAACA TTATGGATAG GAATCGCTTT TCCGACCATA
GCCTTGCTTT ATTTTGCTAG AGAAAAATAG
 
Protein sequence
MSNFKNKVID KDISSGLIIL LATACGIIVA NLYYAQPLIG VISNEIGLSN SSAGLIVTLT 
QIGYVVGLLF LVPLGDIVEN KKLILILLFL SAFALISMVF VKSATLLLIA SFFIGLGSVA
AQVLVPLVSY LSSENARGRV VGNVMSGLLL GIMLARPISS LVADMWGWNA IFALSATVII
VLAFVLSKVL PTRKPQVKTN YIALLNSMWQ LLRTTPILRR RAIYHACVFG AFSLFWTTVP
LLLSSPAFHF SQTAIALYAL VGITGAIAAP IGGRLADLGW TRSATGIALT VVIISLLLPL
IIQSSSPIGI AVLVIAAILL DMGVSANLVL SQRLIFSLSP EIRSRLNGLF MAIFFLGGAV
GSFIGGWNLT LWIGIAFPTI ALLYFAREK