Gene BAS0222 details


Gene Information

Locus tag: BAS0222
Symbol:
ID: 2849626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 223944
End bp: 224954
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 39%
IMG OID: 637503427
Product: oligopeptide ABC transporter ATP-binding protein
Protein accession: YP_026507
Protein GI: 49183255
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
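
The start, end, gene length and protein length reported in this record are consistent with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single trailing stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(223944, 224954)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both results match the Gene Length and Protein Length fields of this record.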


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAC CATTATTAGA AGTAAAAAAC TTAAAAACAT ATTTTCCAAT TAAAGGCGGC 
ATATTTAGTA GAACGGTTGG ACATGTAAAA GCAGTTGATG GAGTAAGTTT TACTATTAAG
AAAGGCGAAG TATTCGGTCT CGTTGGTGAA TCAGGAAGCG GAAAAACGAC GATAGGAAAA
ACAATTTTAC GTCTCGTCCA AAAAACGGAG GGAGAAGTGA AATTTAAAGG ACACGATGTT
CATTCTCTAT CAAAAGAGGA ATTAAGAAAA CATCGTCCTA ATATGCAGCT TGTGTTTCAA
GATCCATTTA GCTCATTAAA TCCGAGAATG AGAATTGGAG AGGCACTTGG TGAGCCGATG
TTGGCTCACG GATTAGCGAC GAAAGAAAAT GTTCGCGAAA AAGTAACGGA AGTATTAGAG
TTATGTGGCT TAGCCCCATA TCATATTGAC CGGTACCCTC ATGAATTTTC TGGTGGACAA
CGTCAACGTA TCGTTATCGC AAGAGCCATG GTATTAAACC CGGAATTTAT TGTAGCTGAT
GAACCTGTGG CAGCACTAGA CGTATCTATT CAAGCACAGA TCATTAATTT ATTTAGTGAG
CTACAGGAGA AAAAGGGACT ATCTTATTTG TTCATTTCAC ATGATTTAAG CGTAGTAGAG
CATTTATGTA CGAAGATTGG AATTATGTAT TTAGGAACAA TTGTGGAAAC AGCACCGCGT
GATGAGTTAT TTACAAACCC ACTTCATCCG TATACAAAAG CATTGTTATC CGCTGTGCCA
ATACCAGATC CAACAGTGAA GCGAGAGCGA ATTATACTAG AGGGTGATAT TCCAAGCCCA
GCGAATCCGC CTTCAGGTTG TTGCTTTCAT ACACGCTGCC CGTTTGCAAC AGATATTTGT
AAACAAACGG GGAATTCCGT AATGTTGGTG AAGAGCACTT TGTTGCTTGT CATCATGTAT
AAAAGAGAAG GACTCTTTCA GAAATTGAAA GAGTCCTTTT TTATTTACTA G
 
Protein sequence
MSEPLLEVKN LKTYFPIKGG IFSRTVGHVK AVDGVSFTIK KGEVFGLVGE SGSGKTTIGK 
TILRLVQKTE GEVKFKGHDV HSLSKEELRK HRPNMQLVFQ DPFSSLNPRM RIGEALGEPM
LAHGLATKEN VREKVTEVLE LCGLAPYHID RYPHEFSGGQ RQRIVIARAM VLNPEFIVAD
EPVAALDVSI QAQIINLFSE LQEKKGLSYL FISHDLSVVE HLCTKIGIMY LGTIVETAPR
DELFTNPLHP YTKALLSAVP IPDPTVKRER IILEGDIPSP ANPPSGCCFH TRCPFATDIC
KQTGNSVMLV KSTLLLVIMY KREGLFQKLK ESFFIY
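
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one way to strip the spacing, compute GC content, and translate the coding sequence with the amino acid assignments of NCBI translation table 11. The helper names are hypothetical, and the translation is simplified: it assumes the reading frame starts at the first base and stops at the first stop codon, and it does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGAGTGAAC CATTATTAGA AGTAAAAAAC")
print(translate(first_blocks))  # MSEPLLEVKN
```

The translated residues agree with the first block of the protein sequence. Passing the full gene sequence to `clean_sequence` and then to `gc_percent` or `translate` works the same way.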