Gene GBAA_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_0235 
Symbol 
ID2817134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp223931 
End bp224941 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content39% 
IMG OID637787197 
Productoligopeptide ABC transporter ATP-binding protein 
Protein accessionYP_016841 
Protein GI47525492 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAC CATTATTAGA AGTAAAAAAC TTAAAAACAT ATTTTCCAAT TAAAGGCGGC 
ATATTTAGTA GAACGGTTGG ACATGTAAAA GCAGTTGATG GAGTAAGTTT TACTATTAAG
AAAGGCGAAG TATTCGGTCT CGTTGGTGAA TCAGGAAGCG GAAAAACGAC GATAGGAAAA
ACAATTTTAC GTCTCGTCCA AAAAACGGAG GGAGAAGTGA AATTTAAAGG ACACGATGTT
CATTCTCTAT CAAAAGAGGA ATTAAGAAAA CATCGTCCTA ATATGCAGCT TGTGTTTCAA
GATCCATTTA GCTCATTAAA TCCGAGAATG AGAATTGGAG AGGCACTTGG TGAGCCGATG
TTGGCTCACG GATTAGCGAC GAAAGAAAAT GTTCGCGAAA AAGTAACGGA AGTATTAGAG
TTATGTGGCT TAGCCCCATA TCATATTGAC CGGTACCCTC ATGAATTTTC TGGTGGACAA
CGTCAACGTA TCGTTATCGC AAGAGCCATG GTATTAAACC CGGAATTTAT TGTAGCTGAT
GAACCTGTGG CAGCACTAGA CGTATCTATT CAAGCACAGA TCATTAATTT ATTTAGTGAG
CTACAGGAGA AAAAGGGACT ATCTTATTTG TTCATTTCAC ATGATTTAAG CGTAGTAGAG
CATTTATGTA CGAAGATTGG AATTATGTAT TTAGGAACAA TTGTGGAAAC AGCACCGCGT
GATGAGTTAT TTACAAACCC ACTTCATCCG TATACAAAAG CATTGTTATC CGCTGTGCCA
ATACCAGATC CAACAGTGAA GCGAGAGCGA ATTATACTAG AGGGTGATAT TCCAAGCCCA
GCGAATCCGC CTTCAGGTTG TTGCTTTCAT ACACGCTGCC CGTTTGCAAC AGATATTTGT
AAACAAACGG GGAATTCCGT AATGTTGGTG AAGAGCACTT TGTTGCTTGT CATCATGTAT
AAAAGAGAAG GACTCTTTCA GAAATTGAAA GAGTCCTTTT TTATTTACTA G
 
Protein sequence
MSEPLLEVKN LKTYFPIKGG IFSRTVGHVK AVDGVSFTIK KGEVFGLVGE SGSGKTTIGK 
TILRLVQKTE GEVKFKGHDV HSLSKEELRK HRPNMQLVFQ DPFSSLNPRM RIGEALGEPM
LAHGLATKEN VREKVTEVLE LCGLAPYHID RYPHEFSGGQ RQRIVIARAM VLNPEFIVAD
EPVAALDVSI QAQIINLFSE LQEKKGLSYL FISHDLSVVE HLCTKIGIMY LGTIVETAPR
DELFTNPLHP YTKALLSAVP IPDPTVKRER IILEGDIPSP ANPPSGCCFH TRCPFATDIC
KQTGNSVMLV KSTLLLVIMY KREGLFQKLK ESFFIY