Gene BAS0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0799 
Symbol 
ID2852949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp846009 
End bp847748 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content35% 
IMG OID637504061 
ProductABC transporter substrate-binding protein 
Protein accessionYP_027075 
Protein GI49183823 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA TGGACTATTA CATTAGACTA AGACTACATG CACAAGATCA ACAACATATA 
CGAAATAGCT TACAAGAATT AGCGGATGTT TTATATTGCA GTACGAAAAA CGTAAAGATT
TTATTAAAGA AAATGAGCGA GGAGCAATTA ATTAGCTGGA CTCCAGGACG AGGACGCGGA
AATAAAACGG AAATTCTATT TTCACATAAT TTCGTAGAAG CGATTGAATC CTATACAGAT
GAACTGTTAG CGCAAGAAAA ATTAAAGGAC GTTTTTCTTC TTTTAAAAGA ACCTCTCCCC
CCTGCACTTC AAAAAAAGAT AGAAAATAAA CTACATCATC ATTTTGGATA CGAACCTTCA
AATGATATGT ACGACATATT AAAGATTCCT ATATCGAGAA AGATATTCCC ATTAGATCCG
GCTTTTGTCG CTGTAACTAC AGAAAGCCAT CTTACGAGTC AAATTTTTGA TACGTTAGTC
GTTTATAACG ATGTTACGGA AAAAATGGAA CCACACATTG CGCATACGTG GGAATTAAGT
GAAGACTGGC TTACGTGGAC GTTCTATTTA CGAAAGGATA TTCATTTTCA TAACGAAACA
ATTTTAACTT CTAAAGATGT GCAATTTTCA TTTGAAAGGC TAAAAGAAGT TCATTCCCCT
TTCGAATGGT TAACAGAAGA AATTGTTCAA ATTGAAACAC CATCGCCACT GCAAATAAGG
TTCCATTTAG CAAAACCCAA CCTTTTTTCC TTACACTATG TAAGTTCCAT ACAACTAGCA
ATTTTACCGC GTGATGTGAG TATCGAAAAT CATCATTATA TAGGTACGGG ACCTTTCAAA
CTTGCTCATT ACTCTGAAGA TAATATCGTA CTCGAAGCAT TCACTCATTA TTTTAAAGAG
CGCGCATTAT TAGACCGTAT TGAATTTTGG GGTATTCCGG ATCATGTACA AATCGATGCT
GATTATGAAC TACCAAATGA AGAGGAAAAT GAAAGGCATG ATATACAAAT AGAAGAAATA
GGTTGTATTT ATGCTGGCTT CAACTTTACA AAACCTGGTC CTCATCACGA CATGTACTTT
CGAAAAGCTT GGAGAGAACT ATATGACGTT GAAACGATAC TTCGTAGCAT AGAAGGAAGA
CGAACAATTG CTGCATCAAG CTTTTTCCCT GAAAGAAGCC GCCAAGCTTT TAAAAGGTCT
TACTCTTTAG AAAAAGCGAA AAAGTATTTA AAAAAGAGCA CTTATAATGG AGAGGCGATA
CATATTTACT TCTTCGCATT TAAAGATAGT GCAAATGATG CATATTTCCT AAAAGAACGA
TGTGACGGTC TAGGTATACA AGTAGAGCTT CACCCATTTC TCGTTTCAGA TTATATGAAC
CGTTCTATCG ATCAACATGC CGATATTATT TTCATGGGAG AAGTGTTTGC TTCCAATCAC
GAACTTGCAT TCTTAAATGT ATTTAAAAAT AAAAGTTGCT TCGTAAACCG GTTCATGGAC
CAGCACTATG AAAAACAAAT TAACTGTTTG TTAGATACAT TTTTATTAGA AGAAAATAAA
GAGAAACGCT ATGAGCTCAT GTATGAGATC GAAGAATTTT TACAAGCAGA ACACATCATT
TTATTTAACT ATCACGTTTT AAAAAGAAAG ACATACCCTT CTTCTTTGAA AAACGTAACA
ATTGATTCAT TCGGTTGGGC AAATTTTGCG AAGTTATGGA TACAGCCATC CATGTCTTAA
 
Protein sequence
MKIMDYYIRL RLHAQDQQHI RNSLQELADV LYCSTKNVKI LLKKMSEEQL ISWTPGRGRG 
NKTEILFSHN FVEAIESYTD ELLAQEKLKD VFLLLKEPLP PALQKKIENK LHHHFGYEPS
NDMYDILKIP ISRKIFPLDP AFVAVTTESH LTSQIFDTLV VYNDVTEKME PHIAHTWELS
EDWLTWTFYL RKDIHFHNET ILTSKDVQFS FERLKEVHSP FEWLTEEIVQ IETPSPLQIR
FHLAKPNLFS LHYVSSIQLA ILPRDVSIEN HHYIGTGPFK LAHYSEDNIV LEAFTHYFKE
RALLDRIEFW GIPDHVQIDA DYELPNEEEN ERHDIQIEEI GCIYAGFNFT KPGPHHDMYF
RKAWRELYDV ETILRSIEGR RTIAASSFFP ERSRQAFKRS YSLEKAKKYL KKSTYNGEAI
HIYFFAFKDS ANDAYFLKER CDGLGIQVEL HPFLVSDYMN RSIDQHADII FMGEVFASNH
ELAFLNVFKN KSCFVNRFMD QHYEKQINCL LDTFLLEENK EKRYELMYEI EEFLQAEHII
LFNYHVLKRK TYPSSLKNVT IDSFGWANFA KLWIQPSMS