Gene BAS4886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4886 
Symbol 
ID2849701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4768821 
End bp4770137 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content37% 
IMG OID637508143 
Productamino acid permease family protein 
Protein accessionYP_031128 
Protein GI49187875 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000145673 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCATG ATGAGAAGAA CAAAATTGGT TTAACGGTAG CACTTTCTAT CGTAGTAGGA 
ACGATTATTG GGTCTGGTGT GTTTATGAAA CCAGGGAGCG TATTAGATTA CTCGGGGAGT
TCTAATATGG CGATTCTTGC TTGGGTAATT GGTGGTCTGT TAACGTTAGC AAGTGGTTTA
ACAGTAGCTG AAATTGGAGC GCAAATCCCG AAAAATGGTG GGTTGTATAC GTATTTAGAG
GAGATTTACG GAAGTTTTTG GGGATATTTA TCAGGCTGGA TGCAAACGAT TGTTTATGGG
CCAGCTATTA TTGGAACATT AGGGTTATAC TTTAGTTCTT TAATGATTAA TTTTTTCTAT
TTAGATAAAG TATGGAATTT ACCAATCGCA ATTGGAACAG TTGTGTTCCT TGGCGTTGTA
AATAGTATGG GAACAAAATA CGGAGGTATC GTCCAAACGA TCACGACAAT CGGGAAGATG
ATTCCAATCG TATTAATTGT TGTGTTAGGT TTTTGGAAAG GGAATAGCGA TATCTTTAAC
GTAGTTGTGC CGATATCAGA AAATCAAAGT ATCGGGATGG CGATCTTAGC AACGTTATTT
GCTTATGACG GCTGGATTTT ACTTGCTTCG ATTGGCGGAG AAATGAAGAA TCCAACAAAG
CTATTACCGA AAGCAATGAC AGTTGGGATT TTAATTGTAA CAGCTGCTTA CGTATTAATT
AACTTAGCGT TACTGAATGT ATTACCAGCA ACGCAAATTG TAGAACTTGG AGAAAATGCA
ACAGCGACAG CTGCGGGCAT GCTACTTGGG GAATATGGCG GGAAAATTAT TAGTATCGGT
ATTATCGTAT CTATTTTCGG TTGTTTAAAT GGAAAGATTT TAACGTTCCC ACGTATCCCG
ATGTCGATGG CAGAACGTGG ACAACTTCCA TTTGCTAAGT TTATTGCAAA GGAAAGTCCA
AGATTTAAAA CACCAGCAAA TGCGATTACT GTTGAAATCA TTTTAGGAAT TATTTTAATG
ATTATTAGTG ATCCAAATAA GCTATCTGAG ATTTCCGTAT TCATTATTTA TATTTTCTAC
GTAATGACGT TTATCGGTGT CTTCATTTTA AGAAAACGTA ATAAGAATAA AGAGCGTGCA
TACAGTGTAC CGTTATTCCC AATCGTCCCA ATCGTTGCGA TTTTGGGCTC ACTCTTTGTA
ATCGGTAGTG CGATTATTAA CGATCCACTA AGTTGTTTCT TATCAATTGG AATTGTCTTT
ACGGGACTTC CGGTATATTG GTATTTAAAT AAGAAGAACA AAACTGAAGT GTCATAA
 
Protein sequence
MHHDEKNKIG LTVALSIVVG TIIGSGVFMK PGSVLDYSGS SNMAILAWVI GGLLTLASGL 
TVAEIGAQIP KNGGLYTYLE EIYGSFWGYL SGWMQTIVYG PAIIGTLGLY FSSLMINFFY
LDKVWNLPIA IGTVVFLGVV NSMGTKYGGI VQTITTIGKM IPIVLIVVLG FWKGNSDIFN
VVVPISENQS IGMAILATLF AYDGWILLAS IGGEMKNPTK LLPKAMTVGI LIVTAAYVLI
NLALLNVLPA TQIVELGENA TATAAGMLLG EYGGKIISIG IIVSIFGCLN GKILTFPRIP
MSMAERGQLP FAKFIAKESP RFKTPANAIT VEIILGIILM IISDPNKLSE ISVFIIYIFY
VMTFIGVFIL RKRNKNKERA YSVPLFPIVP IVAILGSLFV IGSAIINDPL SCFLSIGIVF
TGLPVYWYLN KKNKTEVS