Gene BAS4404 details

Gene Information

Locus tag: BAS4404
Symbol:
ID: 2851669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4314364
End bp: 4315584
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 33%
IMG OID: 637507641
Product: hypothetical protein
Protein accession: YP_030651
Protein GI: 49187399
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1668] ABC-type Na+ efflux pump, permease component
TIGRFAM ID:
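
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4314364, 4315584)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both values agree with the Gene Length and Protein Length fields of the record.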


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAAAT TTTCACATGT ATTTTCATTT TATTTTAGAG AAGCGTTTTT ATCTAAAAAA 
TCATTAATTA CGAGTGCGAT TTTATTTTTA ATTGTGTTTG GGCTTTTTGC TTTTAATCAT
TTCACAGCTG ATAACGATAA AGGAAAAGAT AAATTTGCTG TAGTAACAGA AAGCGATACG
TATAAAGTGC AAGAAAGTGA TGTAACAAAG TTACTTCCGT CTGCTAAAGT AACGGTAGAA
AATAAGGATA AGTTTAATGA GTTGCGTAAG CAAGTAGAAG ATGGGGATTT AGACGGTTTA
TTCCGTATTA CAGAAAAGAA TGGTATTCCA GAAGTGACGT ATATGTATAG TGGTTTTCCA
AGTCAATCAA CTGCTACACT TATGGCGAGT TATTTGAAGG AACAGTATAC AGCGGTAATG
ATTGAAAAAA ATAACGTATC TGCAGAAGTA GCAAAGCAGT TACAGATGGA AATTCCGTTA
AAGCAAGAGG CGGTAAAGGA TCACGCATCT TCCTTTGGGA TTGGCTATGT TTTCTCGTTT
GCTTTATATA TGTTTATTGT TATCTTTGGT GCTGCAATTG GAACAACTGT AGCATCTGAA
AAATCATCCC GTGTTATGGA GTTAATGCTT CCGAAAGTAA AACCATTAAC AATGCTACAT
GCCAAAATTT TAGCAATTGT TTCTAGTGCT TTACTATTAC TTGTTATTGC TTCTTTCGGT
TTTGTTGTAC CAAATTTATT AGGATGGGTA GATTTAGAAA ATGCATCTTT AATAGGACTC
ACACTAGATT TTTCTAAATT GGATGCTATA GTAATTAGTA TGTTCTTTGT GTATTTTGTT
ACGGGTTACT TACTATACGC AATGCTATAT GCAGCGGTTG GTGCTGTTGT GTCAAAAATT
GAGGATGTAC AATCTCTTTC GTTCCCAATT ACAATGTTAG GTATGGCAGC ATTTTTTATT
AGTCTTAAAT CATTATTTGA TCCAAATAGT ACATTAGCTA TAGTTAGTTC ATATATTCCA
TTCTTTACAC CTATGGTTAC TTTTTCAAGG CTTGTATCTG GTGAAGCGGG CACTGTAGAA
ATTATTGTGA CGTTAGTCAT TTTATTGGTG ACTATTGTTA TCGTCAATAT GCTTACAAGC
CGCATTTACG TAAACGGTGT AATGAATTAC TCAGACAAAG TGAAATTTAA AGATTTAGCA
AAATTCATAA AGCGTCAATA A
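
The GC content value in the Gene Information section can be recomputed from a sequence printed in this layout (blocks of ten bases separated by spaces and line breaks). The following is a minimal sketch, not the database's own method: the function name is hypothetical, and any character other than A, C, G or T (spaces, line breaks, ambiguity codes) is simply ignored, which is an assumption about how such characters should be handled.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped (assumed behaviour).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence above: 3 of 10 are G or C.
print(gc_content("ATGCGTAAAT"))  # 0.3
```

Passing the full gene sequence as one string gives the fraction for the whole gene, which can be rounded to a percentage for comparison with the record.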
 
Protein sequence
MRKFSHVFSF YFREAFLSKK SLITSAILFL IVFGLFAFNH FTADNDKGKD KFAVVTESDT 
YKVQESDVTK LLPSAKVTVE NKDKFNELRK QVEDGDLDGL FRITEKNGIP EVTYMYSGFP
SQSTATLMAS YLKEQYTAVM IEKNNVSAEV AKQLQMEIPL KQEAVKDHAS SFGIGYVFSF
ALYMFIVIFG AAIGTTVASE KSSRVMELML PKVKPLTMLH AKILAIVSSA LLLLVIASFG
FVVPNLLGWV DLENASLIGL TLDFSKLDAI VISMFFVYFV TGYLLYAMLY AAVGAVVSKI
EDVQSLSFPI TMLGMAAFFI SLKSLFDPNS TLAIVSSYIP FFTPMVTFSR LVSGEAGTVE
IIVTLVILLV TIVIVNMLTS RIYVNGVMNY SDKVKFKDLA KFIKRQ
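
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It uses the standard codon assignments, which to my knowledge are the same ones used by translation table 11; the special handling of alternative start codons in that table is not implemented, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the layout used on this page.
print(translate("ATGCGTAAAT TTTCACATGT ATTTTCATTT"))  # MRKFSHVFSF
```

The output matches the first ten residues of the protein sequence above. Splitting on whitespace before translating means the sequence can be pasted in as printed, without removing the spaces between blocks.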