Gene BAS3958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3958 
Symbol 
ID2850347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3902303 
End bp3904015 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content38% 
IMG OID637507195 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_030208 
Protein GI49186956 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.112067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTTA ACATTCAAGG GATCGCTGCA TCAAGTGGGA TTGCTATTGC AAAGGCTTTC 
AGACTTGAAA ATCCTGAATT TAACATCGAA CAGAAATCAA TTACAAACGA AGCTGCAGAA
ATTGCACGCT TAGAAGCTGC GCTTGAGAAA GCAAAAACTG AATTAGAAGC TATTAAGGAC
CACGCTTTTG CTGAGCTAGG TGCTGACAAA GCTGCGATCT TTGAAGCACA TTTATTAGTG
TTAAATGATC CAGAACTAGT AAACCCAGTA AAAGATAAAG TAAATAGCGA AAAAGTAAAT
GCTGAATTTG CAATGGATGA AGTTGCATCA ATGTTTATCT CTATGTTTGA AAACATGGAT
AACGAATATA TGAAAGAACG TGCTGCGGAC ATTCGTGACG TAACAAAACG TGTTCTTGCG
CATTTACTAG GCATTAACTT CTCAAATCCT GGTACAAATT CTGAAGAAGT AATCATTATT
GCTGAAGATT TAACACCATC TGATACAGCT CAGTTAAACC GTAAGTATGC AAAAGGTTTT
ACTACCGATA TCGGCGGACG TACATCTCAC TCTGCAATTA TGGCTCGCTC TATGGAAATT
CCAGCTGTTG TTGGTACAAA AGTTGTTATG GAGAAAATCC AAAACGGCGA TATCGTAATC
ATCGATGGTT TAGATGGGGA AGTAATTGTT AATCCATCAG AAGAAACTCT TCGCTCGTTT
GAAGAAAAGA AAGCGAAATT TGAAGAGCAA AAAGCTGAAT GGGCAAAATT AAAAGACCAA
GCTACTGTAA CAAGTGATGG ACATCACGTT GAGCTTGTTG CAAATATCGG AACACTAAAT
GATGTACAAG GTATTATCGA TAATGGCGGA GAAGGCGTTG GTTTATACCG TACAGAATTC
TTATACATGG GCCGTGACAA TCTTCCAACA GAAGAAGAGC AGTTCGAAGC GTATAAAGCA
GTTCTTGAAG GTGTAAAAGA AGGTCAACCT GTTGTTGTTC GTACACTTGA CATCGGTGGA
GATAAAGAGC TTCCATACTT ACATTTACCA AAAGAAATGA ACCCATTCTT AGGCTACCGT
GCAATTCGCT TATGTCTTGA TGAGCAAGAT GTGTTCCGTA CACAACTTCG TGCATTACTT
CGTGCTAGCG TATACGGTAA CTTAAAAATT ATGTTCCCAA TGATTGCAAC TCTTGATGAG
TTCCGTCAAG CAAAAGCGAT CTTATTAGAA GAAAAAGCGA AACTTGTAGA AGTGGGTACA
ACTGTTTCTG ATTCTATTGA AGTTGGTATG ATGGTTGAAA TCCCAGCTTC AGCAGTATTA
GCAGATCAAT TCGCAAAAGA AGTTGATTTC TTCTCTATCG GAACAAATGA CTTAATTCAA
TACACAATGG CTGCAGACCG TATGAACGAA CAAGTATCTT ACTTATACCA ACCATATAAC
CTATCTATTT TACGTCTTGT AAAAATGGTT ATCGATGCTG CTCATAAAGA AGGCAAATGG
GCTGGTATGT GTGGTGAGAT GGCGGGCGAT TCACTTGCTA TCCCATTATT ATTAGGATTA
GGTTTAGATG AGTTCAGTAT GAGTGCAACA TCTATTCTTC CTGCAAGAAC ACAACTAAGC
AAGTTGTCAA AAGCAGAAAT GGAAACATTA GCAGAAAAAG CATTAATGAT GTCAACTGCT
GAAGAAGTTG TTGAACTAGT TAAAAGCATA TAA
 
Protein sequence
MTLNIQGIAA SSGIAIAKAF RLENPEFNIE QKSITNEAAE IARLEAALEK AKTELEAIKD 
HAFAELGADK AAIFEAHLLV LNDPELVNPV KDKVNSEKVN AEFAMDEVAS MFISMFENMD
NEYMKERAAD IRDVTKRVLA HLLGINFSNP GTNSEEVIII AEDLTPSDTA QLNRKYAKGF
TTDIGGRTSH SAIMARSMEI PAVVGTKVVM EKIQNGDIVI IDGLDGEVIV NPSEETLRSF
EEKKAKFEEQ KAEWAKLKDQ ATVTSDGHHV ELVANIGTLN DVQGIIDNGG EGVGLYRTEF
LYMGRDNLPT EEEQFEAYKA VLEGVKEGQP VVVRTLDIGG DKELPYLHLP KEMNPFLGYR
AIRLCLDEQD VFRTQLRALL RASVYGNLKI MFPMIATLDE FRQAKAILLE EKAKLVEVGT
TVSDSIEVGM MVEIPASAVL ADQFAKEVDF FSIGTNDLIQ YTMAADRMNE QVSYLYQPYN
LSILRLVKMV IDAAHKEGKW AGMCGEMAGD SLAIPLLLGL GLDEFSMSAT SILPARTQLS
KLSKAEMETL AEKALMMSTA EEVVELVKSI