Gene BAS4979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4979 
Symbol 
ID2852496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4853669 
End bp4854979 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID637508233 
Producthypothetical protein 
Protein accessionYP_031218 
Protein GI49187965 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTTT TTAGTGGCTT ATTTTCAAAA AAACCTGTAC TGGAAGAGCG TTCAACATAT 
GACTCAATGG AAGTTGACGG AGCTTTTTCT CTTGATAGTT TACTAGTTAC AGATGCTGTA
ACTGAAGAAA AAGTATTGAA AATTCCTACA GCTCGTTCGT GTGTTGAATT AATTACAAGC
TCAATCGCAC AGATGCCTGT TTATTTATAC AAGGAAAATG CTGATGGCTC AGTCGAGCGA
ATTTTAGACG ACAATCGTGT TCATTTGCTA AATCATGAAG CCAATGACTT TTTAAACGGT
TATTCCTTAA AGAAACACAT GGTCAAGGAT TACTTATTAC ATGGTTCCTC TTATGTATCA
ATCATTGAGG CAGGTAACAC AATTTTAGAG CTTCACCCAT TGCTTTCAAA AGCTATTGTG
GTTAACAAAC GAATTAAGCA CGGTTATCGC ACAGTTGGTG CTGATATTTT CTTATCAAAC
AGTGAAAATG GCGCTGTTAA CGAGCTTAAT CGCCAACAAA CTAAGTTTAA ACCACATGAG
CTTATGATTA CATTGCAAGA TACAAACGAT GGTTTAACAA GTCATGGTGT TATTAAACAT
GGTCAAGACA TTTTTAAACA AGCACTTTCT GAATCAGTTT ACACTCATAA CTTGTATGAA
AATGGAGCGC TTCCTTTGGG TCTTCTTAAG ACAGACGCTC GACTAAATAA AAAGCAAGCT
TCTAGTTTAC GAGAAGCTTG GCAAAAACTT TACGGTGGAG TTAAGAATGC AGCTAAAACT
GTTGTACTTC AAGAAGGTAT GAAATACGAA GCTCTTTCAA TGAATCCTTC AGAAATTCAA
ATGTCTGAAA CTCGTAAAGC TACAAACTCT GAAATTTGTA AATTGTTTGG TGTGCCTGAA
AGCATGGTTA ACGCAACAAT TGGCAAACAG TACGTTTCAC TGGAGCAAAA TCAATTATAT
CTCTTAAAGA ATACTCTATC TCCAATCATC GTTGCTATGG AAAGCTCGAT GGACAAAGCA
CTTTTGCTTG AGTCTGAAAA AGACAAAGGT TATTTCTTCA GATTTGATAC TTCAGAGCTT
ATCCGCTCGA CTGAAAAAGA GCTAGTTGAT ACTGTGGTAA CTGCTGTTCA AGGCGGTATT
TTCACAATTA ACGAAGGACG AGCTAAGTTT AACTTGCCAT CGATTGATGA AGGCGACAAT
GTACTCGTAA CGCCTGGTGC TAGTCAAATG GGCGATAAAA ATACAAAAGA AACAACCGAT
CCACACGAGG AGGAACAATT AAATGACAAA ACTGGAACTC AGACAGATTG A
 
Protein sequence
MGLFSGLFSK KPVLEERSTY DSMEVDGAFS LDSLLVTDAV TEEKVLKIPT ARSCVELITS 
SIAQMPVYLY KENADGSVER ILDDNRVHLL NHEANDFLNG YSLKKHMVKD YLLHGSSYVS
IIEAGNTILE LHPLLSKAIV VNKRIKHGYR TVGADIFLSN SENGAVNELN RQQTKFKPHE
LMITLQDTND GLTSHGVIKH GQDIFKQALS ESVYTHNLYE NGALPLGLLK TDARLNKKQA
SSLREAWQKL YGGVKNAAKT VVLQEGMKYE ALSMNPSEIQ MSETRKATNS EICKLFGVPE
SMVNATIGKQ YVSLEQNQLY LLKNTLSPII VAMESSMDKA LLLESEKDKG YFFRFDTSEL
IRSTEKELVD TVVTAVQGGI FTINEGRAKF NLPSIDEGDN VLVTPGASQM GDKNTKETTD
PHEEEQLNDK TGTQTD