Gene GBAA_5358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_5358 
Symbol 
ID2815748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4852458 
End bp4853768 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID637792031 
Producthypothetical protein 
Protein accessionYP_022016 
Protein GI47530667 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTTT TTAGTGGCTT ATTTTCAAAA AAACCTGTAC TGGAAGAGCG TTCAACATAT 
GACTCAATGG AAGTTGACGG AGCTTTTTCT CTTGATAGTT TACTAGTTAC AGATGCTGTA
ACTGAAGAAA AAGTATTGAA AATTCCTACA GCTCGTTCGT GTGTTGAATT AATTACAAGC
TCAATCGCAC AGATGCCTGT TTATTTATAC AAGGAAAATG CTGATGGCTC AGTCGAGCGA
ATTTTAGACG ACAATCGTGT TCATTTGCTA AATCATGAAG CCAATGACTT TTTAAACGGT
TATTCCTTAA AGAAACACAT GGTCAAGGAT TACTTATTAC ATGGTTCCTC TTATGTATCA
ATCATTGAGG CAGGTAACAC AATTTTAGAG CTTCACCCAT TGCTTTCAAA AGCTATTGTG
GTTAACAAAC GAATTAAGCA CGGTTATCGC ACAGTTGGTG CTGATATTTT CTTATCAAAC
AGTGAAAATG GCGCTGTTAA CGAGCTTAAT CGCCAACAAA CTAAGTTTAA ACCACATGAG
CTTATGATTA CATTGCAAGA TACAAACGAT GGTTTAACAA GTCATGGTGT TATTAAACAT
GGTCAAGACA TTTTTAAACA AGCACTTTCT GAATCAGTTT ACACTCATAA CTTGTATGAA
AATGGAGCGC TTCCTTTGGG TCTTCTTAAG ACAGACGCTC GACTAAATAA AAAGCAAGCT
TCTAGTTTAC GAGAAGCTTG GCAAAAACTT TACGGTGGAG TTAAGAATGC AGCTAAAACT
GTTGTACTTC AAGAAGGTAT GAAATACGAA GCTCTTTCAA TGAATCCTTC AGAAATTCAA
ATGTCTGAAA CTCGTAAAGC TACAAACTCT GAAATTTGTA AATTGTTTGG TGTGCCTGAA
AGCATGGTTA ACGCAACAAT TGGCAAACAG TACGTTTCAC TGGAGCAAAA TCAATTATAT
CTCTTAAAGA ATACTCTATC TCCAATCATC GTTGCTATGG AAAGCTCGAT GGACAAAGCA
CTTTTGCTTG AGTCTGAAAA AGACAAAGGT TATTTCTTCA GATTTGATAC TTCAGAGCTT
ATCCGCTCGA CTGAAAAAGA GCTAGTTGAT ACTGTGGTAA CTGCTGTTCA AGGCGGTATT
TTCACAATTA ACGAAGGACG AGCTAAGTTT AACTTGCCAT CGATTGATGA AGGCGACAAT
GTACTCGTAA CGCCTGGTGC TAGTCAAATG GGCGATAAAA ATACAAAAGA AACAACCGAT
CCACACGAGG AGGAACAATT AAATGACAAA ACTGGAACTC AGACAGATTG A
 
Protein sequence
MGLFSGLFSK KPVLEERSTY DSMEVDGAFS LDSLLVTDAV TEEKVLKIPT ARSCVELITS 
SIAQMPVYLY KENADGSVER ILDDNRVHLL NHEANDFLNG YSLKKHMVKD YLLHGSSYVS
IIEAGNTILE LHPLLSKAIV VNKRIKHGYR TVGADIFLSN SENGAVNELN RQQTKFKPHE
LMITLQDTND GLTSHGVIKH GQDIFKQALS ESVYTHNLYE NGALPLGLLK TDARLNKKQA
SSLREAWQKL YGGVKNAAKT VVLQEGMKYE ALSMNPSEIQ MSETRKATNS EICKLFGVPE
SMVNATIGKQ YVSLEQNQLY LLKNTLSPII VAMESSMDKA LLLESEKDKG YFFRFDTSEL
IRSTEKELVD TVVTAVQGGI FTINEGRAKF NLPSIDEGDN VLVTPGASQM GDKNTKETTD
PHEEEQLNDK TGTQTD