Gene GBAA_5360 details

Gene Information

Locus tag: GBAA_5360
Symbol:
ID: 2819353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 4854297
End bp: 4855337
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 39%
IMG OID: 637792033
Product: HK97 family phage major capsid protein
Protein accession: YP_022018
Protein GI: 47530669
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
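
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 4854297
END_BP = 4855337
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the 1041 bp span corresponds to 347 codons, which is 346 residues plus the stop codon.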


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAAAG AAAAAGAATT AAAAGAATTA CGTGCCAAAA TGGAAGCAAT GGAAGCGGAA 
GTTCGTGCAG AGCAAGAAAC AGCTCAAGAA GTAGAAGTTC GTGATGTAGA AGTTGACCAA
ACAGAAGTTG AGCTACGTGG AGTAGAACAA TTCTTAAAAG GCGACATTCA CGGTGCTGAA
GTTCGTACAA TGACAACTGG TACTGGTGCT ATCACAGTTC CAACATCTTT ATCAAACGTT
ATCGTAGAAA AACTTGTTGA AGAAGCAGCT CTATTTGGTC GTGCTAAATC TTTCACGCCA
GTATCTGGTA CTTTAGAAGT ATTACGTGAG AAAAATATCG GAGACGCTAC ATTCATCGGT
GAAATGGAAG ATGCTTCAAT GTCTGATTTC ACATTCGACA AAGTAACTCT TGAACAACGT
CGTGCTGCTA CAGCTATCGA ATTATCTCAA CAACTTGTAA ACGACTCAGG AATTGACGTT
GTTAACTATG CTGTTGGTGT AATGACTCGT CGTCTTGCTC GTAAGCTTGA TGAAACAGTT
CTTAACGGTG ACAAAACTAA AAAGCAATTC GAAGGTATCT TAACTTCAAC TGTTGCTGAA
GTAGTTGGCA CTCATGAAGC TGGTAAGATT TCTCTTGACA ATCTTTTAGA TATGACTCTT
GCTGTTCACC CAGACCACTT AGCTGGTTCA GTGTTCGTAA TGGGACGTCC TGCTTTCAAC
CAAGTTGCTA AGCTTAAAGA TGCTCAAGGA AACTACCATG TAGTTAAAGA CGTTGTAAAT
GGCAAACCAG TTTACAAAAT CTTCGGACAT GAAATCTTAA TCCAAGACAA AATGCCGTCA
GCTGCTGCTG GTTCAATCAC TGTAGTATTC ATCAACTTCG CTGAAGCTTA CGCTACTATG
ATTAAAAAAG GCGCTCAAAT GAAACGTATC TCTGACGATA CTAAACAAGC TTTACGTGGC
TCTCATATGT TAATGCTTGA TATGTATTGT GACGGAAAAA TTCTTAATGA AGATGCAATT
AAGTTCTTAA AACAAGCTTA A
 
Protein sequence
MSKEKELKEL RAKMEAMEAE VRAEQETAQE VEVRDVEVDQ TEVELRGVEQ FLKGDIHGAE 
VRTMTTGTGA ITVPTSLSNV IVEKLVEEAA LFGRAKSFTP VSGTLEVLRE KNIGDATFIG
EMEDASMSDF TFDKVTLEQR RAATAIELSQ QLVNDSGIDV VNYAVGVMTR RLARKLDETV
LNGDKTKKQF EGILTSTVAE VVGTHEAGKI SLDNLLDMTL AVHPDHLAGS VFVMGRPAFN
QVAKLKDAQG NYHVVKDVVN GKPVYKIFGH EILIQDKMPS AAAGSITVVF INFAEAYATM
IKKGAQMKRI SDDTKQALRG SHMLMLDMYC DGKILNEDAI KFLKQA
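
The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the database's own method: `translate` and `gc_content` are hypothetical helper names, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. Spaces and line breaks in the pasted sequence are ignored.

```python
from itertools import product

# Standard codon table in TCAG order (assumed to match table 11
# for amino-acid assignments; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    first_blocks = "ATGTCTAAAG AAAAAGAATT AAAAGAATTA"
    print(translate(first_blocks))  # MSKEKELKEL
```

Pasting the full gene sequence into `translate` should give a string to compare against the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field of the record.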