Gene Information |
Locus tag | GBAA_5360 |
Symbol | |
ID | 2819353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 4854297 |
End bp | 4855337 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637792033 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_022018 |
Protein GI | 47530669 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
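The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The function names are illustrative and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4854297, 4855337)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```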
| |
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAAG AAAAAGAATT AAAAGAATTA CGTGCCAAAA TGGAAGCAAT GGAAGCGGAA GTTCGTGCAG AGCAAGAAAC AGCTCAAGAA GTAGAAGTTC GTGATGTAGA AGTTGACCAA ACAGAAGTTG AGCTACGTGG AGTAGAACAA TTCTTAAAAG GCGACATTCA CGGTGCTGAA GTTCGTACAA TGACAACTGG TACTGGTGCT ATCACAGTTC CAACATCTTT ATCAAACGTT ATCGTAGAAA AACTTGTTGA AGAAGCAGCT CTATTTGGTC GTGCTAAATC TTTCACGCCA GTATCTGGTA CTTTAGAAGT ATTACGTGAG AAAAATATCG GAGACGCTAC ATTCATCGGT GAAATGGAAG ATGCTTCAAT GTCTGATTTC ACATTCGACA AAGTAACTCT TGAACAACGT CGTGCTGCTA CAGCTATCGA ATTATCTCAA CAACTTGTAA ACGACTCAGG AATTGACGTT GTTAACTATG CTGTTGGTGT AATGACTCGT CGTCTTGCTC GTAAGCTTGA TGAAACAGTT CTTAACGGTG ACAAAACTAA AAAGCAATTC GAAGGTATCT TAACTTCAAC TGTTGCTGAA GTAGTTGGCA CTCATGAAGC TGGTAAGATT TCTCTTGACA ATCTTTTAGA TATGACTCTT GCTGTTCACC CAGACCACTT AGCTGGTTCA GTGTTCGTAA TGGGACGTCC TGCTTTCAAC CAAGTTGCTA AGCTTAAAGA TGCTCAAGGA AACTACCATG TAGTTAAAGA CGTTGTAAAT GGCAAACCAG TTTACAAAAT CTTCGGACAT GAAATCTTAA TCCAAGACAA AATGCCGTCA GCTGCTGCTG GTTCAATCAC TGTAGTATTC ATCAACTTCG CTGAAGCTTA CGCTACTATG ATTAAAAAAG GCGCTCAAAT GAAACGTATC TCTGACGATA CTAAACAAGC TTTACGTGGC TCTCATATGT TAATGCTTGA TATGTATTGT GACGGAAAAA TTCTTAATGA AGATGCAATT AAGTTCTTAA AACAAGCTTA A
Protein sequence | MSKEKELKEL RAKMEAMEAE VRAEQETAQE VEVRDVEVDQ TEVELRGVEQ FLKGDIHGAE VRTMTTGTGA ITVPTSLSNV IVEKLVEEAA LFGRAKSFTP VSGTLEVLRE KNIGDATFIG EMEDASMSDF TFDKVTLEQR RAATAIELSQ QLVNDSGIDV VNYAVGVMTR RLARKLDETV LNGDKTKKQF EGILTSTVAE VVGTHEAGKI SLDNLLDMTL AVHPDHLAGS VFVMGRPAFN QVAKLKDAQG NYHVVKDVVN GKPVYKIFGH EILIQDKMPS AAAGSITVVF INFAEAYATM IKKGAQMKRI SDDTKQALRG SHMLMLDMYC DGKILNEDAI KFLKQA
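The sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and scored for GC content. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The amino acid string is the standard NCBI codon ordering (bases in the order T, C, A, G), which translation table 11 shares with table 1 for elongation codons. The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGTCTAAAG AAAAAGAATT AAAAGAATTA"  # first 30 bases of the gene sequence
print(translate(first_blocks))  # MSKEKELKEL
```

Applied to the full gene sequence, `translate` is expected to reproduce the listed protein sequence, and the value from `gc_percent` can be compared against the reported GC content of 39%.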