Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS4981 |
Symbol | |
ID | 2852344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 4855469 |
End bp | 4856548 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637508235 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_031220 |
Protein GI | 49187967 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAA ACGATAAAAA CTCAACGGAG GAACCTATTA TGTCTAAAGA AAAAGAATTA AAAGAATTAC GTGCCAAAAT GGAAGCAATG GAAGCGGAAG TTCGTGCAGA GCAAGAAACA GCTCAAGAAG TAGAAGTTCG TGATGTAGAA GTTGACCAAA CAGAAGTTGA GCTACGTGGA GTAGAACAAT TCTTAAAAGG CGACATTCAC GGTGCTGAAG TTCGTACAAT GACAACTGGT ACTGGTGCTA TCACAGTTCC AACATCTTTA TCAAACGTTA TCGTAGAAAA ACTTGTTGAA GAAGCAGCTC TATTTGGTCG TGCTAAATCT TTCACGCCAG TATCTGGTAC TTTAGAAGTA TTACGTGAGA AAAATATCGG AGACGCTACA TTCATCGGTG AAATGGAAGA TGCTTCAATG TCTGATTTCA CATTCGACAA AGTAACTCTT GAACAACGTC GTGCTGCTAC AGCTATCGAA TTATCTCAAC AACTTGTAAA CGACTCAGGA ATTGACGTTG TTAACTATGC TGTTGGTGTA ATGACTCGTC GTCTTGCTCG TAAGCTTGAT GAAACAGTTC TTAACGGTGA CAAAACTAAA AAGCAATTCG AAGGTATCTT AACTTCAACT GTTGCTGAAG TAGTTGGCAC TCATGAAGCT GGTAAGATTT CTCTTGACAA TCTTTTAGAT ATGACTCTTG CTGTTCACCC AGACCACTTA GCTGGTTCAG TGTTCGTAAT GGGACGTCCT GCTTTCAACC AAGTTGCTAA GCTTAAAGAT GCTCAAGGAA ACTACCATGT AGTTAAAGAC GTTGTAAATG GCAAACCAGT TTACAAAATC TTCGGACATG AAATCTTAAT CCAAGACAAA ATGCCGTCAG CTGCTGCTGG TTCAATCACT GTAGTATTCA TCAACTTCGC TGAAGCTTAC GCTACTATGA TTAAAAAAGG CGCTCAAATG AAACGTATCT CTGACGATAC TAAACAAGCT TTACGTGGCT CTCATATGTT AATGCTTGAT ATGTATTGTG ACGGAAAAAT TCTTAATGAA GATGCAATTA AGTTCTTAAA ACAAGCTTAA
|
Protein sequence | MEKNDKNSTE EPIMSKEKEL KELRAKMEAM EAEVRAEQET AQEVEVRDVE VDQTEVELRG VEQFLKGDIH GAEVRTMTTG TGAITVPTSL SNVIVEKLVE EAALFGRAKS FTPVSGTLEV LREKNIGDAT FIGEMEDASM SDFTFDKVTL EQRRAATAIE LSQQLVNDSG IDVVNYAVGV MTRRLARKLD ETVLNGDKTK KQFEGILTST VAEVVGTHEA GKISLDNLLD MTLAVHPDHL AGSVFVMGRP AFNQVAKLKD AQGNYHVVKD VVNGKPVYKI FGHEILIQDK MPSAAAGSIT VVFINFAEAY ATMIKKGAQM KRISDDTKQA LRGSHMLMLD MYCDGKILNE DAIKFLKQA
|
| |