Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS4979 |
Symbol | |
ID | 2852496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 4853669 |
End bp | 4854979 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637508233 |
Product | hypothetical protein |
Protein accession | YP_031218 |
Protein GI | 49187965 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTTT TTAGTGGCTT ATTTTCAAAA AAACCTGTAC TGGAAGAGCG TTCAACATAT GACTCAATGG AAGTTGACGG AGCTTTTTCT CTTGATAGTT TACTAGTTAC AGATGCTGTA ACTGAAGAAA AAGTATTGAA AATTCCTACA GCTCGTTCGT GTGTTGAATT AATTACAAGC TCAATCGCAC AGATGCCTGT TTATTTATAC AAGGAAAATG CTGATGGCTC AGTCGAGCGA ATTTTAGACG ACAATCGTGT TCATTTGCTA AATCATGAAG CCAATGACTT TTTAAACGGT TATTCCTTAA AGAAACACAT GGTCAAGGAT TACTTATTAC ATGGTTCCTC TTATGTATCA ATCATTGAGG CAGGTAACAC AATTTTAGAG CTTCACCCAT TGCTTTCAAA AGCTATTGTG GTTAACAAAC GAATTAAGCA CGGTTATCGC ACAGTTGGTG CTGATATTTT CTTATCAAAC AGTGAAAATG GCGCTGTTAA CGAGCTTAAT CGCCAACAAA CTAAGTTTAA ACCACATGAG CTTATGATTA CATTGCAAGA TACAAACGAT GGTTTAACAA GTCATGGTGT TATTAAACAT GGTCAAGACA TTTTTAAACA AGCACTTTCT GAATCAGTTT ACACTCATAA CTTGTATGAA AATGGAGCGC TTCCTTTGGG TCTTCTTAAG ACAGACGCTC GACTAAATAA AAAGCAAGCT TCTAGTTTAC GAGAAGCTTG GCAAAAACTT TACGGTGGAG TTAAGAATGC AGCTAAAACT GTTGTACTTC AAGAAGGTAT GAAATACGAA GCTCTTTCAA TGAATCCTTC AGAAATTCAA ATGTCTGAAA CTCGTAAAGC TACAAACTCT GAAATTTGTA AATTGTTTGG TGTGCCTGAA AGCATGGTTA ACGCAACAAT TGGCAAACAG TACGTTTCAC TGGAGCAAAA TCAATTATAT CTCTTAAAGA ATACTCTATC TCCAATCATC GTTGCTATGG AAAGCTCGAT GGACAAAGCA CTTTTGCTTG AGTCTGAAAA AGACAAAGGT TATTTCTTCA GATTTGATAC TTCAGAGCTT ATCCGCTCGA CTGAAAAAGA GCTAGTTGAT ACTGTGGTAA CTGCTGTTCA AGGCGGTATT TTCACAATTA ACGAAGGACG AGCTAAGTTT AACTTGCCAT CGATTGATGA AGGCGACAAT GTACTCGTAA CGCCTGGTGC TAGTCAAATG GGCGATAAAA ATACAAAAGA AACAACCGAT CCACACGAGG AGGAACAATT AAATGACAAA ACTGGAACTC AGACAGATTG A
|
Protein sequence | MGLFSGLFSK KPVLEERSTY DSMEVDGAFS LDSLLVTDAV TEEKVLKIPT ARSCVELITS SIAQMPVYLY KENADGSVER ILDDNRVHLL NHEANDFLNG YSLKKHMVKD YLLHGSSYVS IIEAGNTILE LHPLLSKAIV VNKRIKHGYR TVGADIFLSN SENGAVNELN RQQTKFKPHE LMITLQDTND GLTSHGVIKH GQDIFKQALS ESVYTHNLYE NGALPLGLLK TDARLNKKQA SSLREAWQKL YGGVKNAAKT VVLQEGMKYE ALSMNPSEIQ MSETRKATNS EICKLFGVPE SMVNATIGKQ YVSLEQNQLY LLKNTLSPII VAMESSMDKA LLLESEKDKG YFFRFDTSEL IRSTEKELVD TVVTAVQGGI FTINEGRAKF NLPSIDEGDN VLVTPGASQM GDKNTKETTD PHEEEQLNDK TGTQTD
|
| |