Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_0481 |
Symbol | |
ID | 2817945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 480505 |
End bp | 482139 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637787451 |
Product | phage minor structural protein |
Protein accession | YP_017100 |
Protein GI | 47525751 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAACACA TAAAACTTTA CGATAAACAA TTACAGTTAA AAGCATATCT CGAAAACGCA TTTAAAATAA AGTACAGCCC GCCACTCAAT GAACTTTGGA CGGCGGGTTT TTCATTACCT TTTACCGATC CTAAGCGAGA GGAAATTGAA ACGTTTGATT ATGTGGAAAT ATTTGATAAC GGTAAACGTA TCGGATTGTT CCGTATTATG GATAGCGAGG AAGAAAGGGA AGTAAGTCAA AAAATAATAA CTTATGACTG TGAGCATGTT TTATCTACTT TAATGGATAG CGTGCTTTTT GGTTATCACG AAAGAATTAA TTTAACGACA AGGGAGAATA TTGAGTATCT CTTAAGCAAA CAAAGAATAA AGCATTGGAA ACTTGGTCAA TGTGATTTTG TAAAATATTT TCAATATAGT TGGGAAAATG AAGATACGAT ATTAGGACCG ATATATAGTA TTGCAAAACC GTTTGATGAG AAATTCCAAT GGACATGGGA CGATACTTCC TATCCTTGGA CTTTAAATCT CGTGAAATAT TCCGATGAAA TTACAGGTGA GCTTCGATAT AGAAAGAACA TGAAAGGCAT TAAGCGAAAG GTAGAAGCTA AGGATGTCAT GACTAGAATT TATCCACTAG GTTATGGTGA AGGTGTTAAT CAACTTAATA TCAAAAGTGT GAACAATGGT CTTCCTTATA TAGATGCTCC TGATTTTGTC AGAGAGTTAC AGGATGGGTT TGATTATATT TGGGTGGATA GACGATTTGA GGACCCAAAA ACGCTTTATG CTTCAGCAAA AGCGATGTTA TTAAAAGCGT GTATGCCAAA AGTTACATAT GAAATTGATG CAATTGATTA TGAATTGATT GACCCATACA AAATAGAAAA ATATGAAACA GGCAAGTTAG TACGTCTATA TGATGAGGAT TTTAATATAT CTGTTGATTT ACGAGTAATG GACCGTTCAA AAGATGATGT TACTGGTAAT CCACTTGATG TAAAGCTTGT ATTAGAAAAT AAAGTCACTG ATATTGGGAC AATACAAGCA GATATTGAAA AGCGACAGAA AGTAAACGAA GTGTATTCTC AAGGAACAAC TAATATTGAT AGTCAACCTT TCCAAGATAA TTGCGACCCA GATCATCCAG CCATCATAAG GTTTCAAATA CCTAACGATG TTAAGAATGT GAATCAACTG ATACTGACAT TTGAGACATT ACGGTTTAGA GCATATGAGC GGGCTATTAA AGGCGGCGGA GCTGTCGTTG CTTCAACTTC TGCCGGTGGT GCGAATGTAT CTTCAACAAG TTCCGGTGGT GGTACGGTTG GTTCGACGTC ATCAGGCGGA AGTTCTGTGC AAGCAAGTAG TTCCGGTGGT GGTACGGTTA AAGCTTCAAG TAGTGGAGGA GATCATGTTC ATAAAATGTT TCATGGTGGC GGAATTGTTC CTGCTGAACC ATCAACAATA GGATTATATA CAGCTTTTTC TGATCCAGGA AGAAATACAT CAGCTTCATT TTACGCAAAA GGAACGGGAT CTAGTTTCTA CACATATGGT TCTAGTGGGA ATCACACTCA TGATATATCA ATACCTAACC ATACTCACAG TATTAATATA AATCATAAAA TATAG
|
Protein sequence | MKHIKLYDKQ LQLKAYLENA FKIKYSPPLN ELWTAGFSLP FTDPKREEIE TFDYVEIFDN GKRIGLFRIM DSEEEREVSQ KIITYDCEHV LSTLMDSVLF GYHERINLTT RENIEYLLSK QRIKHWKLGQ CDFVKYFQYS WENEDTILGP IYSIAKPFDE KFQWTWDDTS YPWTLNLVKY SDEITGELRY RKNMKGIKRK VEAKDVMTRI YPLGYGEGVN QLNIKSVNNG LPYIDAPDFV RELQDGFDYI WVDRRFEDPK TLYASAKAML LKACMPKVTY EIDAIDYELI DPYKIEKYET GKLVRLYDED FNISVDLRVM DRSKDDVTGN PLDVKLVLEN KVTDIGTIQA DIEKRQKVNE VYSQGTTNID SQPFQDNCDP DHPAIIRFQI PNDVKNVNQL ILTFETLRFR AYERAIKGGG AVVASTSAGG ANVSSTSSGG GTVGSTSSGG SSVQASSSGG GTVKASSSGG DHVHKMFHGG GIVPAEPSTI GLYTAFSDPG RNTSASFYAK GTGSSFYTYG SSGNHTHDIS IPNHTHSINI NHKI
|
| |