Gene Information |
Locus tag | GBAA_0468 |
Symbol | |
ID | 2817554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 467993 |
End bp | 469183 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637787438 |
Product | prophage LambdaBa04, major capsid protein |
Protein accession | YP_017089 |
Protein GI | 47525740 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
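The coordinate and length fields above agree with one another, and a little arithmetic confirms it. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(467993, 469183)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1191 396
```

Under these assumptions the results match the Gene Length (1191 bp) and Protein Length (396 aa) fields.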
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAG AGCAATTATT AAAACGTAAA TCTGAAATCG GTGAATTATT AAGTGATGAA ACGCGTTCTA TTGATAATCT TGATACGATT GAAACGGAGT TACGAGACAT TAATGAACAG TTGGCAGCAA TTGAAAAACG TGAACAACTT TTAAATGAAG CACGTTCAAT TAATGAAGGG AACGCAGCTG GAGCAAATAA AATCGAAACA TTTAATGCTA ATCTTAATCT ATCAAATGAG AAACGTGAAA TTGGTACGAA TACAGTTGAA TACCGTAATG CTTTTATGAA TTACGTATTA CGTGGTGAGG CAATTCCAGC TGAATTACGT GCAAATGCTG TGACTAAAAC AAGTGACATC GGTTCTGTTA TCCCGCAAAC AGTACTAGAT AAGATTATTG AAAAGATTGA AGCGGTAGGG ATGATTCTAC CTTTAATTAC TCGTACAGCT ATTAAAGGAG GGGTAACAGT ACCAACCTCA GCAGTTAAAC CAGTTGCAAC ATGGGTTGCT GAAAGTTCTG GAAGTGATAA ACAAAAGAAA ACTACAGGAA GCATTACTTT CAACTATCAT AAATTACGTT GTGCTGTAGC TGTTTCTCTT GAAGTAGAAA CAATGTCTTT AGCAGTATTT GAAACAACAT TAATCAATAA TATTGTGGAA GCTATGACAA AAGCGATTGA ACAAGCTATT GTTAGTGGTG ATGGGTCTGG TAAACCAAAG GGAATCCTAG CGGAAACACC GTTTGAAGGA CAAGCATTAG ATGTTGCGAA AATCAATTAT AAAACTTTAA CAGATGCAGA AGCAGCTTTA CCACTTGAGT ATGAAGCAAG TGCAATTTGG ACGATGACGA AAAAGACATT TATGGAGTTT TCGGCAATGA CAGATGCAGA CGGACAGCCA ATTGCGCGTA CAAATTACGG AATTTCTGGT AAACCAGAAC GTATTTTATT AGGTCGTCCA GTTGTTTTAT GTAATTATGT TGATAGTTTC GCAACGGCTA CTGAAGGAAC AGCGTTCGCA TTCTTATTTA ATTACAAAGA TTATATTCTT AATACAAACT ACCAAATGGG TGTTAAGAAA TATGAAGACA ATGAAACTGA CGATCAAGTT ACAAAAGCAA TTATGATTGT GGATGGTAAA GTAGTAGACA AAAATTCTTT AGTTGTTTTA AAAAAAGCTC CATCAGCTTA A
|
Protein sequence | MNKEQLLKRK SEIGELLSDE TRSIDNLDTI ETELRDINEQ LAAIEKREQL LNEARSINEG NAAGANKIET FNANLNLSNE KREIGTNTVE YRNAFMNYVL RGEAIPAELR ANAVTKTSDI GSVIPQTVLD KIIEKIEAVG MILPLITRTA IKGGVTVPTS AVKPVATWVA ESSGSDKQKK TTGSITFNYH KLRCAVAVSL EVETMSLAVF ETTLINNIVE AMTKAIEQAI VSGDGSGKPK GILAETPFEG QALDVAKINY KTLTDAEAAL PLEYEASAIW TMTKKTFMEF SAMTDADGQP IARTNYGISG KPERILLGRP VVLCNYVDSF ATATEGTAFA FLFNYKDYIL NTNYQMGVKK YEDNETDDQV TKAIMIVDGK VVDKNSLVVL KKAPSA
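The sequences above are printed in space-separated blocks of ten characters. The sketch below, in Python, strips that spacing, computes GC content, and translates a nucleotide sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAATAAAG AGCAATTATT AAAACGTAAA"
print(translate(prefix))  # MNKEQLLKRK
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same kind of check against the GC content and Protein sequence fields.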