Gene GBAA_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_0468 
Symbol 
ID2817554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp467993 
End bp469183 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content35% 
IMG OID637787438 
Productprophage LambdaBa04, major capsid protein 
Protein accessionYP_017089 
Protein GI47525740 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAG AGCAATTATT AAAACGTAAA TCTGAAATCG GTGAATTATT AAGTGATGAA 
ACGCGTTCTA TTGATAATCT TGATACGATT GAAACGGAGT TACGAGACAT TAATGAACAG
TTGGCAGCAA TTGAAAAACG TGAACAACTT TTAAATGAAG CACGTTCAAT TAATGAAGGG
AACGCAGCTG GAGCAAATAA AATCGAAACA TTTAATGCTA ATCTTAATCT ATCAAATGAG
AAACGTGAAA TTGGTACGAA TACAGTTGAA TACCGTAATG CTTTTATGAA TTACGTATTA
CGTGGTGAGG CAATTCCAGC TGAATTACGT GCAAATGCTG TGACTAAAAC AAGTGACATC
GGTTCTGTTA TCCCGCAAAC AGTACTAGAT AAGATTATTG AAAAGATTGA AGCGGTAGGG
ATGATTCTAC CTTTAATTAC TCGTACAGCT ATTAAAGGAG GGGTAACAGT ACCAACCTCA
GCAGTTAAAC CAGTTGCAAC ATGGGTTGCT GAAAGTTCTG GAAGTGATAA ACAAAAGAAA
ACTACAGGAA GCATTACTTT CAACTATCAT AAATTACGTT GTGCTGTAGC TGTTTCTCTT
GAAGTAGAAA CAATGTCTTT AGCAGTATTT GAAACAACAT TAATCAATAA TATTGTGGAA
GCTATGACAA AAGCGATTGA ACAAGCTATT GTTAGTGGTG ATGGGTCTGG TAAACCAAAG
GGAATCCTAG CGGAAACACC GTTTGAAGGA CAAGCATTAG ATGTTGCGAA AATCAATTAT
AAAACTTTAA CAGATGCAGA AGCAGCTTTA CCACTTGAGT ATGAAGCAAG TGCAATTTGG
ACGATGACGA AAAAGACATT TATGGAGTTT TCGGCAATGA CAGATGCAGA CGGACAGCCA
ATTGCGCGTA CAAATTACGG AATTTCTGGT AAACCAGAAC GTATTTTATT AGGTCGTCCA
GTTGTTTTAT GTAATTATGT TGATAGTTTC GCAACGGCTA CTGAAGGAAC AGCGTTCGCA
TTCTTATTTA ATTACAAAGA TTATATTCTT AATACAAACT ACCAAATGGG TGTTAAGAAA
TATGAAGACA ATGAAACTGA CGATCAAGTT ACAAAAGCAA TTATGATTGT GGATGGTAAA
GTAGTAGACA AAAATTCTTT AGTTGTTTTA AAAAAAGCTC CATCAGCTTA A
 
Protein sequence
MNKEQLLKRK SEIGELLSDE TRSIDNLDTI ETELRDINEQ LAAIEKREQL LNEARSINEG 
NAAGANKIET FNANLNLSNE KREIGTNTVE YRNAFMNYVL RGEAIPAELR ANAVTKTSDI
GSVIPQTVLD KIIEKIEAVG MILPLITRTA IKGGVTVPTS AVKPVATWVA ESSGSDKQKK
TTGSITFNYH KLRCAVAVSL EVETMSLAVF ETTLINNIVE AMTKAIEQAI VSGDGSGKPK
GILAETPFEG QALDVAKINY KTLTDAEAAL PLEYEASAIW TMTKKTFMEF SAMTDADGQP
IARTNYGISG KPERILLGRP VVLCNYVDSF ATATEGTAFA FLFNYKDYIL NTNYQMGVKK
YEDNETDDQV TKAIMIVDGK VVDKNSLVVL KKAPSA