Gene GBAA_3784 details

Gene Information

Locus tag: GBAA_3784
Symbol:
ID: 2817908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3475537
End bp: 3476700
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 35%
IMG OID: 637790511
Product: phage major capsid protein
Protein accession: YP_020421
Protein GI: 47529072
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
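
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the values listed here) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3475537, 3476700)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Both results agree with the Gene Length and Protein Length fields of the record.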


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAACAT TACAAGAAAT TTTAACTAGA AAATCAGAAA TTCGTTCAAT GTTACAAAGC 
GATAAGGAAG TAGATTTAGC AGCATTAGAA ACAGAATTAC GAGATCTTGA AGAAACACAA
AAACAAATTG AAACTCGACA AAGATTATTA AAAGAAGCAG AGGAGATTAA TAATAATCAA
ATGCCTGAAA TTCGTACAGT TGAAACATTT AACAATGAAC CTCAGAAACA AGATGTAGAA
TTAGAGACTT CTGAAAAGCG TGGACAAGCT CTAATGGAAA ACCGTGCTGT TACAGTTGGA
AGCGGTAATG TAGTTTTACC TAAGCATAGT GCAACAGATA TTCGTCCGAC TTTCAATGAA
GTGTCTACAC TGATTGATCG TGTTTCTTCT AAAACTTTAA AAGGTGGAGA GAGTTACCAA
CAGCCGTACA TTAAAAGTTA TGGAGAAGGT GATTATACCA CTGAAGGTAA TGACTACAAT
ACATCAGAAA CAACGTTTGG ATATGCAGAT ATCACAAAAG CAAAAGTTAC AGCTTATTCA
GAGGACACAG AAGAGCTTCA AAAATTACCA GCAGCTGATT ACGATGCTGA AGTAATGAAG
GGGATTACGG TAGCTACTCG TAAAAAGTTA ACTCGTGAAA TTTTAATTGG GACAGGTGCT
ACAAATCGAC TTGTTGGTAT TTTCTCAGCA GCAGCTACGG CAATTGATTC AGAAACAGAT
TTAGAAATTT CAGCAATTGA TGCATCTACA TTGGATGAGA TTATCTATAG CTATGGTGGA
GATGAAGATG TAGAAGATGC GGCAGTATTG ATTTTAAATA AACTAGATTT AAAATCATTT
GCTAAGCTTC GTACTTCTGA TGGTAAAAAG GTATATAACG TAGTATCACA AGGTAATTCT
GGAACAATTG ATGGGGTACC ATTCATTATT AATAGTGCTT GTAAGGCTGT TTCTGATGCT
AAAACAACAG CTGGACAATA TAGCATGGCA TATGGTCCTT TATCAAACTA TCAACTTACT
ATTTTCTCAG ATATGGACGT TCAACGATCT ACAGACTTTA AATTCAAGCA AGGTATGATT
GCTCATAGAG GTTCTGTTTT TGCAGGTGGT AACGTAATTT CTAAAAATGG ATTCTTACGA
GTGAAGAAAG CGGCTACTGT ATAA
 
Protein sequence
MKTLQEILTR KSEIRSMLQS DKEVDLAALE TELRDLEETQ KQIETRQRLL KEAEEINNNQ 
MPEIRTVETF NNEPQKQDVE LETSEKRGQA LMENRAVTVG SGNVVLPKHS ATDIRPTFNE
VSTLIDRVSS KTLKGGESYQ QPYIKSYGEG DYTTEGNDYN TSETTFGYAD ITKAKVTAYS
EDTEELQKLP AADYDAEVMK GITVATRKKL TREILIGTGA TNRLVGIFSA AATAIDSETD
LEISAIDAST LDEIIYSYGG DEDVEDAAVL ILNKLDLKSF AKLRTSDGKK VYNVVSQGNS
GTIDGVPFII NSACKAVSDA KTTAGQYSMA YGPLSNYQLT IFSDMDVQRS TDFKFKQGMI
AHRGSVFAGG NVISKNGFLR VKKAATV
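
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an alternative initiation codon and is read as methionine when it starts a CDS, although it encodes leucine elsewhere. The sketch below illustrates this with a hand-built codon table. The set of start codons is written from the NCBI description of table 11 as an assumption, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, not taken from any library.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons for table 11 (assumed from the NCBI genetic code listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("TTGAAAACAT TACAAGAAAT TTTAACTAGA"))  # MKTLQEILTR
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives a way to cross-check the Protein sequence and GC content fields.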