Gene BAS0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0444 
Symbol 
ID2853084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp466481 
End bp467479 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content36% 
IMG OID637503669 
Productphage-like 
Protein accessionYP_026724 
Protein GI49183472 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTA TTCTAGAAGA GCCGAACCCG TTAATGACAA GTCAAATGTT TCAAGAGAAA 
ATGACAGTGC AATTAGAATT GAATCACAAC GCATTTGCTT ATATTAAACG TGATGAACTC
GGCTTTGCAA CTGAATTATA CCCAATTCCT TGTATTACTG TAGAAGTTGT GGAAGGTATT
CAAGGCGACA TTTTCTTAAC TTTTTATTTT AAAAACGGAA AGAGAATGAC GGTTCCGTAT
GTAGATGTGA TTCATTTAAG AAAAGATTTC AATGAAGATG ACTTTTTTGG AGAACATCCA
GGACAAGCGT TGTCTTCATT AATGGATATT GTAACAACTA CAGATCAAGG GATTGTTAAA
GCGATTAAAA ATAGCGCAAT TATTAAGTGG ATTCTTAAGT TTAAGTCCGT TTTGAAACAA
GAAGATATAG ATATGCAAGT TGGTAATTTT GTTAAAAACT ATCTAAATAT TGATAACGTA
AATGGAGGCG CGGCGGCTAC TGATCCACGT TATGATTTAG AGCAAGTTAA AAATGAAGCA
TTTGTTCCTG ACTCAAAACA GATGCAAGAA ACAACGCAAA GGATTTACAA CTTCTTCAAT
ACAAACGAAA AAATTATACA AAGTAAGTAT ACAGAAGATG AATGGAATGC GTATTATGAA
TCTGAAATTG AACCGTTGGC AATGCAGCTT GCTGGAGAAT TTACCAGGAA GCTTTTTTCA
CGCCGTGAAC GTGGTTTCGG TAACAAAATC ATTTTTGAAG CAGCGAGTCT TCAGTACGCT
TCTATGTCAA CAAAGATGAA CTTAGTTCAA ATGGTTGATA GAGGAGCAAT GACACCAAAT
GAGTGGCGCT CTATTCTTTC ATTAGGTCCG ATTGAAGGTG GAGATAAACC AATTAGAAGG
CTTGATACAG CGCTAGTTAA AGATGGAAAT GTAATTGATA AAGGAGGTGG GAAAATTGGA
CAAGACGGAA ACAAGGGAAA TAGTAACACA GAAGATTGA
 
Protein sequence
MRFILEEPNP LMTSQMFQEK MTVQLELNHN AFAYIKRDEL GFATELYPIP CITVEVVEGI 
QGDIFLTFYF KNGKRMTVPY VDVIHLRKDF NEDDFFGEHP GQALSSLMDI VTTTDQGIVK
AIKNSAIIKW ILKFKSVLKQ EDIDMQVGNF VKNYLNIDNV NGGAAATDPR YDLEQVKNEA
FVPDSKQMQE TTQRIYNFFN TNEKIIQSKY TEDEWNAYYE SEIEPLAMQL AGEFTRKLFS
RRERGFGNKI IFEAASLQYA SMSTKMNLVQ MVDRGAMTPN EWRSILSLGP IEGGDKPIRR
LDTALVKDGN VIDKGGGKIG QDGNKGNSNT ED