Gene BAS3509 details

Gene Information

Locus tag: BAS3509
Symbol:
ID: 2852771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3477839
End bp: 3479050
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 36%
IMG OID: 637506750
Product: hypothetical protein
Protein accession: YP_029763
Protein GI: 49186511
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
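
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record.
# Assumption: 1-based, inclusive positions on the replicon.
start_bp = 3477839
end_bp = 3479050

# Inclusive coordinates, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1212 bp
print(protein_length, "aa")   # 403 aa
```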


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGTTAT TTGATAAGAT CTTTGGTAAG AAACAGGCTC CTACTACAAC TCGTTTTGAA 
ATGATAAACG ATAATGGTGG AGGTTTTTTT GCGTGGAATG GGGATATCTA TCAAAGTGAC
ATTATACGGG CTTGTATACG TCCTAAAGCA AAAGCAGTCG GTAAGCTGAT AGCCAAGCAT
ATACGAGATA ACTCTACTGA ATTTAAGGTG AATCCAGATT CTTATATGAG ATTTTTACTG
GAAGAGCCTA ATCCATTGAT GACAGGACAA ATGTTTCAAG AGAAAATGGC TGTTCAATTA
GAGTTGAATC ATAATGCATT CGCTTATATT AAGCGTGATG ATTTTGGTTA TCCTACTGAG
ATTTATCCTA TTCCATGTAC AACAGTTGAA GTTGTAGAAG GTGCACAGGG AGACATCTTT
TTAAAGTTTT ATTTTAAAAA TGGTAAGCAG ATGACGATTC CGTATACAGA TATCATTCAT
TTACGTAAAG ACTTTAATGA TAATGACTTT TTCGGAGAAC ATCCTGGTAA TGCATTAGCT
CAGTTAATGG AGATTGTAAC AACTACTGAT CAAGGTATTG TTAAAGCTAT TAAAAATAGT
GCAGTAGTAA AGTGGATTCT TAAGTTTAAG TCAGTATTAA AACAAGAGGA TATTGATAGT
CAGGTAAAAA ACTTTGTGAA CAACTATTTG AATATCTCGA ATGATGGCGG AGCAGCTTCT
TCGGATCCGA GGTATGATTT AGAACAAGTG AAACCTGAAG CGTTTGTACC AGATTCCAAG
CAGATGCAAG AAACAGTACA ACGTATTTAT AATTTCTTTA ATACAAACGA AAAGATTATC
CAAAGTAAAT ATAACGAAGA TGAATGGAAT GCTTATTATG AATCAGAAAT AGAGCCATTT
GCAATGCAGC TTGCTGGAGA ATTTACCAGG AAGCTTTTTT CGCGTCGGGA AAGAGGGTTT
GGTAACAGAA TTATCTTTGA ATCTTCTTCA CTTCAATACG CTTCTATGGG GACCAAAATG
AATCTTGTTC AGATGGTAGA TAGAGGCTCT TTGACACCAA ATGAATGGCG AGCAATTCTT
TCACTTGGTC CAATTGAAGG TGGAGATAAG CCAATTAGAA GGTTAGATAC AGCACTGGTT
AAGGAAGGGA ATGTCACTGA TGAAGGAGGT GATGACAATG AACAAGACGG AAAAGAGGGA
GCTACTGAGT AG
 
Protein sequence
MGLFDKIFGK KQAPTTTRFE MINDNGGGFF AWNGDIYQSD IIRACIRPKA KAVGKLIAKH 
IRDNSTEFKV NPDSYMRFLL EEPNPLMTGQ MFQEKMAVQL ELNHNAFAYI KRDDFGYPTE
IYPIPCTTVE VVEGAQGDIF LKFYFKNGKQ MTIPYTDIIH LRKDFNDNDF FGEHPGNALA
QLMEIVTTTD QGIVKAIKNS AVVKWILKFK SVLKQEDIDS QVKNFVNNYL NISNDGGAAS
SDPRYDLEQV KPEAFVPDSK QMQETVQRIY NFFNTNEKII QSKYNEDEWN AYYESEIEPF
AMQLAGEFTR KLFSRRERGF GNRIIFESSS LQYASMGTKM NLVQMVDRGS LTPNEWRAIL
SLGPIEGGDK PIRRLDTALV KEGNVTDEGG DDNEQDGKEG ATE
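
The gene sequence begins with TTG, yet the protein begins with M. That is consistent with translation table 11, where TTG is an accepted start codon and an initiating codon is read as methionine. The Python sketch below shows the relationship on the first and last codons of the gene sequence. The function name `translate_table11` and its structure are illustrative assumptions, not the tool that produced this record; the codon table is written out by hand so the example needs no external library.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments of table 11 match the standard code; only the set of
# permitted start codons differs.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_table11("TTGGGGTTAT TTGATAAGAT CTTTGGTAAG"))  # MGLFDKIFGK
```

Applied to the last four codons of the gene sequence (`GCTACTGAGTAG`), the same function returns `ATE`, the final three residues of the protein sequence, and stops at the TAG codon.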