Gene BAS5274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5274 
Symbol 
ID2852403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5156843 
End bp5158243 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content32% 
IMG OID637508528 
Producthypothetical protein 
Protein accessionYP_031512 
Protein GI49188259 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAGCG AAAATAGTAA AAGTAATAAA TTTGAAAATT TTCTACTAAT CTTTATTTTG 
TTGCAACCGA TTTTAGATTT ACTTACAGCG TTCTGTATCA TGGTTTTAAA AATTGATACG
ACCATTGGAG TTATTACACG CTTATTCGTT ATGTTCCTAG GCGGGATATA TATTTTAATT
CAAACGAAGA AACAAGGAAA TATAAAATAT ATATTGTATA TAATCTTAGT AGGAATAGTC
TTTACTATCG GATTGGTAAA TAATAAGCTT ACAAAAGATC CTATGGTACT TACAGAAGAA
ATGAAGTTTA TAGCAAAAGC TTTATATCCG TTTGTTATGC TTACTTGTTA TGTGTTTGTG
TTTAAGTCTT TAAAAGAAAA AAGTCATTCG AAAATGCGTA ACTACATTAC GTATGCATCG
CTAATTATAG GAGTAGTAAT GGTAGCTTCG ATTACTACTG GCACAGATTA TAATAGTTAC
GAATGGGTAA AGTTAGGATC ACGCGGTTGG TTTTATGCAG GTAACGAGCT AGGATCTATT
TTAGCTATCA TGTGTCCAAT TGTTATTTTG TACTCAATCG AAAAAACAAA AAGTATAGGT
AAAGCATATT ATTGGATTCC TTCAATTTTA GTTGTTTATT CACTATTTGC AATTGGGACA
AAGGTTGGCG TAGGAGCAAT TTTTGGATCA ATGGCTATCG CAGTTGTTAT GTGCTTTATT
CAAGCATTTA CACAACGTAA AGATGGAAAG AAACATGCCT ATTTATTAAA TGGTTTTCTT
GCAATGACTG TATTTGTAGG CATATTGGCA TATACACCTT TTTCACCATT TATGAAAAAT
ATGGGATTCC ATTTCCAATT AATTGAACAA GAACAGAGTG CGAAAAAGGA AGAGAAAAAG
AAGGAGGAAG CTAAAGAACA CAAACCTCCT GTGACACAGC AAGAAAAAGA AAAAGAGAAA
GAAAAAGAAA AAGTAGCAGA GAAAAAAGAA GAAACGCAAG CTCTTATTTT TAGTGGGCGT
CAACTATTTG AACAAATGTA TAAAGATTTC TATAATGAAG CCCCAATGTC TCAAAAATTA
TTAGGAATGG GTTATGCGGG TAACTATAAA GAACAGCCGA AATTAATTGA ACGTGATTTC
CACGATTGGT TCTATTCTTT CGGAATCATA GGGTTCATTT TACTTGTAAT TCCATTCTTA
TACTTCGGTA TTAAATTTAT TGCATGTATA TTTACTAAAT TTAAACAAAT ATTCACTGTG
AAATATGCAA TGGTGATTGC GGCGATACTT CTTGGATTAG GTATCTCATT CATGGCTGGT
CATATTTTAA TTGCACCTGG AGTGAGTTTC TACTTAGTAG TAATCATGGC ATATTTAATA
GTTGACCTGG AAATTGAATG A
 
Protein sequence
MLSENSKSNK FENFLLIFIL LQPILDLLTA FCIMVLKIDT TIGVITRLFV MFLGGIYILI 
QTKKQGNIKY ILYIILVGIV FTIGLVNNKL TKDPMVLTEE MKFIAKALYP FVMLTCYVFV
FKSLKEKSHS KMRNYITYAS LIIGVVMVAS ITTGTDYNSY EWVKLGSRGW FYAGNELGSI
LAIMCPIVIL YSIEKTKSIG KAYYWIPSIL VVYSLFAIGT KVGVGAIFGS MAIAVVMCFI
QAFTQRKDGK KHAYLLNGFL AMTVFVGILA YTPFSPFMKN MGFHFQLIEQ EQSAKKEEKK
KEEAKEHKPP VTQQEKEKEK EKEKVAEKKE ETQALIFSGR QLFEQMYKDF YNEAPMSQKL
LGMGYAGNYK EQPKLIERDF HDWFYSFGII GFILLVIPFL YFGIKFIACI FTKFKQIFTV
KYAMVIAAIL LGLGISFMAG HILIAPGVSF YLVVIMAYLI VDLEIE