Gene BAS3289 details

Gene Information

Locus tag: BAS3289
Symbol:
ID: 2849196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3252795
End bp: 3254696
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 32%
IMG OID: 637506533
Product: MutS family protein
Protein accession: YP_029546
Protein GI: 49186294
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
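
The length fields in this record agree with its coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3252795, 3254696)
print(length)                     # 1902
print(protein_length_aa(length))  # 633
```

Both results match the "Gene Length" and "Protein Length" fields listed above.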


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACGA TGACTTTTGA AAAGTTACAA TATAACGAAT TAAAGGATAT AGTGAAATTT 
TATTGTGTAA GTGGATTAGG AAAAGAATTA ATAAATAAAT TAGAGCCGAG TACGAGTATA
AAAGTGGTAA GGAATCGATT AAATGAAACA ACCGAAGCGC GAGCTATATT AGATGCAGAA
GGGCATGTGC CTTTTTTCGG TATTTCAAAT ATTGCTAGTA CAATTCAAAA ATTAGAAAAA
GGAATGATTT TAGATCCAGA AGAGTTAGTA AGTGTTTCAG ACTTTTTACG CGGATGTAGA
AAGATTAAAA AATTTATGTT AGATAAAGAA TTTTTTGCAC CAGTATTAGC TTCTTATGCA
AATTCAATGA CTGAATATAA AAGTATTGAA GAGGAAATTA ACTTTTCAAT TAAAGGAAAT
AGTATTGATT CTGCCGCTAG TAAAGAGTTA AAACGAATTC GAAATAACAT TGATTCGGTA
GACGGGAAAA TAAAAGAACG TTTAACGAAG TTTTTAAATA GTAGTGCAAA TAAGAAGTAT
ATTCAAGAAT TCTTTATTAG TAAGAAGGAT GATAGGTATA CGATTCCGAT TAAATCTTCT
TATAAAAATC AAGTTGCGGG AAGTATAGTT GAAGCGTCGG CTAAAGGTTC TACTGTATTT
ATAGAACCGC ATACGGTTAC AAAGTTAAAT GCGGAACTTG CAAGTTTGAA AGCAGAAGAA
GCGATGGAAG AATATCAAAT TTTAGCGACT TTATCAGGAA TGGTAGTAGA AAATATATAT
CATATAAAAA TTAATATGGA ATTAATTAGT CAGTATGATA TGGTGTTTGC GAAAGCGAAG
TTTAGTAAAT CAATCGATGG AATAGAGCCG AAGTTAAATG ATCATGGCCA TATTCATTTA
GTAAATTGTA AGCATCCGCT TTTAAGTGGA AAAGTAGTAC CGTTAAACTT TGAAATCGGT
CAAAACTATC GTAGTTTAAT TATTACAGGG CCAAATGCGG GCGGTAAGAC AATTGTGCTA
AAAACAATTG GATTACTAAC ATTAGCGACG ATGTCAGGTC TTCATATTGC TGGAGATAAA
GAAACAGAAA TTGCTATTTT CGAAAATGTA TTTGTAGATA TTGGTGATAA TCAAAGTATC
GAAAATGCAC TCAGTACGTT TTCATCACAT ATGAAAAATT TATCTGAGAT TATGAGGATG
TCAAATAATA ATACGTTGCT ATTGTTTGAT GAAATAGGAA GCGGGACTGA ACCGAACGAA
GGAGCAGCAC TTGCAATTTC TATTTTAGAG GAGTTTTATC TTGCAGGATG TATTACAGTT
GCGAGTACGC ATTACGGTGA AATTAAACGC TTCTCAGAAA TGCACGATGA TTTTATGAAT
GCAGCAATGC AATTTAATAG TGAGACGCTA GAACCGCTTT ATAAATTAGT GATCGGTAAA
TCAGGTGAAA GTAATGCACT TTGGATTGCA AATAAAATGA ACGTAAGAGA ACGTGTACTG
AAAAGAGCGA AAGCGTACAT GGGAAATAAA GAATATACTT TAGAAAAAGT GAATGAAAGT
AAAATTAGAA AACCGAAATT CCTGCAAGAA AAAAGAGAAA ATCATTACGA GTATAAAATT
GGCGATCGTG TAAATTTATT GGATCATGAT GATTTTGGTA TCATCTATAA GGAAAAAGAT
AACTTCTATA ATGTCGTTGT ATATTATAAC GGTGAATTCA TTGAAGTGAA TGTAAAACGT
ATTACTTTAG AAGTAGCAGC AAAGGAATTA TATCCAGAGG GATACGATTT AAATACGCTA
TTTGTCGATT ATAAAGAAAG AAAAATGCAA CATGATATTG AGCGCGGATC GAAAAAAGCA
CTTCGTAAAA TTCAAAAAGA AATGAGAAAG AATAGAGGGT AA
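
The "GC content" field can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming GC content means the share of G and C bases among all bases and that the spaces and line breaks are only formatting. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence as one string (spaces and newlines are fine).
first_row = "ATGAATACGA TGACTTTTGA AAAGTTACAA TATAACGAAT TAAAGGATAT AGTGAAATTT"
print(round(gc_percent(first_row), 1))
```

Running it on the complete 1902 bp sequence, rather than only the first row, gives the figure to compare with the listed value.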
 
Protein sequence
MNTMTFEKLQ YNELKDIVKF YCVSGLGKEL INKLEPSTSI KVVRNRLNET TEARAILDAE 
GHVPFFGISN IASTIQKLEK GMILDPEELV SVSDFLRGCR KIKKFMLDKE FFAPVLASYA
NSMTEYKSIE EEINFSIKGN SIDSAASKEL KRIRNNIDSV DGKIKERLTK FLNSSANKKY
IQEFFISKKD DRYTIPIKSS YKNQVAGSIV EASAKGSTVF IEPHTVTKLN AELASLKAEE
AMEEYQILAT LSGMVVENIY HIKINMELIS QYDMVFAKAK FSKSIDGIEP KLNDHGHIHL
VNCKHPLLSG KVVPLNFEIG QNYRSLIITG PNAGGKTIVL KTIGLLTLAT MSGLHIAGDK
ETEIAIFENV FVDIGDNQSI ENALSTFSSH MKNLSEIMRM SNNNTLLLFD EIGSGTEPNE
GAALAISILE EFYLAGCITV ASTHYGEIKR FSEMHDDFMN AAMQFNSETL EPLYKLVIGK
SGESNALWIA NKMNVRERVL KRAKAYMGNK EYTLEKVNES KIRKPKFLQE KRENHYEYKI
GDRVNLLDHD DFGIIYKEKD NFYNVVVYYN GEFIEVNVKR ITLEVAAKEL YPEGYDLNTL
FVDYKERKMQ HDIERGSKKA LRKIQKEMRK NRG
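
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). It does not handle an alternative start codon, which would be read as M at position 1. That is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First ten codons and last four codons of the gene sequence above.
print(translate_cds("ATGAATACGA TGACTTTTGA AAAGTTACAA"))  # MNTMTFEKLQ
print(translate_cds("AATAGAGGGT AA"))                     # NRG
```

The two fragments match the first ten and last three residues of the protein sequence listed above.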