Gene Bcer98_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_2199 
Symbol 
ID5345383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp2281612 
End bp2283513 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content35% 
IMG OID640839718 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_001375444 
Protein GI152975927 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.402475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATCAAA TGACTTTTGA AAAGTTGCAG TACAATGAAT TGAAGGACAT TGTTAAATCT 
TACTGTGTAA GTGGGTTAGG AAAGCGATTA TTGGATCAAT TAGAGCCAAG TACAAATATA
AAGGTAGTAC GAAATCGATT GAATGAAACG ACGGAAGCGC GAGCGATATT AGATGCAGAA
GGACATGTAC CTTTTTTAGG TATTTCTAAT ATTGATAACA TCATGCAAAA ACTAGAAAAA
GGAATGATTC TAGAGCCAAG CGAGTTCGTG AGTGTTTCAG ATTTTTTACG GGGCTGTCGC
AAGATTAAAA AGTTTATGTT AGATAAGGAA TTTTTTGCAC CAATGCTCGC AGCGTATGCA
AATTCTATGT CCGAGTTTAA AAGCATAGAG GAAGAAATAC AATTTTGTAT AAAAGGAAAC
CGTGTTGATT CTGCTGCAAG TAAAGAATTA AAACGAATTC GAAATCAAAT GGATTCGGTA
GAGGGAAAAA TAAAGGAACG GTTAAATAAA TTTTTAAACA GTAGTGCAAA TAAGAAGTAT
ATACAAGAAT TCTTCATTAG TAAAAAGGAC GACCGATATA CGATTCCGGT GAAAGCATCT
TATAAAAATC AAGTTGCAGG AACAATTGTT GCAGTTTCTT CTAAAGGTTC TACTGTTTTT
ATAGAACCAA ATACTGTTAC AACATTAAAC GTGGAACTGG CAAGCTTACG AGCGGAAGAA
GCGATGGAAG AATATCAAAT ATTAGCAACT CTCTCAGGAA TGATATTAGA AAACATATAT
CAAATCAAAA TAAATATAGA GCTAGTGAGT CAATATGATT TAGTATTTGC CAAAGCAAAA
TTTAGTAAGC AAATTGGCGG AATAGAACCC AAGTTAAATG ATTACGGGTA TATAAAGCTC
GTTCATTGCA AGCATCCGCT TTTATCAGGA GAAGTGATAC CGCTAAATTT TGAAATTGGT
CAAAAATATC GAAGTTTAAT TATTACAGGA CCAAATGCTG GGGGGAAAAC GATTGTTCTG
AAAACAATCG GATTATTAAC ATTAGCGGCA ATGTCGGGGT TTCATATTGC AGGAGAGCGA
GAAACGGAGA TTGCTGTTTT TGAACATATC TTTGTAGATA TCGGTGATAA TCAAAGTATC
GAAAATGCAC TAAGTACATT TTCATCTCAT ATGAAAAATC TTTCGGAAAT TATGGAAGTA
TCAAATAATA ATACGCTGTT ATTATTTGAT GAAATAGGCA GCGGTACGGA ACCGAATGAG
GGAGCTGCGC TAGCGATTTC TATTTTAGAA GAGTTTTATC ATATGGGCTG TATCACTGTC
GCGACGACGC ATTATGGTGA GATTAAGCGA TTCTCAGAAA TGCATAGTGA CTTTATGAAT
GCAGCGATGC AATTTCATAG TGAAACATTG GAACCGATGT ATCAATTATT AATTGGAAAA
TCGGGAGAAA GTAATGCACT TTGGATTTCT CGAAAAATGA ATGTACGAGA ACATGTATTG
CAAAGAGCAA AAGAGTACAT GGAAAACAAA GAATATCGTT TAGAAAAGCT ACATGAAGGA
AAAGTAAGAA AGCCTAAAAT TGTAAAGCAA GCGGTAGAAG AGGCATATGC ATATAAAAAA
GGAGATCGCG TGAGATTATT AGATGATGAT GAATTTGGAA TTATATACCG AGAGAAAGAT
AACTTCTCCA ACGTTGTCGT GTTTTCCAAA GAGCGGTTTA TAGAAGTAAA TAGTAAACGG
ATTGCTTTAG AGGTCGAGGC GAAGGAATTA TATCCAGAAG GTTACGATTT AGATACGTTG
TTTGTAGATT ATAAAGAACG AAAAATGCAG CATGATATAG AGCGTGGATC AAAAAAAGCG
CTGCGTAACA TTCAAAAAGA AATAAGAAAG AATAAAGGAT AA
 
Protein sequence
MNQMTFEKLQ YNELKDIVKS YCVSGLGKRL LDQLEPSTNI KVVRNRLNET TEARAILDAE 
GHVPFLGISN IDNIMQKLEK GMILEPSEFV SVSDFLRGCR KIKKFMLDKE FFAPMLAAYA
NSMSEFKSIE EEIQFCIKGN RVDSAASKEL KRIRNQMDSV EGKIKERLNK FLNSSANKKY
IQEFFISKKD DRYTIPVKAS YKNQVAGTIV AVSSKGSTVF IEPNTVTTLN VELASLRAEE
AMEEYQILAT LSGMILENIY QIKINIELVS QYDLVFAKAK FSKQIGGIEP KLNDYGYIKL
VHCKHPLLSG EVIPLNFEIG QKYRSLIITG PNAGGKTIVL KTIGLLTLAA MSGFHIAGER
ETEIAVFEHI FVDIGDNQSI ENALSTFSSH MKNLSEIMEV SNNNTLLLFD EIGSGTEPNE
GAALAISILE EFYHMGCITV ATTHYGEIKR FSEMHSDFMN AAMQFHSETL EPMYQLLIGK
SGESNALWIS RKMNVREHVL QRAKEYMENK EYRLEKLHEG KVRKPKIVKQ AVEEAYAYKK
GDRVRLLDDD EFGIIYREKD NFSNVVVFSK ERFIEVNSKR IALEVEAKEL YPEGYDLDTL
FVDYKERKMQ HDIERGSKKA LRNIQKEIRK NKG