Gene GBAA_3904 details


Gene Information

Locus tag: GBAA_3904
Symbol: mutL
ID: 2814810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3575350
End bp: 3577230
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 38%
IMG OID: 637790622
Product: DNA mismatch repair protein
Protein accession: YP_020542
Protein GI: 47529193
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
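
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3575350
end_bp = 3577230

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script reproduces the listed gene length of 1881 bp and protein length of 626 aa.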


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.595251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAAAA TTCGCAAACT CGATGACCAA CTCTCTAACT TAATTGCGGC AGGGGAAGTA 
GTAGAGCGCC CTGCCTCAGT CGTAAAAGAA CTTGTGGAAA ATTCTATCGA TGCGAATAGT
ACATCTATTG AAATCCACTT AGAAGAAGCC GGTTTATCGA AAATTCGCAT CATTGATAAC
GGAGATGGCA TTGCAGAAGA AGATTGTATC GTTGCTTTTG AACGACATGC GACGAGCAAA
ATTAAAGATG AAAACGATCT GTTTCGCATA AGAACACTCG GTTTCCGCGG TGAGGCATTG
CCAAGTATTG CCTCAGTTAG TGAATTAGAA TTAATCACTA GCACCGGTGA TGCTCCTGGT
ACACACCTTA TTATTAAAGG TGGAGACATT ATAAAGCAGG AAAAAACAGC GAGCCGAAAA
GGAACAGATA TTACAGTACA AAACTTATTC TTTAATACAC CAGCGCGTCT TAAATATATG
AAAACCATTC ATACAGAGCT TGGGAATATT ACAGATATTG TGTATCGTAT TGCAATGTCG
CATCCAGAAG TATCATTAAA GCTATTTCAT AATGAAAAGA AATTGCTTCA TACATCAGGA
AATGGTGATG TAAGACAAGT ACTTGCATCG ATTTATAGCA TTCAAGTTGC AAAGAAGCTT
GTTCCAATTG AAGCTGAATC TTTAGATTTC ACAATTAAAG GTTATGTAAC ATTACCAGAA
GTAACGAGAG CATCTCGTAA TTATATGTCA ACAATCGTAA ATGGCCGTTA CGTTCGAAAT
TTTGTATTAA TGAAAGCTAT TCAGCAAGGA TACCATACAT TACTGCCAGT CGGACGATAT
CCAATCGGTT TCTTATCAAT TGAAATGGAT CCAATGCTTG TTGACGTTAA CGTACATCCA
GCGAAATTAG AAGTTCGTTT TAGTAAAGAA CAAGAATTAC TAAAGCTGAT TGAAGAAACA
TTGCAAGCAG CATTCAAAAA AATACAGCTC ATTCCAGATG CAGGTGTAAC AACGAAGAAA
AAAGAAAAAG ATGAAAGTGT GCAAGAACAG TTTCAGTTTG AGCATGCGAA GCCGAAAGAA
CCATCTATGC CGGAGATCGT TTTACCGACG GGCATGGATG AAAAACAAGA AGAACCGCAG
GCTGTGAAAC AGCCAACACA ACTGTGGCAA CCATCCACTA AACCGATAAT TGAAGAGCCA
ATTCAAGAAG AGAAATCGTG GGACAGTAAC GAAGAGGGCT TTGAACTAGA GGAATTAGAA
GAAGTTCGGG AAATAAAAGA GATTGAAATG AACGGGAATG ACTTACCACC GCTTTATCCA
ATTGGACAAA TGCATGGAAC ATATATTTTC GCTCAAAATG ATAAAGGCTT ATATATGATT
GACCAGCATG CCGCGCAGGA ACGTATTAAT TATGAATACT TCCGTGATAA AGTAGGAAGG
GTAGCGCAAG AAGTACAAGA ACTACTCGTA CCATATCGTA TTGATTTATC TCTTACTGAA
TTTTTACGTG TTGAAGAGCA ACTAGAAGAA CTAAAGAAAG TCGGTCTATT CTTGGAGCAA
TTCGGCCATC AATCCTTTAT CGTTCGCTCG CATCCAACGT GGTTCCCGAA AGGGCAAGAA
ACAGAAATTA TCGATGAAAT GATGGAGCAG GTCGTTAAAC TAAAAAAAGT TGATATTAAA
AAATTACGTG AAGAAGCAGC CATCATGATG AGCTGTAAAG CTTCAATTAA AGCAAATCAA
TATTTAACGA ACGATCAAAT ATTTGCTTTA CTGGAAGAAC TTCGTACAAC AACAAACCCA
TACACATGCC CGCACGGAAG ACCAATTCTT GTGCATCATT CTACTTATGA GTTGGAGAAG
ATGTTTAAGA GGGTTATGTA G
 
Protein sequence
MGKIRKLDDQ LSNLIAAGEV VERPASVVKE LVENSIDANS TSIEIHLEEA GLSKIRIIDN 
GDGIAEEDCI VAFERHATSK IKDENDLFRI RTLGFRGEAL PSIASVSELE LITSTGDAPG
THLIIKGGDI IKQEKTASRK GTDITVQNLF FNTPARLKYM KTIHTELGNI TDIVYRIAMS
HPEVSLKLFH NEKKLLHTSG NGDVRQVLAS IYSIQVAKKL VPIEAESLDF TIKGYVTLPE
VTRASRNYMS TIVNGRYVRN FVLMKAIQQG YHTLLPVGRY PIGFLSIEMD PMLVDVNVHP
AKLEVRFSKE QELLKLIEET LQAAFKKIQL IPDAGVTTKK KEKDESVQEQ FQFEHAKPKE
PSMPEIVLPT GMDEKQEEPQ AVKQPTQLWQ PSTKPIIEEP IQEEKSWDSN EEGFELEELE
EVREIKEIEM NGNDLPPLYP IGQMHGTYIF AQNDKGLYMI DQHAAQERIN YEYFRDKVGR
VAQEVQELLV PYRIDLSLTE FLRVEEQLEE LKKVGLFLEQ FGHQSFIVRS HPTWFPKGQE
TEIIDEMMEQ VVKLKKVDIK KLREEAAIMM SCKASIKANQ YLTNDQIFAL LEELRTTTNP
YTCPHGRPIL VHHSTYELEK MFKRVM
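
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of how those relationships might be checked. It assumes the sequence text is pasted in the 10-base blocked layout shown above. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers written for this illustration, not part of any database tool. Table 11 assigns the same amino acids to internal codons as the standard genetic code and differs in which start codons it permits. This sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGGGGAAAA TTCGCAAACT CGATGACCAA CTCTCTAACT TAATTGCGGC AGGGGAAGTA"
)
last_line = clean_sequence("ATGTTTAAGA GGGTTATGTA G")

print(translate(first_line))
print(translate(last_line))
print(gc_fraction(first_line))
```

The first gene line translates to MGKIRKLDDQLSNLIAAGEV and the last to MFKRVM followed by a stop codon, matching the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives the value to compare against the GC content field.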