Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1083 |
Symbol | mutL |
ID | 4240583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1200245 |
End bp | 1202092 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104645 |
Product | DNA mismatch repair protein |
Protein accession | YP_719295 |
Protein GI | 113461226 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATTA AAATTCTTCC TCCACAACTG GCTAACCAAA TAGCGGCGGG CGAAGTGGTA GAGCGTCCGG CTTCTGTCGT CAAAGAACTG ATTGAAAACA GCTTGGATGC AGGAGCGACA CACATTCAAA TTGAGATTGA AAATGGCGGT GCAAATTTAA TTCGTATTCG TGACAACGGC ATAGGAATTG CAAAAGATGA ACTCCATTTA GCCCTTGCTC GCCACGCAAC AAGCAAGATC GCCAGCCTTG ATGATCTTGA AATGATTTTA AGTTTAGGCT TTCGTGGCGA AGCACTCGCA AGTATCAGTT CAGTTTCCCG TTTAACCTTA ACCTCTCGTA CAGCACAACA AAATGAAGCA TGGCAGGTTT ATGCACAAGG TCGAGATATG GAAACCAGCA TCACACCCGC TTCTCATCCG ATTGGCACAA CAGTTGAAGT AGCAAATTTG TTTTTTAATA CGCCTGCCAG ACGCAAATTT TTACGTACAG ACAAAACTGA ATTTGCACAT ATTGATGAAG TAATCCGCCG TATAGCGTTG GCAAAACCTC AAGTTGCTTT TACGTTGACC CATAACAATA AATTAATACA CCGCTATAAA AGTGCGGTAA CAAATGAGCA AAAAATTAAG CGTATTGCGA CTATTTGTGG CAATGATTTT ATGCAAAATG CGTTACATAT TGATTGGAAA CATAATGATC TGCATCTTTC CGGTTGGGTT ATACAACCTC AGTTTGCTCG TCATCAAAAT GACCTCAATT ATTGTTATAT CAATGGCAGA ATGGTGCGAG ATAAAGTCAT TACGCATGCT ATCCGTCAAG CCTATTCAGA ATACCTTAAC AACGAACAAT ATCCGGCATT TGTATTATTT ATTGATCTCA ATCCGAATGA AGTAGATGTA AACGTACACC CAACCAAACA TGAAGTTCGC TTTCATCAAG CTCGTCTAAT TCACGATTTT ATTTATCAAG GCATGACAAA TGCACTCACC TCTGAACAAA CCAATATTCC AATACAGAGT GAACAATCCA ATCCAACCAA AGTTGCCGAA CCTCAAGGTA TTTGGAACTT GACCACGCAT AACAAAGGTA ATCGAGCGAC TGCCGGCAAA AATATTTTTG CCCAGCAACC TAAAGATTAT GATAAAAAAT CGTCTCAATT TAAACCGCAC TTTACAGCAA ATTACAGCGA AGTAACGCCA AAGAAAGCAG TGCAAAAAGC CTATGCAGAA TTACTCGTAA CACATGAAGA AAAAACTATC GCCTCTTCTA CTCTACCCCA TCAATTTACA CACAATGCAA CGTATATCAG CGAACAGAAA AATGTTTTAC ATGCACTTGC ATTAATTGAA AATAAGGCAT TGTTATTGCA ACAAAACCAA CAATATTTCC TCCTTTCTAT TCAGGCATTG CAGCATTTCA ATATCCGCTT ACAGTTGCAA CAAAGTAATA TTGCACAACA AACATTGCTT ATTCCGATTT TATTGCGTTT AAATAAACAA CAATATCAAT CTTGGCAACA GCAAGCTCTA TTTTTTCAAC AAAGCGGTTT CGATTTTACA GAAAATTCTG CACAACACAG AATAACTCTA AACCGTTTAC CTATTTGCTT GCGTACACAA AATATACAAA AAATAATTTT ACACTTATTG GATCAACCTC ATGAAAAATA CACTATTTTT CTTACCGCAC TTTGCTCACA ATTGGAATTT CCCTCGCTCA GTACTTTCTC CGAAGCTGTA AATTTACTCA CAAAAACTGA GCAACAATTT TCAACCCAAC ATCAACCAGA ATTCCAATCT TTATTAGTCA AAATAGAGTG GGATCACTAT TTAGATAAAT TGCAATGA
|
Protein sequence | MTIKILPPQL ANQIAAGEVV ERPASVVKEL IENSLDAGAT HIQIEIENGG ANLIRIRDNG IGIAKDELHL ALARHATSKI ASLDDLEMIL SLGFRGEALA SISSVSRLTL TSRTAQQNEA WQVYAQGRDM ETSITPASHP IGTTVEVANL FFNTPARRKF LRTDKTEFAH IDEVIRRIAL AKPQVAFTLT HNNKLIHRYK SAVTNEQKIK RIATICGNDF MQNALHIDWK HNDLHLSGWV IQPQFARHQN DLNYCYINGR MVRDKVITHA IRQAYSEYLN NEQYPAFVLF IDLNPNEVDV NVHPTKHEVR FHQARLIHDF IYQGMTNALT SEQTNIPIQS EQSNPTKVAE PQGIWNLTTH NKGNRATAGK NIFAQQPKDY DKKSSQFKPH FTANYSEVTP KKAVQKAYAE LLVTHEEKTI ASSTLPHQFT HNATYISEQK NVLHALALIE NKALLLQQNQ QYFLLSIQAL QHFNIRLQLQ QSNIAQQTLL IPILLRLNKQ QYQSWQQQAL FFQQSGFDFT ENSAQHRITL NRLPICLRTQ NIQKIILHLL DQPHEKYTIF LTALCSQLEF PSLSTFSEAV NLLTKTEQQF STQHQPEFQS LLVKIEWDHY LDKLQ
|
| |