Gene HS_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1083 
SymbolmutL 
ID4240583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1200245 
End bp1202092 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content38% 
IMG OID638104645 
ProductDNA mismatch repair protein 
Protein accessionYP_719295 
Protein GI113461226 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTA AAATTCTTCC TCCACAACTG GCTAACCAAA TAGCGGCGGG CGAAGTGGTA 
GAGCGTCCGG CTTCTGTCGT CAAAGAACTG ATTGAAAACA GCTTGGATGC AGGAGCGACA
CACATTCAAA TTGAGATTGA AAATGGCGGT GCAAATTTAA TTCGTATTCG TGACAACGGC
ATAGGAATTG CAAAAGATGA ACTCCATTTA GCCCTTGCTC GCCACGCAAC AAGCAAGATC
GCCAGCCTTG ATGATCTTGA AATGATTTTA AGTTTAGGCT TTCGTGGCGA AGCACTCGCA
AGTATCAGTT CAGTTTCCCG TTTAACCTTA ACCTCTCGTA CAGCACAACA AAATGAAGCA
TGGCAGGTTT ATGCACAAGG TCGAGATATG GAAACCAGCA TCACACCCGC TTCTCATCCG
ATTGGCACAA CAGTTGAAGT AGCAAATTTG TTTTTTAATA CGCCTGCCAG ACGCAAATTT
TTACGTACAG ACAAAACTGA ATTTGCACAT ATTGATGAAG TAATCCGCCG TATAGCGTTG
GCAAAACCTC AAGTTGCTTT TACGTTGACC CATAACAATA AATTAATACA CCGCTATAAA
AGTGCGGTAA CAAATGAGCA AAAAATTAAG CGTATTGCGA CTATTTGTGG CAATGATTTT
ATGCAAAATG CGTTACATAT TGATTGGAAA CATAATGATC TGCATCTTTC CGGTTGGGTT
ATACAACCTC AGTTTGCTCG TCATCAAAAT GACCTCAATT ATTGTTATAT CAATGGCAGA
ATGGTGCGAG ATAAAGTCAT TACGCATGCT ATCCGTCAAG CCTATTCAGA ATACCTTAAC
AACGAACAAT ATCCGGCATT TGTATTATTT ATTGATCTCA ATCCGAATGA AGTAGATGTA
AACGTACACC CAACCAAACA TGAAGTTCGC TTTCATCAAG CTCGTCTAAT TCACGATTTT
ATTTATCAAG GCATGACAAA TGCACTCACC TCTGAACAAA CCAATATTCC AATACAGAGT
GAACAATCCA ATCCAACCAA AGTTGCCGAA CCTCAAGGTA TTTGGAACTT GACCACGCAT
AACAAAGGTA ATCGAGCGAC TGCCGGCAAA AATATTTTTG CCCAGCAACC TAAAGATTAT
GATAAAAAAT CGTCTCAATT TAAACCGCAC TTTACAGCAA ATTACAGCGA AGTAACGCCA
AAGAAAGCAG TGCAAAAAGC CTATGCAGAA TTACTCGTAA CACATGAAGA AAAAACTATC
GCCTCTTCTA CTCTACCCCA TCAATTTACA CACAATGCAA CGTATATCAG CGAACAGAAA
AATGTTTTAC ATGCACTTGC ATTAATTGAA AATAAGGCAT TGTTATTGCA ACAAAACCAA
CAATATTTCC TCCTTTCTAT TCAGGCATTG CAGCATTTCA ATATCCGCTT ACAGTTGCAA
CAAAGTAATA TTGCACAACA AACATTGCTT ATTCCGATTT TATTGCGTTT AAATAAACAA
CAATATCAAT CTTGGCAACA GCAAGCTCTA TTTTTTCAAC AAAGCGGTTT CGATTTTACA
GAAAATTCTG CACAACACAG AATAACTCTA AACCGTTTAC CTATTTGCTT GCGTACACAA
AATATACAAA AAATAATTTT ACACTTATTG GATCAACCTC ATGAAAAATA CACTATTTTT
CTTACCGCAC TTTGCTCACA ATTGGAATTT CCCTCGCTCA GTACTTTCTC CGAAGCTGTA
AATTTACTCA CAAAAACTGA GCAACAATTT TCAACCCAAC ATCAACCAGA ATTCCAATCT
TTATTAGTCA AAATAGAGTG GGATCACTAT TTAGATAAAT TGCAATGA
 
Protein sequence
MTIKILPPQL ANQIAAGEVV ERPASVVKEL IENSLDAGAT HIQIEIENGG ANLIRIRDNG 
IGIAKDELHL ALARHATSKI ASLDDLEMIL SLGFRGEALA SISSVSRLTL TSRTAQQNEA
WQVYAQGRDM ETSITPASHP IGTTVEVANL FFNTPARRKF LRTDKTEFAH IDEVIRRIAL
AKPQVAFTLT HNNKLIHRYK SAVTNEQKIK RIATICGNDF MQNALHIDWK HNDLHLSGWV
IQPQFARHQN DLNYCYINGR MVRDKVITHA IRQAYSEYLN NEQYPAFVLF IDLNPNEVDV
NVHPTKHEVR FHQARLIHDF IYQGMTNALT SEQTNIPIQS EQSNPTKVAE PQGIWNLTTH
NKGNRATAGK NIFAQQPKDY DKKSSQFKPH FTANYSEVTP KKAVQKAYAE LLVTHEEKTI
ASSTLPHQFT HNATYISEQK NVLHALALIE NKALLLQQNQ QYFLLSIQAL QHFNIRLQLQ
QSNIAQQTLL IPILLRLNKQ QYQSWQQQAL FFQQSGFDFT ENSAQHRITL NRLPICLRTQ
NIQKIILHLL DQPHEKYTIF LTALCSQLEF PSLSTFSEAV NLLTKTEQQF STQHQPEFQS
LLVKIEWDHY LDKLQ