Gene Oant_3049 details

Gene Information

Locus tag: Oant_3049
Symbol: mutL
ID: 5381452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009668
Strand:
Start bp: 357153
End bp: 359033
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 58%
IMG OID: 640835726
Product: DNA mismatch repair protein
Protein accession: YP_001371586
Protein GI: 153010372
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
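
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch using values copied from the record. The variable names are illustrative. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 357153
end_bp = 359033
gene_length_bp = 1881
protein_length_aa = 626

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span (bp):", span_bp)
print("leftover bases after splitting into codons:", remainder)
print("codons excluding the stop:", coding_codons)
```

Under these assumptions the span equals the stated gene length, the length is a multiple of three, and the codon count minus the stop matches the stated protein length.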


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAATTC GACACTTAAG CGAAACCATC ATCAATCAGA TCGCAGCGGG GGAAGTCATC 
GAACGCCCCG CCAGCGTTAT CAAGGAACTT GTCGAAAATG CTATCGACGC TGGCGCGACC
CGCATTGAGG TCGTGACCGG TGGCGGTGGC AAGACATTGT TGCGCGTGAC GGATAATGGT
TCGGGGATTC CGGTCGATGA GCTTCCATTG GCAGTTTCCC GCCACTGTAC CTCCAAGCTT
TCGGACGATG TGCACGACAT TCGTGCACTT GGGTTTCGTG GGGAAGCTCT GCCATCTATT
GGCTCGGTCG CAAAACTGAC CCTCAAATCT CGTCCGCAGG ACGCTGACTC CGGCTTTGAA
GTATCGGTTT CCGGCGGACA TCTCGACGGT CCCCGTCCTT CCGCGCTCAA TCGCGGGACA
ATTGCCGAGG TCCGCGATCT TTTTTTCGCG ACGCCTGCCC GTCTCAAATT CATGAAAACG
GATCGTGCGG AAGCTTCCGC CATAACCGAT GTCGTCAAGC GCATCGCAAT TGCCTTTCCC
CATGTCCGCT TTTCGCTTGC AGGCACCGAC AGGACACCAC TGGAACTGGC CGCAACCGGC
AGCGGCGCAG AAGCAACGCT TGAGCGCATC AATCAGGTGC TCGGAAAAGA ATTCGGCGAA
AATGCGCTTG CCATTGACGC GGAACGCGAC GGCGTACGTC TGGCCGGATT TGTCGGCATC
CCCTCCCATA ATCGTGGCAA CGCCCTGCAT CAGTTCGCCT ATGTGAATGG GCGCCCGGTG
CGGGACAAAC AGCTTTTCGG CGCATTGCGT GGAGCCTATG CGGATGTCAT GGCACGTGAT
CGTCATCCGG TTGCCGTATT GTTTCTGACG CTGGATCCGG CATTTGTCGA CGTTAATGTG
CATCCTGCCA AGGCTGACGT GCGTTTCCGT GATCCAGGTC TTGTGCGCGG GCTAATCGTC
GGGGCGATCA AACAAGCACT GGCGCAATCA GGTATTCGGC CTGCGACCAG CGGTGCCGAT
GCCATGCTGC AAGCGTTCCG TGCAGAAGGA TTCCAGCCGC CCTCACCATC ATTCACATCG
CGGCCTTCTT CGGCTGGTTA TGCATCGGGG AGCTGGCACC CCGCAGTTTC CTCGCCGAGA
ACCGAATGGT CACCGCAGAC CGCACATCCG GCGCATCGGC CGCTGGACTT GGGAGCGGCA
CCCTCCTTTC AGGAAAGTGA CCAGGCGACG CTCGCCACCG TTAACGTGCT GGCGGCAGAT
GCCCGAGCGA CGCGTGACGA AGCACCGGTG GAACTCCAGC AGAAGCCACT CGGGGCGGCC
CGCGCGCAAA TCCACGCGAA CTATATTGTC TCCCAGACCG AAGACAGTCT CGTCATTGTA
GATCAGCATG CGGCTCACGA ACGTCTCGTT TATGAGGCGT TGAAAAACGC CCTGCATTCT
CGTCCCATAT CGGGACAAAT GCTGCTTATT CCTGAAATCG TGGATCTGCC GGAAGAAGAC
GCCGAGCGAC TGGCAACCCA CGCGGAAACA CTTGCCCGCT TTGGCCTTGG CATCGAGCAA
TTCGGGCCAG GTGCTATCGC GGTGCGTGAA ACGCCCGCAA TGCTCGGAGA AATGAACGTG
CAGCAGCTGA TCCGCGATCT TTCGGATGAG ATTGCGGAAC ACGACACATC TGAAGGATTG
AAGGCCATGT TGAACCATGT GGCCGCAACA ATGGCCTGTC ATGGTTCCGT TCGCTCGGGG
CGACGGTTGA AGCCTGAAGA AATGAATGCG CTCCTGCGTG AAATGGAAGC CACCCCCGGC
TCAGGCACCT GCAATCACGG TCGCCCAACC TACATCGAAC TGAAACTGAC AGATATCGAA
CGGCTATTTG GCAGGCGCTG A
 
Protein sequence
MPIRHLSETI INQIAAGEVI ERPASVIKEL VENAIDAGAT RIEVVTGGGG KTLLRVTDNG 
SGIPVDELPL AVSRHCTSKL SDDVHDIRAL GFRGEALPSI GSVAKLTLKS RPQDADSGFE
VSVSGGHLDG PRPSALNRGT IAEVRDLFFA TPARLKFMKT DRAEASAITD VVKRIAIAFP
HVRFSLAGTD RTPLELAATG SGAEATLERI NQVLGKEFGE NALAIDAERD GVRLAGFVGI
PSHNRGNALH QFAYVNGRPV RDKQLFGALR GAYADVMARD RHPVAVLFLT LDPAFVDVNV
HPAKADVRFR DPGLVRGLIV GAIKQALAQS GIRPATSGAD AMLQAFRAEG FQPPSPSFTS
RPSSAGYASG SWHPAVSSPR TEWSPQTAHP AHRPLDLGAA PSFQESDQAT LATVNVLAAD
ARATRDEAPV ELQQKPLGAA RAQIHANYIV SQTEDSLVIV DQHAAHERLV YEALKNALHS
RPISGQMLLI PEIVDLPEED AERLATHAET LARFGLGIEQ FGPGAIAVRE TPAMLGEMNV
QQLIRDLSDE IAEHDTSEGL KAMLNHVAAT MACHGSVRSG RRLKPEEMNA LLREMEATPG
SGTCNHGRPT YIELKLTDIE RLFGRR