Gene Haur_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2999 
Symbol 
ID5734871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3787657 
End bp3789552 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content53% 
IMG OID641280143 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_001545765 
Protein GI159899518 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTC GTGTTTTAGA CCCAACTTTA GCTGCCCAAA TCGCCGCTGG CGAGGTGGTT 
GAACGCCCAG CCTCGGTCGT CAAAGAATTA ATCGAAAATT CGGTCGATGC TGGCGCTACC
GAGATTCGGG TCGAGGCACG CGAGGGCGGC AAACGCGAGT TGCGCATCCA AGATAATGGC
TGTGGCATTG CCAGCGATGA GGTTGAAACG GCATTTTTGC GCCATGCCAC CAGTAAAGTA
ACCGAAATTG AAGACCTATT TTCGATTCGC ACACTGGGTT TTCGTGGCGA AGCCTTGCCT
TCCATTGCCT CAGTTGCTCA AGTAACATGC CTTACCCGCA CTGCCGCCGA TGAAGTTGGT
ACTGAATTGC GGATCGCTGG TGGCGAAATT CAGGCTAAAA CGCCGCGTGG CTGCTCGGTC
GGCACAACCT TCACGATTCG CAATTTGTTT TATAACACCC CAGCACGGCT CAAATTTATG
CGCTCCGATG CCACCGAAAT GAGCCAAATT AGCACAATCG TGACCCAATA TGCCTTGGCC
TACCCCAACA TTCGCTGGAC CTTGCTGCTC GATGGCAAGC TGGCCTTACA AACCCCAGGC
AATGGGCGTT TGCTCGATGC CTTGATTGAA TTGTATGGGA TTGATGTTGG CCGCGAGATG
ATCAGCGTTG ATCGGACCTC GGAAGCCGAA GATGAAACTG TGCGCGTTCA TGGCTTTGTC
AGCCAACCTT CGACCTTTCG CGCTGCCCGT TCGTATATGC ACTTATTCGT CAATCAGCGT
TGGATCAAGC CGCAAGGCAA TTTGGTCTAT ATGATCGAAG AGGCCTACCA TACCTTATTG
ATGAAGGGTC GGCATCCGAT TGTGGCCTTG AATATTGAGC TTGAGCCAGA AGCGGTTGAT
GTGAATGTGC ACCCAACCAA GAGCGAGGTC AAATTTCGCA ATCAATCGCA TGTCTATGGC
GCATTGACCA AAGCAGTACG CGAAGCCTTG GCTGCTCAAA GCACTATTCG GGCTTGGACA
GGCTTTGGAG CCAACGAAAG TGTCAATCGG CGGGTCGAAT TACGCTCGCC CAATGGCGAA
CGACGTGGCT CAAGCAATGA TGCACCCTTG TTTGATGATG CACCAGCAGC GCCACGCCCT
CAGGTCAATA ATTATCCTGA TGACGATTTT GATTCGACCG TGAATTTGCC GCCAATTGCT
AAACAAGCGC GGTTTGAAAC CCCAACCACC AGCCAAACCA GCAGTTTCTT GCCGCCGCAG
CAACAAGCTT TTGATCCGGC GTATGCACCG AGCATGCCAG CTCCAGGCGA GGCCAAATTG
CCGATGTTAC GGGTGGTTGG CCAAGTTAAC GAAACCTATA TTGTGGCCGA AAGCAGCGAT
GGTATGTATT TGGTCGATCA GCATGCGGCC CACGAACGGG TGGTGTATGA GCGCTTGATG
GCCGAACATC AGGATGTGCC AATTGAACGC CAAACCCTGA TGTTAGCCCA ACCGATTGAA
CTACCACCAG CCGTCACCCG TTTGCTCAGC GCTCACTTGG CCGATTTGGA GCAATGGGGC
TTTGAGGCCG AGGAATTTGG CGAAGGCACA TTAATGTTGC GAGCTGTGCC AAGTGGCTTG
CACGTTGGCC AAATTGCCAC CGCCTTGATG GAAATCGCTG ATCATTTGAG TTATGAAGGC
GGGGCCACTA GCGACGATCG GCGTGAAAAA ATGTTGACCA CGATCGCCTG CCATAGCTCA
ATTCGCGCGG GCAAAACCCT GACCCACGAA GAGATGCGCC AGCTTTTGCA ACAACTTGAG
CGCTGCGAAA TGCCGCGCAC CTGCCCGCAT GGCCGCCCAA CCATGCTCCA AATTACGCAA
GGCCAAATCG AACGCCAATT TGGGCGCAAA GGCTAG
 
Protein sequence
MPIRVLDPTL AAQIAAGEVV ERPASVVKEL IENSVDAGAT EIRVEAREGG KRELRIQDNG 
CGIASDEVET AFLRHATSKV TEIEDLFSIR TLGFRGEALP SIASVAQVTC LTRTAADEVG
TELRIAGGEI QAKTPRGCSV GTTFTIRNLF YNTPARLKFM RSDATEMSQI STIVTQYALA
YPNIRWTLLL DGKLALQTPG NGRLLDALIE LYGIDVGREM ISVDRTSEAE DETVRVHGFV
SQPSTFRAAR SYMHLFVNQR WIKPQGNLVY MIEEAYHTLL MKGRHPIVAL NIELEPEAVD
VNVHPTKSEV KFRNQSHVYG ALTKAVREAL AAQSTIRAWT GFGANESVNR RVELRSPNGE
RRGSSNDAPL FDDAPAAPRP QVNNYPDDDF DSTVNLPPIA KQARFETPTT SQTSSFLPPQ
QQAFDPAYAP SMPAPGEAKL PMLRVVGQVN ETYIVAESSD GMYLVDQHAA HERVVYERLM
AEHQDVPIER QTLMLAQPIE LPPAVTRLLS AHLADLEQWG FEAEEFGEGT LMLRAVPSGL
HVGQIATALM EIADHLSYEG GATSDDRREK MLTTIACHSS IRAGKTLTHE EMRQLLQQLE
RCEMPRTCPH GRPTMLQITQ GQIERQFGRK G