Gene DET1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1195 
Symbol 
ID3229520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp1089265 
End bp1090965 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content52% 
IMG OID637120758 
ProductMutL/HexB family DNA mismatch repair protein 
Protein accessionYP_181908 
Protein GI57234054 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0321912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATAA AACTGCTGGA CAAGGCAACT ATTGCCCGCA TAGCCGCCGG AGAGGTCATT 
GAAAGGCCGT CTTCCGTGGT CAAAGAGCTG CTGGAAAACT CTCTTGATGC CGGGGCTAAA
CGGGTGGATG TTGTTATCCG CGAAGGCGGC ATTGGCTATA TAGAAGTAAG TGATGACGGG
TGCGGCATAG TTTTTTCGGA AGTTTTGCTG GCCTTTGAAC GCCATGCCAC CAGCAAGCTT
TCCAGCTTTG AAGATATTTA TGCTATCTCC AGTCTGGGTT TCAGGGGTGA AGCCCTGCCC
AGTATTGCGG CGGTGGCAGA TTTGGAAATG CTGACGGCTG CCCGAACTGA AGAAAGCGGC
ACTTATTTGT CTTTGTCCGG CGGCGAAATG GTTAAGCATA CCCGTATGGC ACGTTCACCA
GGTACTACTA TAAAGCTCAC CCGCCTTTTC AGCCGGGTGC CTGCCCGCCT GAAATTCCTT
AAAACCCCCC AGCATGAAGC CTCCAAAGTG TCGGAAGTGG TGCTGAGCTA TGCTTTGGCT
TACCCTGAGG TCAAGTTTAC TCTGAGCATT GACGGGCGGA ATACCTTAAA TACCCCCGGC
AACGGCAAAC TGCGGGATGC CGTGCTGGAA ATATACGGAA ACGACGTTGC AAGTAAAATG
CTGGATTTGG AAACAGACTC TTACCGTTCA TCTGCCATAA ATATCAGCGG TCTGGTAAGC
CCGCCTGAAA TCAGCCGTTC CAACCGTAAT TCCCTCCATT TCTTTGTTAA CCGCCGCCTT
ATCCAGAGCA GGGCTTTGCA AAAAGCGGCA GAACAGGCCT ACAGCGGCTT GCTTATGGTG
GGGCGTTACC CTCTGGGGGT TATAAATATA TGGCTGGATG GGGCGCTGGT AGATGTAAAT
ATTCACCCCA CCAAGGCAGA AGTTAAATTT TCAGATGAAA GTGCCGTTTT TACCGCTGTC
CAGCGGGCAG TCCGTTCGGT ACTGGTGGAG AAGCCACCCA CTCCTCATAT AGCCGAAGAA
GCGTCTGTTT ACCGGCAGGA ATCTGCCAGA CAAGAGCCAA TCTGGGGTGA GACTCCAAAA
CCCGCCGGTA CTGTCCAGCA GTATTTTTCG CCTGTTATCC AGAGTGCTAA AACATCGGTT
TTGCCGCTGC TGCGGCTGGT GGGGCAGATA GGCGGCCTTT ACCTGCTGGC CGAAGGGCCG
GATGGGCTTT ACATAATAGA CCAGCACGCC GCCCATGAGC GTATCCGTTA TGAAGAAATT
GCCTCACAAA CCCCCTCTGA AAATGCGCGC CAGAGCCTTC TTGATCCGTT TATACTGGAA
CTAAACCCGG TGCAGGAAGC CATGATTGAA AAATGCAAAT CAGAGCTGGA TTTAATGGGT
TTTGAAATAG AAGAATTCGG CCGCAGAGTC TACCGTGTGC AATCAATTCC GGCCGGTTTT
ACCGCACCCC AGGCCAAAGC CCTTCTTTCA GAGCTTGTTG ATAATCCCAA AGATGCCCCG
GCAGAGATAA AGGAACGTTT ACAGCGGCTG ATGGCTTGCC ATACCGCAGT TCGGGCAGGA
CAGGTGCTTA ACGAGGCGGA GATGCGTGAA CTGCTGCTGA AACTGGAGAA AACCGCTGTA
CCCGGCCACT GCCCTCACGG GCGTCCCACT ATTGTAAAAA TAGACTTTTG CCAGCTTGAA
AAAGACTTCA AGCGTACTTA G
 
Protein sequence
MPIKLLDKAT IARIAAGEVI ERPSSVVKEL LENSLDAGAK RVDVVIREGG IGYIEVSDDG 
CGIVFSEVLL AFERHATSKL SSFEDIYAIS SLGFRGEALP SIAAVADLEM LTAARTEESG
TYLSLSGGEM VKHTRMARSP GTTIKLTRLF SRVPARLKFL KTPQHEASKV SEVVLSYALA
YPEVKFTLSI DGRNTLNTPG NGKLRDAVLE IYGNDVASKM LDLETDSYRS SAINISGLVS
PPEISRSNRN SLHFFVNRRL IQSRALQKAA EQAYSGLLMV GRYPLGVINI WLDGALVDVN
IHPTKAEVKF SDESAVFTAV QRAVRSVLVE KPPTPHIAEE ASVYRQESAR QEPIWGETPK
PAGTVQQYFS PVIQSAKTSV LPLLRLVGQI GGLYLLAEGP DGLYIIDQHA AHERIRYEEI
ASQTPSENAR QSLLDPFILE LNPVQEAMIE KCKSELDLMG FEIEEFGRRV YRVQSIPAGF
TAPQAKALLS ELVDNPKDAP AEIKERLQRL MACHTAVRAG QVLNEAEMRE LLLKLEKTAV
PGHCPHGRPT IVKIDFCQLE KDFKRT