Gene Suden_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSuden_1854 
Symbol 
ID3762687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurimonas denitrificans DSM 1251 
KingdomBacteria 
Replicon accessionNC_007575 
Strand
Start bp1937613 
End bp1939016 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content27% 
IMG OID 
ProductDNA mismatch repair enzyme MutH 
Protein accessionYP_394363 
Protein GI78778048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAC CTTATGATAT TTTTAATAAA CAATCTATTA TAGAATTTGC AAAGAAGCTT 
AAATCTAGTA CATTAAAAAA ATCTTGTAAT AAAACTATTT TGTCACATGA ATATTCTGGA
AAAGGAAGTT TTGGACAGAT ACTTGAGAAA TTTTATTTTT TATATGCTCC AAATTCTAAT
GCAGAAGCAG ATTTTCCAGA AGTTAATCTT GAATTAAAAT CATCGCCATT AAAACAACTT
AAAAATAATC AGTTTCGCGC TAAAGAGAGA TTATCACTAA ATATAATCAA TTATCAAGAT
ATTGTTCATC AAAACTTTGA AACAAGTTCT TTTTGGAAAA AAAATGAGAA TTTACTCCTA
GTTTTTTATT TATATGAAAA TGATACCAAT GTTTTGGACT ATGTGATTAA GCTTGTTGAT
GAATGGACTT TTCCAAGTAT TGATTTAGAA ATAATAAAGC AAGATTGGAA AAGAATAAAA
CAAAAAGTTT TAGATGGAAA AGCTCATGAA CTTTCCGAAG GTGATACATT TTATTTAGGA
GCTGCACCAA AAGGTGGAAA AGGTGGAAAT CCAAGAGAGC AGCCCAATAG TACTTTAACA
GCAAAACAGA GAGCCTACTC ATTAAAACAA GGCTATGTAA ATCATATTAT TGCTTCAATA
TCTGGTAATT CATCTGTATA TGGTAAATTA ATACAATCAA CTGAAATTAC TAAAGATAAA
ACATTAGAAG AGATTGTGGT GTCAAAATTT GAATCTTATT ATGATAAAAC AATAGAAGAT
ATTTTGGCAA TGTTAAATAT AAATTTAAAC CTTAAAGCAA AAAACTTTTA TGCAAATTTA
ACAAAAGCAA TTTTAGGCAT AGAGCTAAAT AAAGAAATAG AAGAATTTGA AAAAGCTGAA
ATAATCGTAA AAACTATTCG GCTTAAAGAT AATAATTTAC CTAAAGAAGA TATTTCATTT
CCAAATTTTA AATATGAGAA TATAGTCAAT GAAACTTGGG ATGAATCTGA AATAAGCAAT
ATATTAGGAC ATAAATTTTT ATTTGTTTTT TTTCAATTTG AAAATAAAAA ACTGATATTC
AAGAAAGCTC AATTTTGGAA TATGCCATAT AAAGATATCT TGGAGGTTGA AAAAGTGTGG
GCAAGAACAA AAGAGATTGT TCAAAGTGGG GATATAGTTA AAGATATAAA AACAAATAAA
AGTGGCAACA AGATTAGATA TACAAACTTT CCAAATAAAA AATTTAACGC AGTATCACAT
GTAAGACCAC ATGCAATAAA TGCTGACGAT ACAATATCTT TACCAGTCAG AGACAAATTG
ACTCATTTAA AAGAGTATAC GAAGCATTGT TTTTGGCTTA ATGCTTCATA TGTAAAAAAT
GAAATTTATT TAAAATCTCT TTAG
 
Protein sequence
MELPYDIFNK QSIIEFAKKL KSSTLKKSCN KTILSHEYSG KGSFGQILEK FYFLYAPNSN 
AEADFPEVNL ELKSSPLKQL KNNQFRAKER LSLNIINYQD IVHQNFETSS FWKKNENLLL
VFYLYENDTN VLDYVIKLVD EWTFPSIDLE IIKQDWKRIK QKVLDGKAHE LSEGDTFYLG
AAPKGGKGGN PREQPNSTLT AKQRAYSLKQ GYVNHIIASI SGNSSVYGKL IQSTEITKDK
TLEEIVVSKF ESYYDKTIED ILAMLNINLN LKAKNFYANL TKAILGIELN KEIEEFEKAE
IIVKTIRLKD NNLPKEDISF PNFKYENIVN ETWDESEISN ILGHKFLFVF FQFENKKLIF
KKAQFWNMPY KDILEVEKVW ARTKEIVQSG DIVKDIKTNK SGNKIRYTNF PNKKFNAVSH
VRPHAINADD TISLPVRDKL THLKEYTKHC FWLNASYVKN EIYLKSL