Gene Dde_2869 details


Gene Information

Locus tag: Dde_2869
Symbol:
ID: 3758830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio desulfuricans subsp. desulfuricans str. G20
Kingdom: Bacteria
Replicon accession: NC_007519
Strand:
Start bp: 2856595
End bp: 2857941
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 50%
IMG OID: 637783770
Product: type I restriction-modification system, S subunit
Protein accession: YP_389359
Protein GI: 78357910
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
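
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2856595
end_bp = 2857941
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # coordinate span vs. Gene Length
print(span_bp % 3 == 0)                              # CDS length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks print True for this record.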


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0947716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGT ACAAAGCGTA TCCCGCGTAC AAGGATTCCG GCGTTGAGTG GATTGGGCAG 
GTGCCGGAGC ATTGGAAGAT TGCGCCAGTA AAGTATCACT ACGATGCAAG ACTGGGGAAA
ATGATCCAGC CTGCTGCGGT CTCTGACCGA GATATAGAAG TGCCATACCA TCGGGCGCAA
ACCGTTCAAT GGGAAAGGAT CGTTGAGTCT GACATCAAAG AAATGTGGGC ATCACCAAGG
GATATAGAAC AGTTTTCTGT ATCTGAAGGC GACCTTTTAA TTTGCGAGGG CGGTGATGTT
TGTCGCGCTG CAATTGTTAA ACAGCCTCCT GAAAAAAACA TGATATTCCA GAAATCCATC
CATCGTATCC GCTCGAAAGG CGAATATGGT GTTGGTTGGG TTATGCGTTT GATGCAGCAC
TTACGCTCGT CTGAGTGGAT AGATGTTCTG TGCAATAAAA ACACGATTGT CCATTTTACA
AGCGACAAAC TTGGTTCATT AGAATGCCCC CTGCCGCCAC CAGACGAACA AGCCTCCATC
GCCGCCGCCC TCGACCGCGA AACTGCCCGT ATTGATGCGC TGATCCAGAA GAAAACCCGC
TTTATCGAGC TGCTGAAGGA AAAGCGCCAG GCGCTGATTA CCCATGCGGT CACCAAGGGG
CTAGACCCCA ATGTAAAGAT GAAGGATTCC GGGGTGGAGT GGCTGGGGGA GGTGCCGGAG
CATTGGAGCA GTGTTCCCAT TAAGTACATG GCGCTTGAAC GAAATTCATT GTTCTTAGAT
GGTGACTGGA TTGAGAGCAA GGATATTTCG ACCGATGGGA TTCGCTATAT AACAACAGGG
AACGTCGGCG AGGGTGTGTA TAAAGAGCAA GGTTCTGGTT TCATATCTGA AGAGACGTTC
CATGCTCTTG GATGCACAGA GGTTTACGGG GGTGACGTTC TGGTATCTCG TTTGAACAAT
CCTATTGGTC GTGCTTGCAT GGTTCCAGAC CTCGGCGTGA GAGTGGTCAC GTCTGTAGAT
AACGTGATTT TTAGGCCGGA CTCAAAGTTC AATAAGAAGT TCATCGTTTA TCTCTTCAGT
AGCGAAGAGT ATTTCAAGCA CACAAGCAAT CTGGCACGCG GCGCCACCAT GCAGCGTATT
AGTCGTGGGC TTTTAGGCAA TATTCGAGTT GCTACTCCTT CGATTGAAGA ACAAACCCAA
ATCGCCCGCT TCCTCGACCA CGAAACCGCC CGTATTGATG CGTTGATTGG CAAGGCAGAG
CAAAGTATTA CCCTACTCAA AGAGCGCCGC GCCGCATTTA TCACCGCCGC TGTGACCGGC
CAGATTGATT TACGAGGAGA GCAATAA
 
Protein sequence
MSQYKAYPAY KDSGVEWIGQ VPEHWKIAPV KYHYDARLGK MIQPAAVSDR DIEVPYHRAQ 
TVQWERIVES DIKEMWASPR DIEQFSVSEG DLLICEGGDV CRAAIVKQPP EKNMIFQKSI
HRIRSKGEYG VGWVMRLMQH LRSSEWIDVL CNKNTIVHFT SDKLGSLECP LPPPDEQASI
AAALDRETAR IDALIQKKTR FIELLKEKRQ ALITHAVTKG LDPNVKMKDS GVEWLGEVPE
HWSSVPIKYM ALERNSLFLD GDWIESKDIS TDGIRYITTG NVGEGVYKEQ GSGFISEETF
HALGCTEVYG GDVLVSRLNN PIGRACMVPD LGVRVVTSVD NVIFRPDSKF NKKFIVYLFS
SEEYFKHTSN LARGATMQRI SRGLLGNIRV ATPSIEEQTQ IARFLDHETA RIDALIGKAE
QSITLLKERR AAFITAAVTG QIDLRGEQ
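
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code assumes an in-frame sequence made up only of A, C, G, and T.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (sequence must be non-empty)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence above.
first_30 = "ATGAGCCAGT ACAAAGCGTA TCCCGCGTAC"
last_9 = "GAGCAATAA"

print(translate(first_30))  # MSQYKAYPAY, the first ten residues of the protein
print(translate(last_9))    # EQ, the last two residues, followed by the TAA stop
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields in this record.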