Gene Dde_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_1858 
Symbol 
ID3756861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp1906851 
End bp1907900 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content58% 
IMG OID637782742 
Productrestriction endonuclease-like 
Protein accessionYP_388350 
Protein GI78356901 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTTA AAACGGCCGC CATTCAGGTT TTGCAGCAGG CCGGAACGGA ACTGCACGCC 
AAGGATATCG CCGAGCAGAT CATGGCTGCC GGTCTCTGGC AATCCGGAGG GAAAACCCCA
GACGCCACTG TCAGCGCCCG GCTCTACTCC GACATCAAGA ACAACGGAGA CAAGTCACCC
TTTGTAAAGG TCGGCCCTCA GACCTTCGCG CTTCGGGATT CCGCTGAAAT ACCGAGCGGC
GCTGAACCGG TTCCTGCGTT CGTCGAGGAC ACTCCAAAAC CGCCTTCTGT AAATGCAGGT
TTCTCCTTCA CCGATTGCGC TCAGAAAGTG CTTGAGACGT TCGGCGGCAA GAAGCCGATG
CATTACAAAG AGATCACCGA GAAGGCCCTG CAAAAAGGCT GGCTGGTAAC CGGCGGCAAG
ACGCCCGAGG CCACCATGTA CGCCCAGGTG ATCACCGAGA TCAAGCGCCA GCAGAAATGT
GGTGAGCGGC CCCGCTTCGT TCAGCACGGC CGTGGCAATG TGGGCCTGAG CCAATGGATG
GGGCGTGGGT TGGCGTTCCA GATCGAGCAG CACAACCACC AGGTCCGGAA AGTCTTGCGC
GAACGACTGC TGGCCATGAA GCCCGGCGAG TTCGAGGAAC TTATCTCGCA GTTGCTGGCG
GAGATGGGTT TCGAGATGGT CGAGGTAACC AAACTCAGCG GAGACGGCGG CATCGATGTC
CGGGGCACCT TGGTGGTCGG TGACGTGGTC CGCATCAAGA TGGCCGTCCA GGTCAAGAAA
TGGAAGCTCA AGAACAACAT CCAGGCTCCG GTGGTACAGC AGGTGCGCGG CAGTTTGGGG
GCGCACGAGC AAGGCCTGAT CATCACCACC AGCGACTTCA GTGCCGGAGC CATCAAGGAA
GCGGCCCAGT CCGACAAGAC CCCAATCGCC CTGATGAACG GGGAACAGCT TGTAATGCTG
CTGATGGAAC ACGGCATCGG CGTCCATCGC TCGACGCCTG ATCTTTTTGA AATTGATGAA
GAGTGTGCCG TAAGAGCTGA AACAGAATGA
 
Protein sequence
MDVKTAAIQV LQQAGTELHA KDIAEQIMAA GLWQSGGKTP DATVSARLYS DIKNNGDKSP 
FVKVGPQTFA LRDSAEIPSG AEPVPAFVED TPKPPSVNAG FSFTDCAQKV LETFGGKKPM
HYKEITEKAL QKGWLVTGGK TPEATMYAQV ITEIKRQQKC GERPRFVQHG RGNVGLSQWM
GRGLAFQIEQ HNHQVRKVLR ERLLAMKPGE FEELISQLLA EMGFEMVEVT KLSGDGGIDV
RGTLVVGDVV RIKMAVQVKK WKLKNNIQAP VVQQVRGSLG AHEQGLIITT SDFSAGAIKE
AAQSDKTPIA LMNGEQLVML LMEHGIGVHR STPDLFEIDE ECAVRAETE