Gene Dde_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_1201 
Symbol 
ID3755266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp1228288 
End bp1229811 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content57% 
IMG OID637782071 
Productsulfatase, putative 
Protein accessionYP_387697 
Protein GI78356248 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.68416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTTG ACCGCAGTAA AATCAAAAAC GTAGTCATGA TCATGCTGGA TACCCTGCAG 
TTCAACTATC TGGGCTGCTA CGGCAACAAG CAGGTCAAGA CTCCGAACCT GGACAGGTTC
GCCCGTCAGT CCGTTCTTTT TGAAAATGCC TACAGCGAAG GGCTGCCTAC CATTCCGGTG
CGCCGTGCGC TGATGACCGG CCGCTTCACC CTGCCTTACG GCGGGTGGAA GCCCCTTTCC
GGCGACGACA CCACCATCAC CGACATCCTC TGGGGGCGCA ATGTGCAGAC CGCGCTCATT
TATGACACCG CTCCCATGCG CCTTGCCAAG TTCGGCTATT CCAGAGGTTT TGACTATGTG
GATTTCTGCC CCGGGCAGGA ACTCGATCAT ACCACCTTTG CCGACATGCC GCTGGATCCG
GCTCTCAAGC CGGAAGATTT CACGTCGCCT TCCATGGTCT GGGACAAGGA CGGCAACCTG
ATTGATGATG ACAGCAAGCA ATTGTTGGAC GAGATAGGTT GTTTTCTGCG TCAGATGCAG
CACCGCCGCA GTGACGCCGA CAGTTATGTG GCCAAGGTGA TGAACCGGAC GGAATACTGG
CTGCGCGAAC GGCGCGACAA AAGCCGTCCG TTCTTCCTGT GGGTGGATTC CTTTGATCCG
CACGAGCCCT GGGATCCGCC GTCAGTATGG GAAGGCACTC CCTGCCCGTA CGACCCCGAC
TACGAAGGAA ACCCCATTCT GCTGGCTCCG TGGACACCGG TGGAAGGACG CATTTCCGAA
CGCGAATGCG AACATGTGCG CGCCCTGTAC ATGGAAAAAA TCACCATGGT GGATAAATGG
GTGGGACGCA TTCTTGATTC GCTGCGCGAA CAGGGACTGA TGGACGAGAC CATGGTTGTG
GTCATGTCCG ACCACGGGCA GCCCATGGGC AACGGTGAAC ACGGGCACGG CATCATGCGC
AAGTCGCGCC CGTGGCCCTA TGAAGAACTG GTGCACGTAC CTCTGCTTAT GCACATTCCC
GGTGTGCAGG GCGGGCAGCG CATATCGTCT TTTGTCCAGA ATGTGGATGT GACCGCCACC
ATCATGGATG TGATGAATCT GGCCGACACC GAAGACGGTG TGTCCGATCA TGGCATGAAC
ACATTCGGCG CCGAGAGCAT GCAGGGCGAA AGCCTGCTGC CGCTCATGCG GGGCGAGGCC
GATTCTGTGC GCGAGTACGC CATTGCAGGA TATTACGGCA TGTCGTGGTC CATCATCACC
GAAGACTACA GCTATGTGCA CTGGCTTGTC AGCGAAGACG AAAAGAACCG CGCCGACTGC
GTGGAAGGCG CCGACAAGGA AATGTCCGAG GAGATGTGGA CCTGCACCGC AGGAGCCAAG
GTTCAGATGC CGGAACATAA CGAACTGTAT GACAGGCGTA CCGACCCCTT CCAGCTGAAC
AACATTGCCG CGCAGAAGCC CGAAGTGGCC GAAGAACTGC TGCAGAAGCT CAAGCTGGTC
ATTGGCGGGC TGCGCACGAC CTAA
 
Protein sequence
MTFDRSKIKN VVMIMLDTLQ FNYLGCYGNK QVKTPNLDRF ARQSVLFENA YSEGLPTIPV 
RRALMTGRFT LPYGGWKPLS GDDTTITDIL WGRNVQTALI YDTAPMRLAK FGYSRGFDYV
DFCPGQELDH TTFADMPLDP ALKPEDFTSP SMVWDKDGNL IDDDSKQLLD EIGCFLRQMQ
HRRSDADSYV AKVMNRTEYW LRERRDKSRP FFLWVDSFDP HEPWDPPSVW EGTPCPYDPD
YEGNPILLAP WTPVEGRISE RECEHVRALY MEKITMVDKW VGRILDSLRE QGLMDETMVV
VMSDHGQPMG NGEHGHGIMR KSRPWPYEEL VHVPLLMHIP GVQGGQRISS FVQNVDVTAT
IMDVMNLADT EDGVSDHGMN TFGAESMQGE SLLPLMRGEA DSVREYAIAG YYGMSWSIIT
EDYSYVHWLV SEDEKNRADC VEGADKEMSE EMWTCTAGAK VQMPEHNELY DRRTDPFQLN
NIAAQKPEVA EELLQKLKLV IGGLRTT