Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_1201 |
Symbol | |
ID | 3755266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 1228288 |
End bp | 1229811 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637782071 |
Product | sulfatase, putative |
Protein accession | YP_387697 |
Protein GI | 78356248 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.68416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTG ACCGCAGTAA AATCAAAAAC GTAGTCATGA TCATGCTGGA TACCCTGCAG TTCAACTATC TGGGCTGCTA CGGCAACAAG CAGGTCAAGA CTCCGAACCT GGACAGGTTC GCCCGTCAGT CCGTTCTTTT TGAAAATGCC TACAGCGAAG GGCTGCCTAC CATTCCGGTG CGCCGTGCGC TGATGACCGG CCGCTTCACC CTGCCTTACG GCGGGTGGAA GCCCCTTTCC GGCGACGACA CCACCATCAC CGACATCCTC TGGGGGCGCA ATGTGCAGAC CGCGCTCATT TATGACACCG CTCCCATGCG CCTTGCCAAG TTCGGCTATT CCAGAGGTTT TGACTATGTG GATTTCTGCC CCGGGCAGGA ACTCGATCAT ACCACCTTTG CCGACATGCC GCTGGATCCG GCTCTCAAGC CGGAAGATTT CACGTCGCCT TCCATGGTCT GGGACAAGGA CGGCAACCTG ATTGATGATG ACAGCAAGCA ATTGTTGGAC GAGATAGGTT GTTTTCTGCG TCAGATGCAG CACCGCCGCA GTGACGCCGA CAGTTATGTG GCCAAGGTGA TGAACCGGAC GGAATACTGG CTGCGCGAAC GGCGCGACAA AAGCCGTCCG TTCTTCCTGT GGGTGGATTC CTTTGATCCG CACGAGCCCT GGGATCCGCC GTCAGTATGG GAAGGCACTC CCTGCCCGTA CGACCCCGAC TACGAAGGAA ACCCCATTCT GCTGGCTCCG TGGACACCGG TGGAAGGACG CATTTCCGAA CGCGAATGCG AACATGTGCG CGCCCTGTAC ATGGAAAAAA TCACCATGGT GGATAAATGG GTGGGACGCA TTCTTGATTC GCTGCGCGAA CAGGGACTGA TGGACGAGAC CATGGTTGTG GTCATGTCCG ACCACGGGCA GCCCATGGGC AACGGTGAAC ACGGGCACGG CATCATGCGC AAGTCGCGCC CGTGGCCCTA TGAAGAACTG GTGCACGTAC CTCTGCTTAT GCACATTCCC GGTGTGCAGG GCGGGCAGCG CATATCGTCT TTTGTCCAGA ATGTGGATGT GACCGCCACC ATCATGGATG TGATGAATCT GGCCGACACC GAAGACGGTG TGTCCGATCA TGGCATGAAC ACATTCGGCG CCGAGAGCAT GCAGGGCGAA AGCCTGCTGC CGCTCATGCG GGGCGAGGCC GATTCTGTGC GCGAGTACGC CATTGCAGGA TATTACGGCA TGTCGTGGTC CATCATCACC GAAGACTACA GCTATGTGCA CTGGCTTGTC AGCGAAGACG AAAAGAACCG CGCCGACTGC GTGGAAGGCG CCGACAAGGA AATGTCCGAG GAGATGTGGA CCTGCACCGC AGGAGCCAAG GTTCAGATGC CGGAACATAA CGAACTGTAT GACAGGCGTA CCGACCCCTT CCAGCTGAAC AACATTGCCG CGCAGAAGCC CGAAGTGGCC GAAGAACTGC TGCAGAAGCT CAAGCTGGTC ATTGGCGGGC TGCGCACGAC CTAA
|
Protein sequence | MTFDRSKIKN VVMIMLDTLQ FNYLGCYGNK QVKTPNLDRF ARQSVLFENA YSEGLPTIPV RRALMTGRFT LPYGGWKPLS GDDTTITDIL WGRNVQTALI YDTAPMRLAK FGYSRGFDYV DFCPGQELDH TTFADMPLDP ALKPEDFTSP SMVWDKDGNL IDDDSKQLLD EIGCFLRQMQ HRRSDADSYV AKVMNRTEYW LRERRDKSRP FFLWVDSFDP HEPWDPPSVW EGTPCPYDPD YEGNPILLAP WTPVEGRISE RECEHVRALY MEKITMVDKW VGRILDSLRE QGLMDETMVV VMSDHGQPMG NGEHGHGIMR KSRPWPYEEL VHVPLLMHIP GVQGGQRISS FVQNVDVTAT IMDVMNLADT EDGVSDHGMN TFGAESMQGE SLLPLMRGEA DSVREYAIAG YYGMSWSIIT EDYSYVHWLV SEDEKNRADC VEGADKEMSE EMWTCTAGAK VQMPEHNELY DRRTDPFQLN NIAAQKPEVA EELLQKLKLV IGGLRTT
|
| |