Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1701 |
Symbol | |
ID | 6027278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 1792371 |
End bp | 1793822 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641594521 |
Product | sulfatase |
Protein accession | YP_001717832 |
Protein GI | 169831850 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCCAAGA AAAAGCTGCC GAACATCATT CTCATCGTCC TGGACACCGC CAGGGCCAAA AGCTTCTCCT GTTACGGCTA CCACCGCAAA ACCACGCCCA ACATCGACCG GATTGCGGAG GAAGGCGTGC TTTATAAATG GTGCTTTTCT CCGGCCAACT GGACCATACC TTCGCACGCC TCGCTGTTTA CCGGGCTATA CCCTTCGGAG CACGGGTGCC ATTGGGGCAA CCCGTTTTTG GACGAAAACA TTCCTACATT ACCCGAGTTG CTGCGCTCCG TTGGCTACCG TACCGTAGGT ATCTCATGTA ATGGTTTGGT TTCCAAGCTT TATGGCTTTC ACAGGGGTTT TGATCTTTTT TTCGAGCTTT GGACGCCGGA TTTATTTCCG GATCTTGAGC CGTTGCCTGG GAAGACAGCA AAGGATAAAA TCGCAAGTCT CTTTAAGTTG ATTCCAATCC ACCCCAACCG GGCAGTCAAG TATGCTGCCA GAGCGATCTG TCGCAAATTT AGATTTAGAG GCACGGTGTT GACAAACAGC ACGCCTTGGA CCTTGCGGGC TTTTAAATCA GCCCGGGAGA TTCTTAGGGG AATAGCCTCA GATACTCCCT TATTCCTGTT TGTCAATATC ATGCAGTCCC ACTATCGTTA TAATCCTCCA CGGGAAACAA AGGGGAAGTT TGGTTCGAAT GGTTTCCGAT ATGAAAGTTT TCTAACGGAG CCCTATCAAT ATTATCTGAA TTTACTTAAG CCAACCGGCT CGGTGGAAAA AATGTGGTAC ATTCTAACCG CCCTTTACGA TGAGGAACTG TTCTTTGCCG ATTTGTGTAT AGGGCAATTT TACGAGTTTC TTAAGATCTC GCATTTGCTC GACAGATCGG TCTTCGTTGT CACGGCAGAT CATGGCGAAA TGCTCGGAGA ACACGGCCTG CTAGACCATT GGTTTAGTTC TTACAACGAG CTTATCCAGG TCCCGCTGAT AATCCGGTAT CCTGGGGCTA TGCAGAGGAG CGTCATTTCG GATCTGGTGC AGACTCATGA TCTATTCGGT ACCGTTTGTG ATATTGCTGG GCTACCTTAC CCTACACCGA TGGGATTAGT TTCCCTTGTA GGCACTCAGA GGCGTCGGTG GGCCTTCACC CAGGACATAG ATCCGCTGGT GGACGTACTG GCTTTGAGAA GACGCCAGCC CGAGTGGTCG AGCGATGGAT GGTGGTGCCG CCCCCACATG ACGGCTGTTA ATTCCGAGTT ACGAAAGATC GTTAAGACCA CCGACGGTCG GATTTTGTGC TTTGATCTTA GTCAGGACGC TAACGAATTA CACCCGAAAG TGCCCCCTCT TGTGGATGAG GATCTTCTAA AATTGATTAC AGAGTTGGAG GATACCTATA ACTGGAGTCG AGCCGTCAAA TCGTGCGAAG AAATGCTGCT GAATAGTAAA TTGGGAGGGT AA
|
Protein sequence | MPKKKLPNII LIVLDTARAK SFSCYGYHRK TTPNIDRIAE EGVLYKWCFS PANWTIPSHA SLFTGLYPSE HGCHWGNPFL DENIPTLPEL LRSVGYRTVG ISCNGLVSKL YGFHRGFDLF FELWTPDLFP DLEPLPGKTA KDKIASLFKL IPIHPNRAVK YAARAICRKF RFRGTVLTNS TPWTLRAFKS AREILRGIAS DTPLFLFVNI MQSHYRYNPP RETKGKFGSN GFRYESFLTE PYQYYLNLLK PTGSVEKMWY ILTALYDEEL FFADLCIGQF YEFLKISHLL DRSVFVVTAD HGEMLGEHGL LDHWFSSYNE LIQVPLIIRY PGAMQRSVIS DLVQTHDLFG TVCDIAGLPY PTPMGLVSLV GTQRRRWAFT QDIDPLVDVL ALRRRQPEWS SDGWWCRPHM TAVNSELRKI VKTTDGRILC FDLSQDANEL HPKVPPLVDE DLLKLITELE DTYNWSRAVK SCEEMLLNSK LGG
|
| |