Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1574 |
Symbol | |
ID | 6409231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 1683348 |
End bp | 1685012 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642711466 |
Product | sulfatase |
Protein accession | YP_001990581 |
Protein GI | 192289976 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGCGCAACGT GCTGTTCATC ATGTGCGACC AGCTGCGATA CGACTACCTA GGCATATCTG GTCATCCGCG ATTGAAGACC CCGAACATCG ATGCACTGGC GCGACGCGGC GTCCGCTTCT TCAATGCCTA TGTGCAGTCG ACGATCTGCG GCCCGTCCCG GATGAGCACC TATACCGGCC GTTATGTGCG GTCCCATGGT TCGACCTGGA ACGGCATCCC GCTGCGGGTC GGTGAACCGA CCCTGGGTGA TCATCTCAAA GAGATCGGCG TCCGTAACGT CCTGGTCGGC AAGACCCACA TGGTGCCGGA TCGCGAGGGC ATGGCGCGGC TCGGCATCGT GCCGGATTCG CTGATCGGCG TTCACGTCTC GCAATGCGGC TTCGAGCCGT ACGAGCGCGA CGACGGTCTG CATCCGGACG GTCCATACGA TCCGGCGCCG GACTACGACG CCTATCTGCG CAGCCAGGGC TTCGACGCCG GCAACCCGTG GGAGGCGTGG GCGAATTCCG CCGAGGGCGG CGACGGCGAA CTGCTCAGCG GCTGGCTGCT CTCGCATGCC GACAAGCCGG CGCGCGTCCC CGATGAACAT TCGGAAACGC CCTATATCAC CCGCCGGGCG ATCGAGTTCA TCGGCGAGGC GGAAGCCGAT GGGCGGCCAT GGTGTTTGCA CCTGTCATAC ATCAAGCCGC ACTGGCCCTA TATCGTGCCG GCGCCGTATC ACGATCGCTA CGGTGCAGAC GACGTCCTGC CGGTGGTGCG GTCCGATCGT GAGCGGCAGC ACCCGCATCC GATCTTCGCC GAGTTTCAGC ACGAGCGCGT GTCCCAGGCG TTCTCGCGGC CGGGCGTACG TGAACGGGTA ATCCCGGCCT ATATGGGGTT GATCGAACAG ATCGACGACC AACTCGGGCT GCTGTTCGCC TATCTCGACG AACGCGGACT GACCGACGAC ACCCTGATCG TGTTCACCTC CGATCACGGC GATTATCTCG GCGACCACTG GCTCGGCGAG AAGCAGATGT TTCACGACGT CTCGGTGAAG GTACCGTTGA TCGTGGTCGA TCCGTCGCCT GCAGCCGACG CTACGCGCGG CACGGTTTCG GAGGCGCTGG TCGAGCAGAT CGATCTGGCG CCGACCTTCC TCGATTACTT CGGCGGCCGG CCCAAGCCGC ACATTCTCGA GGGACGGTCG CTGCTGCCGC TGCTGCGCTG CGAGCGCGTC GAAAACTGGC GATCCTACGT CTTCTCGGAA TACGACTACG CACTGGATCG CGCTCGCATC TCGCTCGGAA CGCCGGTGCC TGATTGTCGG CTGACGATGG TGGCCGATGG TCGCTGGAAG GCGGTGTTCG TCGAGGGATT CCGCCCGATG CTGTTCGACG TCGACAATGA TCCGCACGAA TTCGACGATC TTGGTGACAG CGAAGATCAT GCCGAGGTCC GGCAGCGCCT GTCCGATGCG TTCTTCGCCT GGGCACGGCG GCCGCGCAGC CGAATCACTC GCTCGGACGA TGCAATTGCC GCGAAGGATG AGGCGCAGCG TGCCTACGAT CGCAATATCG AATCCGGCGT CCTGATCGGC TATTGGGACG AGACGGAGCT CGCCGAGGAA CGCGCCAAGC GCGCCCGATA TCTGGCATTG CGCCGACCAG ACTGA
|
Protein sequence | MTTKRNVLFI MCDQLRYDYL GISGHPRLKT PNIDALARRG VRFFNAYVQS TICGPSRMST YTGRYVRSHG STWNGIPLRV GEPTLGDHLK EIGVRNVLVG KTHMVPDREG MARLGIVPDS LIGVHVSQCG FEPYERDDGL HPDGPYDPAP DYDAYLRSQG FDAGNPWEAW ANSAEGGDGE LLSGWLLSHA DKPARVPDEH SETPYITRRA IEFIGEAEAD GRPWCLHLSY IKPHWPYIVP APYHDRYGAD DVLPVVRSDR ERQHPHPIFA EFQHERVSQA FSRPGVRERV IPAYMGLIEQ IDDQLGLLFA YLDERGLTDD TLIVFTSDHG DYLGDHWLGE KQMFHDVSVK VPLIVVDPSP AADATRGTVS EALVEQIDLA PTFLDYFGGR PKPHILEGRS LLPLLRCERV ENWRSYVFSE YDYALDRARI SLGTPVPDCR LTMVADGRWK AVFVEGFRPM LFDVDNDPHE FDDLGDSEDH AEVRQRLSDA FFAWARRPRS RITRSDDAIA AKDEAQRAYD RNIESGVLIG YWDETELAEE RAKRARYLAL RRPD
|
| |