Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2202 |
Symbol | |
ID | 6409862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2386502 |
End bp | 2388292 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642712086 |
Product | sulfatase |
Protein accession | YP_001991198 |
Protein GI | 192290593 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACC AAACCAACAA CACCGCACTG GGGTCGCGCC GCGACTTTCT CGGCCTCGCG ATGGGCGCCG TCGCCGCCGG CACGTCGTCC ACGGTGCTGG GGCCGACGAC GGCCGCCGCG CAGGCGCAGC CGGGCGGCGG GAGCCTGCCG CGGAAGCGAT CGTCGCGGCG GCCGAACATC GTCTTCATCT TCAGCGATCA GGAGCGATTT GCATCGACGT GGCCGAAGGG CCTGTCGCTG CCCGCTCACG AACGCCTGAT GCGGACCGGC ACCACGTTCC TCAATCACTA TTGTCCCGCG GTCATGTGTA CGTCGTCGCG CGCGGTTCTG CTGACCGGTT TGCAGACCGC CGACAACCGC ATGTTCGAAA ACTGCGATGT GCCGTGGGTC GGCAACCTCT CGACCAAGAT TCCCACCGTC GGCCACATGC TTCGCAAGGC CGGCTACTAC ACAGCCTACA AGGGCAAATG GCACCTCAAT CGGAAGTTCG ATACCCAGGA AACCGATCGG CTGTTCACCA AGGAGATGGA CGACTACGGC TTCTCCGACT ATTTCTCGCC AGGGGACATC ATCGGCCACA CGCTCGGCGG CTATCAGTTC GATCCGCTGA TCGCCTCGAG CGCGATCACG TGGCTGCGGC GTAACGGACG GCCGCTGACC GACGACGACA AGCCGTGGGC GCTGTTCGTC AGCTTGGTCA ATCCCCACGA CATCATGTAC TTCAACACCG ACCGTCCCGG CGAGAAGGTG CAGGATACGG GGACGCTGAT CAAGCATGCG GCCCGTGCGC CCGAGCATGA AATGTTCAAG GCGACCTGGG ACGTCTCGGT GCCGAAAAGC TACAAGGAGC CGTTCGACGC GCCGGGCCGT CCGAAGGCCC ACGGCGAATT CCTGCAGATA TGGGACTACG TCCTCGGCCA TATTCCGCCG GAAGAAGAAC GGTGGCGGCG ATTCCACGAC TACTACGTCA ACTGCACGCG ATCGGTCGAC GGGCAGGTCG ACCGGATCCT GCAGGAGCTC GACGCGCTCG GTCTGACCGA CAATACGGTG ATCTGCTTCA CCTCCGACCA CGGCGAGGCG GCGGGCGCCC ACGGCCTCCA TGGCAAGGGG CCGTTCGCCT ATGAGGAGAC GGTCCACCTG CCGTTCTTCA TGGTCCATCC CGACGTTCGC GGCGGTCAGG ACTGCCGCGC GCTGACGGGA CACATCGACG TCGTGCCGAC GCTGCTGTCG ATCGCCGGCG TTTCTCCTGA AAAGATCGCC GGCATCGCGG GGCGGCAGCT GCCCGGGAAG GATTTTTCGT CGGTGCTGAC GAATCCCTCC AGCGCGGACA TCCATGCGGT GCGCGATGCG ATCCTCTTCA CCTACAGCGG CCTCGGTGCA AACGACGCGA CGCTGTGGAA GACGGTCGCC GAGGCCCGTG CGGCCGGCAA GAATTCGGCC ATGGCCATTC TCAAGCAGGG CTTCAAGCCC GACATGCAGA AGCGCGGCAG CCTGCGGTCG ACCTACGACG GACGCTACAA GTTCACGCGC TATTTCGCCC CGGCCGAGCG CAATCGACCG ACCAATCTCA CCGATCTCTA CAAACACAAC GACGTCGAGT TGTTCGATCT GCAGAACGAT CCGGAGGAAA TGAACAATCT GGCGATCGAC AAGGACGCCA ACGCGTCGCT GATCTCCACG ATGAACGACA AGCTGGAACG CGTGATCAAG GCCGAGATCG GCGTCGACGA TGGACGGGAG ATGCCCAACA TCCCGCTGAT CGAGTGGAAT ATCGATCGTC CGGATCTGTA G
|
Protein sequence | MTDQTNNTAL GSRRDFLGLA MGAVAAGTSS TVLGPTTAAA QAQPGGGSLP RKRSSRRPNI VFIFSDQERF ASTWPKGLSL PAHERLMRTG TTFLNHYCPA VMCTSSRAVL LTGLQTADNR MFENCDVPWV GNLSTKIPTV GHMLRKAGYY TAYKGKWHLN RKFDTQETDR LFTKEMDDYG FSDYFSPGDI IGHTLGGYQF DPLIASSAIT WLRRNGRPLT DDDKPWALFV SLVNPHDIMY FNTDRPGEKV QDTGTLIKHA ARAPEHEMFK ATWDVSVPKS YKEPFDAPGR PKAHGEFLQI WDYVLGHIPP EEERWRRFHD YYVNCTRSVD GQVDRILQEL DALGLTDNTV ICFTSDHGEA AGAHGLHGKG PFAYEETVHL PFFMVHPDVR GGQDCRALTG HIDVVPTLLS IAGVSPEKIA GIAGRQLPGK DFSSVLTNPS SADIHAVRDA ILFTYSGLGA NDATLWKTVA EARAAGKNSA MAILKQGFKP DMQKRGSLRS TYDGRYKFTR YFAPAERNRP TNLTDLYKHN DVELFDLQND PEEMNNLAID KDANASLIST MNDKLERVIK AEIGVDDGRE MPNIPLIEWN IDRPDL
|
| |