Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2738 |
Symbol | |
ID | 3970291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 2973052 |
End bp | 2974335 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637925848 |
Product | sulfatase |
Protein accession | YP_532605 |
Protein GI | 90424235 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.15562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCTC AGCTGGCCAG CCTTCTGCAG CAGCACGCCT CGCGGCCCAT CGAAGCAGAG AACCAGATGT CGCTCAGATC GATCATCACC CTCGGTGTTG CCGTCGCGCT CGCCACGCTG ATGGCTGTCC CGAACGTTGC GGCGCAGGCG CGCCCCGATC CCCGTCCGGC GCGTGTCATC ATCTTCCACA TGGACAGCAT GCTCGCCGAC GCGCCGGAGC GCCTCAGCCT GAGCAACTGG CTCGCGGTCG CGGCCGAGGG CACCCGCGCG AGCGAGATGA CCACCGTGAT TCCGTACCAT CCGACGGATT CCGGCTACTT CGTGCTCAGC ACGACCTCGT TTCCCAACCC GACGACGGCC GCCGGCACGC TCTTCCTCGA GCCAGCCATC GAACAGACCT ATCTCCAGCA TCGCTTCAAA GGCCACACCG CCTTGATCGC CGGCTCGACG GCCTACCGGT CGATCGGCGA AGGATTCACT TACACCAATC TGTCTCAGGC ACTCACCGAC GAGCGGGTTG TCGAGGAGGG GCTGCGGCAA TTGCGCGAGC ACCCCGACCT GAGCTTCATG CGCCTGATAC TGCAGGACGC CAACGCCGTG TTGCAGCGTG TCGGCTTCAC CCGGGAGAAT GTGCCCTGGC GCGGCGACGC CTACGGGGAG GGCTCGCCCT ACTTTGCATC GCTGCGGCGG GCAGACGCGC TGCTCGGCCG CTTCGTCGAC GAGTTGAAGC GGATGGACAA GTACGAGGAC ACGCTGCTGG TGCTGATGCC CGATGGCGCC GCCCGCGGCG GCTGGCACGG CCCGCAGCAG GAAGAGAGCT GGCGCTTGCC CTTCGCACTA CGCGGTCCGG GAATCGCGAA ACGGCGTGTG ATCGGCTATG CCGAGAATAT CGACGTGGCG CCGACCATCG CCGCCATGAT GGGTGTCGAG CCGCCGAACG CCGACGGAGC CTCGGGACGC GTCCTGACCG AGGTCATGGC GGGGCAACCC GCGACGGCGG CCGGCGGCGC GCGCCGCATC GAGCGCCTCA ACCGCCAACA CAAAGAATAC TTGCGTCTCA CCGGCTGGAT GCAGGTTCAT GCCGGCCGCT ACCCGCTGCT CGACCTGGCG TGGATGGCGT CGCACAACCG GCTGGTACAA CCGACGCGGT TCTGGGACCT TTCCAGCATC GACGAGTGGC GCCGCGCCGG GAGCTTCGAT CGAATGCTCG CGGACAACGA GGCTGCGCTC GTCGCTCTTC GCGATGCTCT CGCGCGATCG GGCGCGCCGG CTCTGCCGGA TTGA
|
Protein sequence | MSAQLASLLQ QHASRPIEAE NQMSLRSIIT LGVAVALATL MAVPNVAAQA RPDPRPARVI IFHMDSMLAD APERLSLSNW LAVAAEGTRA SEMTTVIPYH PTDSGYFVLS TTSFPNPTTA AGTLFLEPAI EQTYLQHRFK GHTALIAGST AYRSIGEGFT YTNLSQALTD ERVVEEGLRQ LREHPDLSFM RLILQDANAV LQRVGFTREN VPWRGDAYGE GSPYFASLRR ADALLGRFVD ELKRMDKYED TLLVLMPDGA ARGGWHGPQQ EESWRLPFAL RGPGIAKRRV IGYAENIDVA PTIAAMMGVE PPNADGASGR VLTEVMAGQP ATAAGGARRI ERLNRQHKEY LRLTGWMQVH AGRYPLLDLA WMASHNRLVQ PTRFWDLSSI DEWRRAGSFD RMLADNEAAL VALRDALARS GAPALPD
|
| |