Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1640 |
Symbol | |
ID | 4022120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1841631 |
End bp | 1844099 |
Gene Length | 2469 bp |
Protein Length | 822 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637961835 |
Product | sulfatase |
Protein accession | YP_568778 |
Protein GI | 91976119 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.139745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.861297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAT TGGCTTTGAA GCGATTGGTT TTGGTTGTTA CGCCACTGTT GTTGCTGGCA TCGTGGATCG GCTACGCAAG CGCTCAGCAG GCGGCTCCGA TCGACCCGGC TCCGATCAAC CCGTCGATGC CCACCGATCG CTCGGTATTG CCGATTCCTG AACCGCAGTA CCCGCACAGC ACGGTGTTCG ACGTCCGCAA CGCAACGCCG CCGCCACGGT TCGAAGTCAA GGCTCCGGCG GGCGCGCCCA ACGTAGTCAT CGTTCTGATC GACGATATGG GTTTTGGCCA ATCCAGCGCA TTCGGCGGAC CCGTCAGGAT GCCGACAGTC GAAGGCCTGG CCAATCAGGG ACTCCGCTAC AATGAGTTTC ACACCACGGC GCTTTGCTCG CCGACACGCG CCGCGCTGCT CAGCGGACGC AATCACCACA TCAACAACAT GGGCTCGATT ACGGAAACGG CGACGTCGTT TCCCGGGCAA ACCGGGCAGC GCCCGAACAG CGTGGCATCG GTCGCAGAGA TTCTGCGGCT GAACGGCTAC AGCACCGCCC ACTTCGGCAA GAACCACGAA ACCGCGGCAT GGGAGGTCAG TCCGTCCGGT CCGACCGACC GCTGGCCGAC GCGTCAAGGG TTCGACAAGT TCTACGGATT CATGGGGGGC GAGACCAACC AATGGGCGCC CCTTCTGTAT GACGGCATGG CCCAGGTCGA ACTACCAAAA GACCCGAACT ATCATTTTAT GACCGACATG ACGAATCATG CGGTCGACTG GATGAAGTCG CAGAAGGCGC TGACCCCGGA CAAGCCGTTC TTCATCTACT TCGCGCCCGG CGCCACACAC GCGCCGCATC AGGTGCCGAA GGAATGGATC GCTAAATACA AAGGCAAATT CGATCAGGGC TGGGACCAGC TTCGCGAAGA GACCCTGGCG CGGCAGATCA GGCTCGGCGT GGTGCCGGCC GGCACCAAGC TCGCCCCGAA GCCCGAGCCG ATCAAGGATT GGGCCACGCT GAACGCAGAC GAGAAGAAGC TGTTCGCGCG CCAGATGGAA GTCTTCGCCG GTTTCGGCGA ATACGCCGAC ACCGAAATCG GCCGTCTGAT CGAGGCCATC AGGCAAACGG GTCAGCTCGA CAATACGCTG ATCTTCTACA TCGTCGGCGA CAACGGCGCG AGCGCTGAAG GCGGCATGAA CGGCCTGTTC AACGAGATGA CCTATTTCAA TGGCGCTCAG GAAACCGTTC AGGATGTTCT GAAGCACTAC GACGAGCTGG GCGGCCCGAA CACCTATGGC CACTATGCAG CCGGTTGGGC GATTGCAGGC GATACGCCGT TCACCTGGAC CAAGCAGGTG GCCTCCAGCT ACGGCGGTAC CCGTAACGGC ATGGTGATCC ACTGGCCCAA GGGCATCTCC GCAAAGGGTG AGGTGCGCTC GCAATGGCAT CACGTCATCG ATGTCGTGCC CACAATTCTC GAAGCGGCGA GTTTGCCGGA GCCGAGCGCC GTCAACGGCA CGCCTCAACT GCCGATCGTC GGCAACAGCA TGGTGTATAC GTTCGCCGAC CCGAAAGCGG CGAGCACGCA CAAGACCCAG TACTTTGAGA TCTTCGGCAA TCGCGCGATC TACAGCGACG GATGGCTGGC CGGAACGGTT CACCGAGCGG CATGGGAAAC CAAGCCCCGC AGGGCGCTCG AACAGGATGT CTGGGAACTC TACGATACGC GGTCGGATTT CAGCCTCGTC AACGATCTGG CGGCGACGAA TCCCGACAAG CTGAAGGAGT TGCAGGATCT GTTCATGAAG GAGGCGGAGA AGAACTCCGT CCTGCCACTC GACGATCGAA CCCTGGAGCG CACCAACGCA GCTCTGGTCG GACGCCCGGA TCTGATGGCC GGTCGAACCA CGCTGACGGT TTACGAGGGA ATGATCGGGA TGTCTGAAAA CGTCTTCATC AATCTCAAGA ACCGGTCTCA CACGGTCACC GCCGAGGTGG ACGTTCCGAA GGCCAACGCC AACGGCGTCC TGATGGCCCA GGCCGGGCGA TTTGGCGGCT GGAGCCTGTA TGTGAAGAAC GGCAAACCGG TTTACACCTA CAACTGGCTC GGCCTGAAAC GGTTCAGTAT CGCCGGCAAA CAGCCGATAC CGGCCGGCAA AGCAACGATC CGTTTCGAGT TTGTCTACGA CGGCGGGGGG CTCGGAAAGG GCGGCCTTGG CACCCTTCTG GTCAACGGAA AACCCGCTGC TTCAGGTCGC ATCGATCAGA CCCAATGCTG CTTCTACTCG GCCGACGAAG GCGCCGATGT CGGCGCCGAC GAAGGAACGC CCGTGACCGA AGACTACAAG TCGCCGTTCA AATTCACCGG AAAGATCTCG TCAGTGACGA TCGAGCAGAA AGAGATGAAG AAGACCGAAA GCGAGGACGC CGTTCAGGCT CGCAAGGCGG CGCTGTTGAA GAAGGGACTG TCGGATTGA
|
Protein sequence | MKRLALKRLV LVVTPLLLLA SWIGYASAQQ AAPIDPAPIN PSMPTDRSVL PIPEPQYPHS TVFDVRNATP PPRFEVKAPA GAPNVVIVLI DDMGFGQSSA FGGPVRMPTV EGLANQGLRY NEFHTTALCS PTRAALLSGR NHHINNMGSI TETATSFPGQ TGQRPNSVAS VAEILRLNGY STAHFGKNHE TAAWEVSPSG PTDRWPTRQG FDKFYGFMGG ETNQWAPLLY DGMAQVELPK DPNYHFMTDM TNHAVDWMKS QKALTPDKPF FIYFAPGATH APHQVPKEWI AKYKGKFDQG WDQLREETLA RQIRLGVVPA GTKLAPKPEP IKDWATLNAD EKKLFARQME VFAGFGEYAD TEIGRLIEAI RQTGQLDNTL IFYIVGDNGA SAEGGMNGLF NEMTYFNGAQ ETVQDVLKHY DELGGPNTYG HYAAGWAIAG DTPFTWTKQV ASSYGGTRNG MVIHWPKGIS AKGEVRSQWH HVIDVVPTIL EAASLPEPSA VNGTPQLPIV GNSMVYTFAD PKAASTHKTQ YFEIFGNRAI YSDGWLAGTV HRAAWETKPR RALEQDVWEL YDTRSDFSLV NDLAATNPDK LKELQDLFMK EAEKNSVLPL DDRTLERTNA ALVGRPDLMA GRTTLTVYEG MIGMSENVFI NLKNRSHTVT AEVDVPKANA NGVLMAQAGR FGGWSLYVKN GKPVYTYNWL GLKRFSIAGK QPIPAGKATI RFEFVYDGGG LGKGGLGTLL VNGKPAASGR IDQTQCCFYS ADEGADVGAD EGTPVTEDYK SPFKFTGKIS SVTIEQKEMK KTESEDAVQA RKAALLKKGL SD
|
| |