Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2522 |
Symbol | |
ID | 4023013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2821921 |
End bp | 2824290 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637962715 |
Product | sulfatase |
Protein accession | YP_569653 |
Protein GI | 91976994 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCT TGCGAAAGAC GCTAGCAGGC GCCGTCAGCC TATCGCTGGC GGTATTGTTC TCGCCGATCG TCAATGCCCA GGAAATCCTG CCGTTTCCTC CGGCGCCATC GGGCTCCGTT GCCGGCCTGA CGATCCAGGA CTCGGTTTAC AGCAAACGCG TCGAACCAAA ACGGCTCGCC GATGGTGCAC CGAACATTCT GATCATCCTG ATGGACGATG TCGGCCCGGC GACGCCGTCG ACCTATGGCG GCGAAATCAA TACGCCAACG CTTGATCGCG TCGCCAGCAT GGGTGTCTCC TATAATCGCT TCCACTCCAC GGCGATGTGC TCGCCGACGC GTGCGTCTCT GCTCACCGGA CGCAACCACA CCTTCGTCGG AAATGGCCAG ATCGCCGCGA TTGCCAATGA TTTTGACGGG TTCAGCGGGA CTATCCCAAA ATCATCCGCG ACGGTTGCAG AGGTGCTGAA GAACTACGGC TACAATACCG GAGCTTGGGG CAAGTGGCAC AACACTCCCG AGGAGCAGAT CACTTCCAAA GGCCCCTTTG AATATTGGCC GACCGGTTAT GGCTTCGAAT ATTTCTACGG TTTCCTCGCC GGCGAAGCCT CACAGTATGA ACCGACCCTG GTGCGCAACA CTACTCAGGT CACCGAAGAG CCGCGAAAAG GCTACCACCT GACTGACGAT ATCGCGGCTG ACGCAATCAA ATGGCTGCGC GAGCAGAAGG CTTACGCACC GGACAAGCCG TTCTTCATGT ACTGGGCGCC AGGCGCTTCG CACGGCCCGC ATCAGATCAT GAAGGAGTGG GCCGATAAGT ATAAAGGCAA GTTCGACGAC GGCTGGGACG CTTACCGCGA GCGCGTGTTC AAGCGCGCCA GGGAGAAGGG CTGGATCCCG CAAAATTCTC AATTGACGCC GCGCCCTGAA TCGATGGCCT CCTGGGCTTC GATCCCCGAG GATGAAAAGC CATTCCAGAG CCGCCTCATG GAAGTCTTCG CCGGTTTTAC CGAGCACGCT GACTACAACG CAGGCCGTGT GATCGATGAG ATCAAGCGGC AGGGCAGGCT CGACAACACA TTGATCTTCT ACATCTGGGG CGACAACGGA TCCTCGGCAG AGGGGCTTAA CGGCACCATC AGCGAGCAAC TGGCACAGAA CGGCATTCCT ACCAAGATTT CGCAACACCT CGAGGCGTTG AAGGAGCTTG GCGGGCTTGA AGCACTGGGT GGTCCAAAGA CCGACAATAT GTACCACGCG GCGTGGGCCT GGGCCGGAAG CACGCCCTAT AAGTCGACCA AACTCGTCGG AGCACATTTC GGAGGCACAC GCCAACCGAT GGCGGTTGCA TGGCCGAAGG GCATCAAGCC GGATCCGACC CCTCGACCCC AATTCCATCA TGTGATCGAC ATCGTCCCAA CTATCTACGA TCAGCTCAAG ATCACTCCGC CCCGCGTGGT CAATGGATTC GAGCAAGATC CGATCCACGG CGTGAGCATG AGCTATACGC TCGCCGACGC CACGGCGCCG GGGCGACGGA AGACGCAGTT CTTTGACATC ATGGCGAGCC GCGGAATCTA TCATGACGGC TGGTTCGCAA GCGCCCCCGG ACCGCGCGAG CCTTGGGTAG GCGGGATACC CAAGGGCATT CGGGAATGGT CGCCTCTGAC CGACAAATGG GAGCTTTACA ATCTCGACAA AGACTGGAGC CAGGCCAACG ATCTTGCTGC CGCCGAACCG CAAAAACTGA CGGAGATGAA GTCGTTGTTT CTGATCGAAT CCACCAAGAA CAAGAACCTG CCGATTGGCG GCGGTCTGTG GTCCACGGCG CTGTACCATC CGGAAGATGC TCCGGCCTCA AATCTCACCG AATGGACGTT CGATGGTCCG ATGATGCGGA TGCCGGAATC CGCCGCGCCC AAACTCGGCA AGGTGGACAG CCTTGTCAGC ATGGAGGTGG ACCTGCCAGC GAACCCGAAC GGCGTGCTCT ATGCGCTGGC CGGATTCTCC GGCGGCGTCA CATGCTACGT CAAAGACGGC ATTCTCAGCT ACGAGTTCAA CCTGTTCGAG ATCACGCGCA CCAAGATCAG GGCGAAGGAA AAGCTGGCCG CCGGCAAGGC GAAGATCGAG GTCGAGTCGA AACTCGTCGA CAAAATCGGC GGGGCGATGA ACGTCACGCT GAAGGTCAAC GGGAAGGCGG TAGCGCAGGG CCAAGTGCCA GCGGCGATGT CGCTTCACTT CACGTCGAAT GCCACCTTCG ACATCGGTAG CGATCTCGAT TCGCCGGTGT CGCTCGACTA TTTCGACAAG GCACCGTTTG CCTTCAACGG CACGATCGGA ACGACGAAGG TCACCTATTT GAAGAAGTAG
|
Protein sequence | MEFLRKTLAG AVSLSLAVLF SPIVNAQEIL PFPPAPSGSV AGLTIQDSVY SKRVEPKRLA DGAPNILIIL MDDVGPATPS TYGGEINTPT LDRVASMGVS YNRFHSTAMC SPTRASLLTG RNHTFVGNGQ IAAIANDFDG FSGTIPKSSA TVAEVLKNYG YNTGAWGKWH NTPEEQITSK GPFEYWPTGY GFEYFYGFLA GEASQYEPTL VRNTTQVTEE PRKGYHLTDD IAADAIKWLR EQKAYAPDKP FFMYWAPGAS HGPHQIMKEW ADKYKGKFDD GWDAYRERVF KRAREKGWIP QNSQLTPRPE SMASWASIPE DEKPFQSRLM EVFAGFTEHA DYNAGRVIDE IKRQGRLDNT LIFYIWGDNG SSAEGLNGTI SEQLAQNGIP TKISQHLEAL KELGGLEALG GPKTDNMYHA AWAWAGSTPY KSTKLVGAHF GGTRQPMAVA WPKGIKPDPT PRPQFHHVID IVPTIYDQLK ITPPRVVNGF EQDPIHGVSM SYTLADATAP GRRKTQFFDI MASRGIYHDG WFASAPGPRE PWVGGIPKGI REWSPLTDKW ELYNLDKDWS QANDLAAAEP QKLTEMKSLF LIESTKNKNL PIGGGLWSTA LYHPEDAPAS NLTEWTFDGP MMRMPESAAP KLGKVDSLVS MEVDLPANPN GVLYALAGFS GGVTCYVKDG ILSYEFNLFE ITRTKIRAKE KLAAGKAKIE VESKLVDKIG GAMNVTLKVN GKAVAQGQVP AAMSLHFTSN ATFDIGSDLD SPVSLDYFDK APFAFNGTIG TTKVTYLKK
|
| |