Gene Rpal_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2202 
Symbol 
ID6409862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2386502 
End bp2388292 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content63% 
IMG OID642712086 
Productsulfatase 
Protein accessionYP_001991198 
Protein GI192290593 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGACC AAACCAACAA CACCGCACTG GGGTCGCGCC GCGACTTTCT CGGCCTCGCG 
ATGGGCGCCG TCGCCGCCGG CACGTCGTCC ACGGTGCTGG GGCCGACGAC GGCCGCCGCG
CAGGCGCAGC CGGGCGGCGG GAGCCTGCCG CGGAAGCGAT CGTCGCGGCG GCCGAACATC
GTCTTCATCT TCAGCGATCA GGAGCGATTT GCATCGACGT GGCCGAAGGG CCTGTCGCTG
CCCGCTCACG AACGCCTGAT GCGGACCGGC ACCACGTTCC TCAATCACTA TTGTCCCGCG
GTCATGTGTA CGTCGTCGCG CGCGGTTCTG CTGACCGGTT TGCAGACCGC CGACAACCGC
ATGTTCGAAA ACTGCGATGT GCCGTGGGTC GGCAACCTCT CGACCAAGAT TCCCACCGTC
GGCCACATGC TTCGCAAGGC CGGCTACTAC ACAGCCTACA AGGGCAAATG GCACCTCAAT
CGGAAGTTCG ATACCCAGGA AACCGATCGG CTGTTCACCA AGGAGATGGA CGACTACGGC
TTCTCCGACT ATTTCTCGCC AGGGGACATC ATCGGCCACA CGCTCGGCGG CTATCAGTTC
GATCCGCTGA TCGCCTCGAG CGCGATCACG TGGCTGCGGC GTAACGGACG GCCGCTGACC
GACGACGACA AGCCGTGGGC GCTGTTCGTC AGCTTGGTCA ATCCCCACGA CATCATGTAC
TTCAACACCG ACCGTCCCGG CGAGAAGGTG CAGGATACGG GGACGCTGAT CAAGCATGCG
GCCCGTGCGC CCGAGCATGA AATGTTCAAG GCGACCTGGG ACGTCTCGGT GCCGAAAAGC
TACAAGGAGC CGTTCGACGC GCCGGGCCGT CCGAAGGCCC ACGGCGAATT CCTGCAGATA
TGGGACTACG TCCTCGGCCA TATTCCGCCG GAAGAAGAAC GGTGGCGGCG ATTCCACGAC
TACTACGTCA ACTGCACGCG ATCGGTCGAC GGGCAGGTCG ACCGGATCCT GCAGGAGCTC
GACGCGCTCG GTCTGACCGA CAATACGGTG ATCTGCTTCA CCTCCGACCA CGGCGAGGCG
GCGGGCGCCC ACGGCCTCCA TGGCAAGGGG CCGTTCGCCT ATGAGGAGAC GGTCCACCTG
CCGTTCTTCA TGGTCCATCC CGACGTTCGC GGCGGTCAGG ACTGCCGCGC GCTGACGGGA
CACATCGACG TCGTGCCGAC GCTGCTGTCG ATCGCCGGCG TTTCTCCTGA AAAGATCGCC
GGCATCGCGG GGCGGCAGCT GCCCGGGAAG GATTTTTCGT CGGTGCTGAC GAATCCCTCC
AGCGCGGACA TCCATGCGGT GCGCGATGCG ATCCTCTTCA CCTACAGCGG CCTCGGTGCA
AACGACGCGA CGCTGTGGAA GACGGTCGCC GAGGCCCGTG CGGCCGGCAA GAATTCGGCC
ATGGCCATTC TCAAGCAGGG CTTCAAGCCC GACATGCAGA AGCGCGGCAG CCTGCGGTCG
ACCTACGACG GACGCTACAA GTTCACGCGC TATTTCGCCC CGGCCGAGCG CAATCGACCG
ACCAATCTCA CCGATCTCTA CAAACACAAC GACGTCGAGT TGTTCGATCT GCAGAACGAT
CCGGAGGAAA TGAACAATCT GGCGATCGAC AAGGACGCCA ACGCGTCGCT GATCTCCACG
ATGAACGACA AGCTGGAACG CGTGATCAAG GCCGAGATCG GCGTCGACGA TGGACGGGAG
ATGCCCAACA TCCCGCTGAT CGAGTGGAAT ATCGATCGTC CGGATCTGTA G
 
Protein sequence
MTDQTNNTAL GSRRDFLGLA MGAVAAGTSS TVLGPTTAAA QAQPGGGSLP RKRSSRRPNI 
VFIFSDQERF ASTWPKGLSL PAHERLMRTG TTFLNHYCPA VMCTSSRAVL LTGLQTADNR
MFENCDVPWV GNLSTKIPTV GHMLRKAGYY TAYKGKWHLN RKFDTQETDR LFTKEMDDYG
FSDYFSPGDI IGHTLGGYQF DPLIASSAIT WLRRNGRPLT DDDKPWALFV SLVNPHDIMY
FNTDRPGEKV QDTGTLIKHA ARAPEHEMFK ATWDVSVPKS YKEPFDAPGR PKAHGEFLQI
WDYVLGHIPP EEERWRRFHD YYVNCTRSVD GQVDRILQEL DALGLTDNTV ICFTSDHGEA
AGAHGLHGKG PFAYEETVHL PFFMVHPDVR GGQDCRALTG HIDVVPTLLS IAGVSPEKIA
GIAGRQLPGK DFSSVLTNPS SADIHAVRDA ILFTYSGLGA NDATLWKTVA EARAAGKNSA
MAILKQGFKP DMQKRGSLRS TYDGRYKFTR YFAPAERNRP TNLTDLYKHN DVELFDLQND
PEEMNNLAID KDANASLIST MNDKLERVIK AEIGVDDGRE MPNIPLIEWN IDRPDL