Gene Rpal_1574 details


Gene Information

Locus tag: Rpal_1574
Symbol:
ID: 6409231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1683348
End bp: 1685012
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 64%
IMG OID: 642711466
Product: sulfatase
Protein accession: YP_001990581
Protein GI: 192289976
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
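
The length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record; 1-based inclusive numbering is an assumption.
START_BP = 1683348
END_BP = 1685012


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end (both ends included)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both values agree with the Gene Length and Protein Length fields of the record.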


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCA AGCGCAACGT GCTGTTCATC ATGTGCGACC AGCTGCGATA CGACTACCTA 
GGCATATCTG GTCATCCGCG ATTGAAGACC CCGAACATCG ATGCACTGGC GCGACGCGGC
GTCCGCTTCT TCAATGCCTA TGTGCAGTCG ACGATCTGCG GCCCGTCCCG GATGAGCACC
TATACCGGCC GTTATGTGCG GTCCCATGGT TCGACCTGGA ACGGCATCCC GCTGCGGGTC
GGTGAACCGA CCCTGGGTGA TCATCTCAAA GAGATCGGCG TCCGTAACGT CCTGGTCGGC
AAGACCCACA TGGTGCCGGA TCGCGAGGGC ATGGCGCGGC TCGGCATCGT GCCGGATTCG
CTGATCGGCG TTCACGTCTC GCAATGCGGC TTCGAGCCGT ACGAGCGCGA CGACGGTCTG
CATCCGGACG GTCCATACGA TCCGGCGCCG GACTACGACG CCTATCTGCG CAGCCAGGGC
TTCGACGCCG GCAACCCGTG GGAGGCGTGG GCGAATTCCG CCGAGGGCGG CGACGGCGAA
CTGCTCAGCG GCTGGCTGCT CTCGCATGCC GACAAGCCGG CGCGCGTCCC CGATGAACAT
TCGGAAACGC CCTATATCAC CCGCCGGGCG ATCGAGTTCA TCGGCGAGGC GGAAGCCGAT
GGGCGGCCAT GGTGTTTGCA CCTGTCATAC ATCAAGCCGC ACTGGCCCTA TATCGTGCCG
GCGCCGTATC ACGATCGCTA CGGTGCAGAC GACGTCCTGC CGGTGGTGCG GTCCGATCGT
GAGCGGCAGC ACCCGCATCC GATCTTCGCC GAGTTTCAGC ACGAGCGCGT GTCCCAGGCG
TTCTCGCGGC CGGGCGTACG TGAACGGGTA ATCCCGGCCT ATATGGGGTT GATCGAACAG
ATCGACGACC AACTCGGGCT GCTGTTCGCC TATCTCGACG AACGCGGACT GACCGACGAC
ACCCTGATCG TGTTCACCTC CGATCACGGC GATTATCTCG GCGACCACTG GCTCGGCGAG
AAGCAGATGT TTCACGACGT CTCGGTGAAG GTACCGTTGA TCGTGGTCGA TCCGTCGCCT
GCAGCCGACG CTACGCGCGG CACGGTTTCG GAGGCGCTGG TCGAGCAGAT CGATCTGGCG
CCGACCTTCC TCGATTACTT CGGCGGCCGG CCCAAGCCGC ACATTCTCGA GGGACGGTCG
CTGCTGCCGC TGCTGCGCTG CGAGCGCGTC GAAAACTGGC GATCCTACGT CTTCTCGGAA
TACGACTACG CACTGGATCG CGCTCGCATC TCGCTCGGAA CGCCGGTGCC TGATTGTCGG
CTGACGATGG TGGCCGATGG TCGCTGGAAG GCGGTGTTCG TCGAGGGATT CCGCCCGATG
CTGTTCGACG TCGACAATGA TCCGCACGAA TTCGACGATC TTGGTGACAG CGAAGATCAT
GCCGAGGTCC GGCAGCGCCT GTCCGATGCG TTCTTCGCCT GGGCACGGCG GCCGCGCAGC
CGAATCACTC GCTCGGACGA TGCAATTGCC GCGAAGGATG AGGCGCAGCG TGCCTACGAT
CGCAATATCG AATCCGGCGT CCTGATCGGC TATTGGGACG AGACGGAGCT CGCCGAGGAA
CGCGCCAAGC GCGCCCGATA TCTGGCATTG CGCCGACCAG ACTGA
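
The GC content field of the record can be checked against the gene sequence with a short calculation. The sketch below is one possible way to do it, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name. It ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators, newlines) is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Example on the first block of the gene sequence only.
print(gc_content("ATGACCACCA"))  # 50.0
```

Passing the full gene sequence to `gc_content` and rounding the result gives a value to compare with the reported GC content.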
 
Protein sequence
MTTKRNVLFI MCDQLRYDYL GISGHPRLKT PNIDALARRG VRFFNAYVQS TICGPSRMST 
YTGRYVRSHG STWNGIPLRV GEPTLGDHLK EIGVRNVLVG KTHMVPDREG MARLGIVPDS
LIGVHVSQCG FEPYERDDGL HPDGPYDPAP DYDAYLRSQG FDAGNPWEAW ANSAEGGDGE
LLSGWLLSHA DKPARVPDEH SETPYITRRA IEFIGEAEAD GRPWCLHLSY IKPHWPYIVP
APYHDRYGAD DVLPVVRSDR ERQHPHPIFA EFQHERVSQA FSRPGVRERV IPAYMGLIEQ
IDDQLGLLFA YLDERGLTDD TLIVFTSDHG DYLGDHWLGE KQMFHDVSVK VPLIVVDPSP
AADATRGTVS EALVEQIDLA PTFLDYFGGR PKPHILEGRS LLPLLRCERV ENWRSYVFSE
YDYALDRARI SLGTPVPDCR LTMVADGRWK AVFVEGFRPM LFDVDNDPHE FDDLGDSEDH
AEVRQRLSDA FFAWARRPRS RITRSDDAIA AKDEAQRAYD RNIESGVLIG YWDETELAEE
RAKRARYLAL RRPD
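
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this with a hand-built codon table. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and a library such as Biopython could be used instead.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G ordering.
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, dropping one trailing stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG) are not
    converted to M, so this is only exact for genes starting with ATG.
    """
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )
    # Internal stops, if any, are left in place as '*'.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence.
print(translate("ATGACCACCA AGCGCAACGT GCTGTTCATC"))  # MTTKRNVLFI
```

The printed residues match the first block of the protein sequence; applying `translate` to the whole gene sequence allows the same comparison for the full 554 residues.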