Gene Rpal_1175 details


Gene Information

Locus tag: Rpal_1175
Symbol:
ID: 6408831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1245750
End bp: 1247108
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 66%
IMG OID: 642711073
Product: Phthalate 4,5-dioxygenase
Protein accession: YP_001990190
Protein GI: 192289585
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
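
The coordinate, gene length, and protein length fields in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1245750
end_bp = 1247108

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon and does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, "bp")      # 1359 bp
print(protein_length_aa, "aa")   # 452 aa
```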


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.769494
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGCC AGGAACAGAA CGATCTGATC ACCCGGGTCG GGCCGGGGAC GCCGTGCGGC 
AAGCTGATGC GCGCCTATTG GCAGCCGGCC GCGCTCGTCG ACGAGCTCGA AGGCGAGCGG
CCGATCAAGC CGGTGCGACT GCTCGGCGAA GACCTCGTGC TGTTCAAGGA CGAGACCGGC
CGCTACGGCC TGATCGATCG CGACTGTCCG CACCGCGGCG CCGATCTCGC GTTCGGGCGG
CTGGAGAACG GCGGCCTGCG CTGCGCCTTC CACGGTTGGC TGTTCGACGT CGACGGCAAG
TGCATCGACA CCCCCGCCGA GCCTGCCGGC TCGCCGCTGT GCAAGAACAT CAAGCAGCGC
GCGTTTCCGG TCGTCGCCAA GGGCGGCATC CTGTGGGCCT ATCTCGGCGC GGGCGAACCG
CCGGCGTTTC CGGAGATCGA TTGCTTCATC GCCCCCGACA CCCATGTGTT CGCGTTCAAG
GGCCTGATGG AATGCAACTG GCTGCAGGCG CTCGAGGTCG GCATCGATCC GGCGCACGCC
TCGTTCCTGC ACCGCTTCTT CGAGGATGAG GACACCTCGC AGGCCTACGG CAAGCAATTC
CGCGGTGCCT CGGCCGGCAG CGATCTGCCG ATGACCAAGG TGCTGCGCGA ATACGATCGC
CCGATCATCA ATGTCGAGCA CACCGAATAC GGCTTGCGGC TGATCGCGCT ACGCGAGATC
GACGACGAAC GCACCCATGT TCGCGTCACC AATCAGCTGT TCCCGCACGG CTTCGTCATC
CCGATGAGCA CAGAGATGAC GATCACGCAA TGGCACGTGC CGGTCGACGA CACCCACTGC
TATTGGTATG CGATCTTCAC CAGCTACGCC GCGCCGGTCG ATAAGGTGAA GATGCGCGAC
CAGCGCCTCG AGCTCTACGA GTTGCCGGAC TACAAGTCTC GCCGCAACAA GACCAACGAT
TACGGCTTCG ATCCGCACGA GCAGGCGACC GCGACCTACA CCGGCATGGG GCTGGACATC
AACGTCCACG ATCAGTGGGC GGTGGAGTCG ATGGGCGCGA TCCAGGACCG CACCCGCGAG
CATCTCGGCC AGTCCGACAA GGCGATCATT CAGTATCGCC GGCTGCTGCG TCAGGAAATC
GAGAAGGCCG CCTCCGGTGG CAAGCCGTTG CTGGCGCTCG ACGAGGCCGC GGCGCGCGCG
ATCCAGGGAC CGGCCACGAT GGACGGCATC GGCCCGAGCC GCGGCTGGGA GACCTATTGG
ATGGAGGTCG ACGTCAAGCG TCGCCGCGGT GCGCCCTGGG CGGCACCGGT GCCGTCCGAG
ATCGCCGCCA AGGTCCCGCA TCTGACGGCC GCAGAATGA
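
The GC content field of this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases and that the spaces and line breaks in the listing are only formatting. The function name `gc_content` is hypothetical, and the example runs on the first row of the sequence only, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and newlines used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste all rows for the full gene.
first_row = "ATGATGAGCC AGGAACAGAA CGATCTGATC ACCCGGGTCG GGCCGGGGAC GCCGTGCGGC"
print(f"GC content of the first row: {gc_content(first_row):.1f}%")
```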
 
Protein sequence
MMSQEQNDLI TRVGPGTPCG KLMRAYWQPA ALVDELEGER PIKPVRLLGE DLVLFKDETG 
RYGLIDRDCP HRGADLAFGR LENGGLRCAF HGWLFDVDGK CIDTPAEPAG SPLCKNIKQR
AFPVVAKGGI LWAYLGAGEP PAFPEIDCFI APDTHVFAFK GLMECNWLQA LEVGIDPAHA
SFLHRFFEDE DTSQAYGKQF RGASAGSDLP MTKVLREYDR PIINVEHTEY GLRLIALREI
DDERTHVRVT NQLFPHGFVI PMSTEMTITQ WHVPVDDTHC YWYAIFTSYA APVDKVKMRD
QRLELYELPD YKSRRNKTND YGFDPHEQAT ATYTGMGLDI NVHDQWAVES MGAIQDRTRE
HLGQSDKAII QYRRLLRQEI EKAASGGKPL LALDEAAARA IQGPATMDGI GPSRGWETYW
MEVDVKRRRG APWAAPVPSE IAAKVPHLTA AE
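
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the conventional TCAG ordering and translates until the first stop codon. It assumes the amino-acid assignments of translation table 11 match those of the standard code (the two are understood to differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...), paired
# with the amino acids of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove the spaces and newlines used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATGAGCC AGGAACAGAA CGATCTGATC"))  # MMSQEQNDLI
```

The output matches the first ten residues of the protein sequence listed above. Likewise, the last nine bases of the gene (GCAGAATGA) translate to AE followed by a stop, in agreement with the end of the protein.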