Gene Rpal_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1401 
Symbol 
ID6409058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1476020 
End bp1477423 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content66% 
IMG OID642711300 
ProductCarotenoid oxygenase 
Protein accessionYP_001990416 
Protein GI192289811 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCAGG TGACCGGAAT TCCGGATGCG TGCGACAATC TCGCGCCGAT CCCAATGGAA 
TGCGACGCGC CGTTCCTCAG CATCAAGGGC GAGCTGCCGC GGGAATTGAA CGGCACGTTG
TATCGCAACG GCGCCAACCC GCAATTCGTC TCGCCGAACG CGCACTGGTT CTTTGGCGAC
GGCATGCTGC ACGCGTTTCA TCTGGAGAAC GGCCGCGCGT CGTATCGTAA CCGCTGGGTG
CGCACGCCGA AATGGCTCGC GGAGCACGAG GCGGGCCGCC CGCTCTACGG CGAGTTCAAC
CTCAAGCTGC CCGATGCACC GCGCTCGGTG CCGGACGACG GCAACGTCGC CAACACCAAC
ATCGTGTTCC ACGCCGGCCG GCTCCTGGCG CTGGAAGAGG CGCATCTGCC GATGCAGATC
GAACGCGACA CGCTCGAGAC CCGCGGTTAC TGCGACTACG GCGGCGCGCT GAAGGGGCCA
TTCACCGCTC ACCCAAAGAT CGACCCGGTG ACCGGCGAGA TGCTGTTCTT CGGCTACAAC
GCCAGCGGCC CGCTGACGCG TACGATGTCG TTCGGGGCGA TCGACGCCTC GGGCAACGTC
ACCCGGCTGG AACATTTCAA GGCGCCGTTT GCGGCGATGG TGCACGACTT CATCGTCACC
GAGCATCATG TGCTGTTCCC GATCCTGCCG CTCACCGGCA GTATCTGGCG CGCGATGCGC
GGCCGGCCGC CCTACGCCTG GGATCCGCGC AAGGGCTCGT ATGTCGGCGT GATGAAGCGC
TCCGGCTCGA CCCGCGACAT CCGCTGGTTC CGCGGCGAGG CCTGCTTCGT GTTCCACGTC
ATGAACGCGT GGGAGGACGG CACCCGGATC GTCGCCGACG TGATGCAGTC GGAGGAAGCG
CCGCTGTTCA CCCATCCCGA TGGTCGCCGC ACCGATCCGG AGAAGGGCCG TGCGCGACTG
TGCCGCTGGA GCTTCGACCT CGCCGGCAAC ACCAACGCCT TCAAGCGCAG CTATCTGGAC
GAGATCAGCG GCGAATTTCC ACGGATCGAC GAGCGCCGTG CCGGCCTGCG CAGCGGCCAC
GGCTGGTACG CGTGCGCCAG CCCGGAAACA CCGATGCTCG GCATGCTCAC TGGACTCGTG
CATGTCGACG GCAACGGCCA TCGCCGCACT CGCTATCTGC TGCCGACCGG CGATACCATC
GGCGAGCCGG TGTTCGTGCC GCGTGCGGCC GACGCAAACG AAGCCGAAGG CTGGCTGCTC
GCGGTGGTGT GGCGCGGCTG CGAGAATCGC AGCGACCTTG CGGTGTTCAA TGCGACCGAC
ATTGCGGCAG GCCCGATCGC CCTGGTGCAT CTCGGCCACC GCATTCCCGA CGGCTTCCAC
GGCAATTGGG TGCCGGCAGG ATAA
 
Protein sequence
MLQVTGIPDA CDNLAPIPME CDAPFLSIKG ELPRELNGTL YRNGANPQFV SPNAHWFFGD 
GMLHAFHLEN GRASYRNRWV RTPKWLAEHE AGRPLYGEFN LKLPDAPRSV PDDGNVANTN
IVFHAGRLLA LEEAHLPMQI ERDTLETRGY CDYGGALKGP FTAHPKIDPV TGEMLFFGYN
ASGPLTRTMS FGAIDASGNV TRLEHFKAPF AAMVHDFIVT EHHVLFPILP LTGSIWRAMR
GRPPYAWDPR KGSYVGVMKR SGSTRDIRWF RGEACFVFHV MNAWEDGTRI VADVMQSEEA
PLFTHPDGRR TDPEKGRARL CRWSFDLAGN TNAFKRSYLD EISGEFPRID ERRAGLRSGH
GWYACASPET PMLGMLTGLV HVDGNGHRRT RYLLPTGDTI GEPVFVPRAA DANEAEGWLL
AVVWRGCENR SDLAVFNATD IAAGPIALVH LGHRIPDGFH GNWVPAG