Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1401 |
Symbol | |
ID | 6409058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1476020 |
End bp | 1477423 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711300 |
Product | Carotenoid oxygenase |
Protein accession | YP_001990416 |
Protein GI | 192289811 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCAGG TGACCGGAAT TCCGGATGCG TGCGACAATC TCGCGCCGAT CCCAATGGAA TGCGACGCGC CGTTCCTCAG CATCAAGGGC GAGCTGCCGC GGGAATTGAA CGGCACGTTG TATCGCAACG GCGCCAACCC GCAATTCGTC TCGCCGAACG CGCACTGGTT CTTTGGCGAC GGCATGCTGC ACGCGTTTCA TCTGGAGAAC GGCCGCGCGT CGTATCGTAA CCGCTGGGTG CGCACGCCGA AATGGCTCGC GGAGCACGAG GCGGGCCGCC CGCTCTACGG CGAGTTCAAC CTCAAGCTGC CCGATGCACC GCGCTCGGTG CCGGACGACG GCAACGTCGC CAACACCAAC ATCGTGTTCC ACGCCGGCCG GCTCCTGGCG CTGGAAGAGG CGCATCTGCC GATGCAGATC GAACGCGACA CGCTCGAGAC CCGCGGTTAC TGCGACTACG GCGGCGCGCT GAAGGGGCCA TTCACCGCTC ACCCAAAGAT CGACCCGGTG ACCGGCGAGA TGCTGTTCTT CGGCTACAAC GCCAGCGGCC CGCTGACGCG TACGATGTCG TTCGGGGCGA TCGACGCCTC GGGCAACGTC ACCCGGCTGG AACATTTCAA GGCGCCGTTT GCGGCGATGG TGCACGACTT CATCGTCACC GAGCATCATG TGCTGTTCCC GATCCTGCCG CTCACCGGCA GTATCTGGCG CGCGATGCGC GGCCGGCCGC CCTACGCCTG GGATCCGCGC AAGGGCTCGT ATGTCGGCGT GATGAAGCGC TCCGGCTCGA CCCGCGACAT CCGCTGGTTC CGCGGCGAGG CCTGCTTCGT GTTCCACGTC ATGAACGCGT GGGAGGACGG CACCCGGATC GTCGCCGACG TGATGCAGTC GGAGGAAGCG CCGCTGTTCA CCCATCCCGA TGGTCGCCGC ACCGATCCGG AGAAGGGCCG TGCGCGACTG TGCCGCTGGA GCTTCGACCT CGCCGGCAAC ACCAACGCCT TCAAGCGCAG CTATCTGGAC GAGATCAGCG GCGAATTTCC ACGGATCGAC GAGCGCCGTG CCGGCCTGCG CAGCGGCCAC GGCTGGTACG CGTGCGCCAG CCCGGAAACA CCGATGCTCG GCATGCTCAC TGGACTCGTG CATGTCGACG GCAACGGCCA TCGCCGCACT CGCTATCTGC TGCCGACCGG CGATACCATC GGCGAGCCGG TGTTCGTGCC GCGTGCGGCC GACGCAAACG AAGCCGAAGG CTGGCTGCTC GCGGTGGTGT GGCGCGGCTG CGAGAATCGC AGCGACCTTG CGGTGTTCAA TGCGACCGAC ATTGCGGCAG GCCCGATCGC CCTGGTGCAT CTCGGCCACC GCATTCCCGA CGGCTTCCAC GGCAATTGGG TGCCGGCAGG ATAA
|
Protein sequence | MLQVTGIPDA CDNLAPIPME CDAPFLSIKG ELPRELNGTL YRNGANPQFV SPNAHWFFGD GMLHAFHLEN GRASYRNRWV RTPKWLAEHE AGRPLYGEFN LKLPDAPRSV PDDGNVANTN IVFHAGRLLA LEEAHLPMQI ERDTLETRGY CDYGGALKGP FTAHPKIDPV TGEMLFFGYN ASGPLTRTMS FGAIDASGNV TRLEHFKAPF AAMVHDFIVT EHHVLFPILP LTGSIWRAMR GRPPYAWDPR KGSYVGVMKR SGSTRDIRWF RGEACFVFHV MNAWEDGTRI VADVMQSEEA PLFTHPDGRR TDPEKGRARL CRWSFDLAGN TNAFKRSYLD EISGEFPRID ERRAGLRSGH GWYACASPET PMLGMLTGLV HVDGNGHRRT RYLLPTGDTI GEPVFVPRAA DANEAEGWLL AVVWRGCENR SDLAVFNATD IAAGPIALVH LGHRIPDGFH GNWVPAG
|
| |