Gene Information |
Locus tag | Sare_2557 |
Symbol | |
ID | 5708423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2911964 |
End bp | 2913214 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641272020 |
Product | lycopene beta and epsilon cyclase |
Protein accession | YP_001537390 |
Protein GI | 159038137 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
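The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the reported Gene Length is consistent with it) and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical and only serve the illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(2911964, 2913214)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields above.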
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.329611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000286937 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCCCACCG CCCCACCGGT CGACGTCGAC CTCGCCCTGG TCGGTGGCGG CGGCGCGGGG TCACTCGTGC TCGCCGCCCT GGATCGGTGC GGCGTGCGCG GTCTGCGGGT GGCGGTGGTC GATCCGGTGC GTCGTCGCGG ACAGGACCGG ACCTGGGCGT TCTGGGGCCG TCCCGACGAC AGGCTCGATT CGCTGTTGGC GGCGAGCTGG TCACAGGTGG AGGTGGCGAC ACCTGGGCGG CGCCGCGTGC TCGACCTCGC CCCGCTGCGG TACGCGATGC TGCGCTCGGC CGCGGTCTAC GACCAGGCCG CCACCGCCGA GCACCGCCTC GGCGCGGTCC GCGTCGCCGC CCCGGCCCGA GCTGTCGAGG ACGATGGTGA CCGGGTGACG GTCCGCGCCG GCGGGGCGAC TGTGCGCGCG TCCTGGGTCC TCGACTCCCG GCCCCGGCCG CCCCGCCGGG TCGGGCGCAC CAACTGGCTG CAACACTTCC GCGGCTGGTG GCTCGAGGCC GATCGGCCGC TGTTCGACCC GGGTCGGGCG GTGCTGATGG ATTTCCGCAC CCCGCAGCCG GCACGGGGCG TGTCGTTCGG GTACCTGTTG CCGGTCACCG ACCGGTACGC GCTGGTCGAG TACACCGAGT TCACTCCCGG CCTGCTCACC GACGCCGGCT ACGACGCGGC GTTGGCTGGG TACCGGGACC AACTCGGGCT GGACCCGGGC CGGCTGAGGG TGCGGGAGGT GGAGAACGGG GTGATCCCGA TGACCGATGG CCGGTTCGAC CTGCGGCCGT CGCCCCGGGT GGTCCGTCTC GGTACCGCCG GTGGTGCCAC CCGACCCTCC ACCGGCTTCA CGTTCGCCGC CATGTACCGG CAGGCGGGGC AGATCGCCGA GGCGCTCGCC GCCGGGCGGG CGCCGGTCCC GGCGCCCGCG TACCCCCGCC GACACCGGTG GCTGGACGCG GTCGCGCTGC GGGCACTGGA CCGCGGTGGG GTGGGCGGTC CGGACTTCTT CGATCGGCTC TTCGACCGCA ATCCGGCCGA GCGGGTACTG CGCTTCCTTG ACGGGACCAC CAGCCCGGCC GAGGAGGTGG CGCTGATGGG CACCACCCGG CTATCGCCGA TGGTCGCCGC CACCGTCGGT GACGCCGCCG CCCGGCTGCG GGACCGGGTC GTGCCGTGGC GGCGTCCCAC CACCTGGCAG ATACCGCCGC CGGTGACCGG CGCGACGCGG CGCTCCCCGC CGCAGAGGTG A
|
Protein sequence | MPTAPPVDVD LALVGGGGAG SLVLAALDRC GVRGLRVAVV DPVRRRGQDR TWAFWGRPDD RLDSLLAASW SQVEVATPGR RRVLDLAPLR YAMLRSAAVY DQAATAEHRL GAVRVAAPAR AVEDDGDRVT VRAGGATVRA SWVLDSRPRP PRRVGRTNWL QHFRGWWLEA DRPLFDPGRA VLMDFRTPQP ARGVSFGYLL PVTDRYALVE YTEFTPGLLT DAGYDAALAG YRDQLGLDPG RLRVREVENG VIPMTDGRFD LRPSPRVVRL GTAGGATRPS TGFTFAAMYR QAGQIAEALA AGRAPVPAPA YPRRHRWLDA VALRALDRGG VGGPDFFDRL FDRNPAERVL RFLDGTTSPA EEVALMGTTR LSPMVAATVG DAAARLRDRV VPWRRPTTWQ IPPPVTGATR RSPPQR
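The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted start codons, and an initiator codon is translated as methionine. The following minimal sketch illustrates this on the first 30 bases of the gene sequence and shows one way to strip the 10-base grouping and compute a GC fraction. The helper names are hypothetical, and the codon table is deliberately partial: it covers only the codons that occur in this short prefix, so it is not a general translator.

```python
# Start codons of NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

# Partial codon table (assumption: illustration only, covers the demo prefix).
PARTIAL_CODONS = {
    "GTG": "V", "GTC": "V", "CCC": "P", "CCA": "P",
    "CCG": "P", "ACC": "T", "GCC": "A", "GAC": "D",
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate_prefix(seq: str) -> str:
    """Translate complete codons; a valid start codon at position 0 becomes M."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE_11_STARTS:
            residues.append("M")
        else:
            residues.append(PARTIAL_CODONS[codon])
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
prefix = clean_sequence("GTGCCCACCG CCCCACCGGT CGACGTCGAC")
print(len(prefix))               # 30
print(translate_prefix(prefix))  # MPTAPPVDVD
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. Running `gc_fraction` on the full cleaned gene sequence would be the way to compare against the GC content field.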