Gene Sare_2557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2557 
Symbol 
ID5708423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2911964 
End bp2913214 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content75% 
IMG OID641272020 
Productlycopene beta and epsilon cyclase 
Protein accessionYP_001537390 
Protein GI159038137 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.329611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000286937 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCCCACCG CCCCACCGGT CGACGTCGAC CTCGCCCTGG TCGGTGGCGG CGGCGCGGGG 
TCACTCGTGC TCGCCGCCCT GGATCGGTGC GGCGTGCGCG GTCTGCGGGT GGCGGTGGTC
GATCCGGTGC GTCGTCGCGG ACAGGACCGG ACCTGGGCGT TCTGGGGCCG TCCCGACGAC
AGGCTCGATT CGCTGTTGGC GGCGAGCTGG TCACAGGTGG AGGTGGCGAC ACCTGGGCGG
CGCCGCGTGC TCGACCTCGC CCCGCTGCGG TACGCGATGC TGCGCTCGGC CGCGGTCTAC
GACCAGGCCG CCACCGCCGA GCACCGCCTC GGCGCGGTCC GCGTCGCCGC CCCGGCCCGA
GCTGTCGAGG ACGATGGTGA CCGGGTGACG GTCCGCGCCG GCGGGGCGAC TGTGCGCGCG
TCCTGGGTCC TCGACTCCCG GCCCCGGCCG CCCCGCCGGG TCGGGCGCAC CAACTGGCTG
CAACACTTCC GCGGCTGGTG GCTCGAGGCC GATCGGCCGC TGTTCGACCC GGGTCGGGCG
GTGCTGATGG ATTTCCGCAC CCCGCAGCCG GCACGGGGCG TGTCGTTCGG GTACCTGTTG
CCGGTCACCG ACCGGTACGC GCTGGTCGAG TACACCGAGT TCACTCCCGG CCTGCTCACC
GACGCCGGCT ACGACGCGGC GTTGGCTGGG TACCGGGACC AACTCGGGCT GGACCCGGGC
CGGCTGAGGG TGCGGGAGGT GGAGAACGGG GTGATCCCGA TGACCGATGG CCGGTTCGAC
CTGCGGCCGT CGCCCCGGGT GGTCCGTCTC GGTACCGCCG GTGGTGCCAC CCGACCCTCC
ACCGGCTTCA CGTTCGCCGC CATGTACCGG CAGGCGGGGC AGATCGCCGA GGCGCTCGCC
GCCGGGCGGG CGCCGGTCCC GGCGCCCGCG TACCCCCGCC GACACCGGTG GCTGGACGCG
GTCGCGCTGC GGGCACTGGA CCGCGGTGGG GTGGGCGGTC CGGACTTCTT CGATCGGCTC
TTCGACCGCA ATCCGGCCGA GCGGGTACTG CGCTTCCTTG ACGGGACCAC CAGCCCGGCC
GAGGAGGTGG CGCTGATGGG CACCACCCGG CTATCGCCGA TGGTCGCCGC CACCGTCGGT
GACGCCGCCG CCCGGCTGCG GGACCGGGTC GTGCCGTGGC GGCGTCCCAC CACCTGGCAG
ATACCGCCGC CGGTGACCGG CGCGACGCGG CGCTCCCCGC CGCAGAGGTG A
 
Protein sequence
MPTAPPVDVD LALVGGGGAG SLVLAALDRC GVRGLRVAVV DPVRRRGQDR TWAFWGRPDD 
RLDSLLAASW SQVEVATPGR RRVLDLAPLR YAMLRSAAVY DQAATAEHRL GAVRVAAPAR
AVEDDGDRVT VRAGGATVRA SWVLDSRPRP PRRVGRTNWL QHFRGWWLEA DRPLFDPGRA
VLMDFRTPQP ARGVSFGYLL PVTDRYALVE YTEFTPGLLT DAGYDAALAG YRDQLGLDPG
RLRVREVENG VIPMTDGRFD LRPSPRVVRL GTAGGATRPS TGFTFAAMYR QAGQIAEALA
AGRAPVPAPA YPRRHRWLDA VALRALDRGG VGGPDFFDRL FDRNPAERVL RFLDGTTSPA
EEVALMGTTR LSPMVAATVG DAAARLRDRV VPWRRPTTWQ IPPPVTGATR RSPPQR