Gene RoseRS_2643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2643 
Symbol 
ID5209612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3278938 
End bp3280179 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID640596245 
Productlycopene cyclase family protein 
Protein accessionYP_001276967 
Protein GI148656762 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATGC ATGATGTTCT CGTCGTCGGC GCCGGTCCGA CCGGCATGGC AATCGCTGCT 
GCGCTCAGCG CCACAGGACT GCGCGTGGCG GGGCTTGCGG CGGCGCCGCC GACGAAACCA
TGGCAGAATA CCTACGGCGT GTGGCTCGAT GAATTGCCGA CGCCGGAATT GCGCGACACG
CTGGGGCATC GCTGGTCGGA TGTTGTGGTG TGCGTCGGCG AGCGCACCAT TGCCCTTGAT
CGTGCGTATG GGTTGTTCGA CAACCCGCGC TTGCAGCAGT ACCTCCTCGA CCAATGCGAG
CGCCACGGTG TCACGTGGTC TGCCGGGATT GCGGCGCGCG TCGAGCATCA GGCGACGCAT
TCCCTGGTGA CCACGCGTGA TGGGCGTGTT GTTGCGGCAC GGTTGGTGGT GGATGCCAGC
GGTCATTCAC CGGCGCTGCT GCGTCGCCCT GCAACATCGC ACGTGGCGCG TCAGGCGGCG
TATGGCATCG TCGGTGTCTT CTCCGCCCCG CCGATTCAGC CGAATCGAAT GGTGCTGATG
GACTACCGCG CTGATCATCT GACCGCTGAG GAACGGCGTG AGCCGCCAAC CTTTTTGTAC
GCCATGGATC TGGGGGACGG ACAATTTTTT GTTGAGGAAA CGTCGCTGGC GCATGTGCCC
GGCTTACCGC TCACCACGCT CGAACAACGG TTGCAGCGCC GGTTGACCGC CAGAGGTGTG
ACGGTGCAGC AGGTTGTGCA TATCGAGCGG TGTCTGTTCC CGATGAATAA TCCGTTACCG
TACCTCGATC AGCCGATGAT CGGGTTTGGC GGTGCAGCGA GTATGGTGCA TCCCCCGTCG
GGGTATATGG TCGGCAAGGC GCTGCGCCGT GCCCCTGAGG TTGCGCAGGC GATTGCTCGC
GCATTAGGCG CAGCGGACGC TACCCCGCGC AGCGCTGCCC GTGCCGGATG GCGGGCGCTC
TGGTCACCGG CGCGCCTGCG TCGCAGGCAG TTGTACCTGT TCGGGCTGGC GAGCCTGATG
CGCTGCGACA GCGCAACAAT CCAGGAATTT TTCGCTCTTT TTTTCAGTCT GCCGCGTCAC
GAATGGATGG GGTATCTATC GGACACGTTG AGCACTGTCG AGTTAGCGCG CACGATGCTG
CGTCTGTTCA TCCGCGCGCC CGGAAATGTG CGCCGAACCC TGATGGCGGC TGCGGGCGCA
GAACATGCGC TGCTGCGCCG TGCGGCACTT GGTCAGGCTT GA
 
Protein sequence
MTMHDVLVVG AGPTGMAIAA ALSATGLRVA GLAAAPPTKP WQNTYGVWLD ELPTPELRDT 
LGHRWSDVVV CVGERTIALD RAYGLFDNPR LQQYLLDQCE RHGVTWSAGI AARVEHQATH
SLVTTRDGRV VAARLVVDAS GHSPALLRRP ATSHVARQAA YGIVGVFSAP PIQPNRMVLM
DYRADHLTAE ERREPPTFLY AMDLGDGQFF VEETSLAHVP GLPLTTLEQR LQRRLTARGV
TVQQVVHIER CLFPMNNPLP YLDQPMIGFG GAASMVHPPS GYMVGKALRR APEVAQAIAR
ALGAADATPR SAARAGWRAL WSPARLRRRQ LYLFGLASLM RCDSATIQEF FALFFSLPRH
EWMGYLSDTL STVELARTML RLFIRAPGNV RRTLMAAAGA EHALLRRAAL GQA