Gene RoseRS_0320 details

Gene Information

Locus tag: RoseRS_0320
Symbol:
ID: 5207255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 407987
End bp: 409402
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 62%
IMG OID: 640593946
Product: tyrosine phenol-lyase
Protein accession: YP_001274702
Protein GI: 148654497
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02618] tyrosine phenol-lyase
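
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name check_lengths is hypothetical, and it assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon of the CDS is a stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive coordinate span in bp
    codons = span // 3                # whole codons in the span
    return (
        span == gene_len_bp           # span matches the reported gene length
        and span % 3 == 0             # CDS length is a multiple of three
        and codons - 1 == protein_len_aa  # drop the stop codon
    )


# Values copied from the Gene Information fields of this record.
consistent = check_lengths(407987, 409402, 1416, 471)
print(consistent)
```

With the values in this record, the span is 1416 bp, which is 472 codons, or 471 amino acids once the stop codon is excluded.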


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.666084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0468502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCACC CCGTTCCGCA AACCATGGGT CAGCAGTTCG GCAGGCGCTC CTGGGCTGAA 
CCGTGGAAGA TCAAGATGGT CGAGCCGCTG CGGGTGATCA GCCGAGAGGA GCGTGAGCGC
GCCCTGGCGG AAGCCGGGTA CAACACGTTC CTGCTCCGTT CGGAGGATGT GTACATCGAT
CTGCTGACCG ATAGCGGCAC CAACGCGATG AGCGATCGCC AGTGGGCCGC CATCATGCTG
GGCGATGAAG CGTATGCTGG CAGTCGCAAT TTCTACCGCC TCGAAGCGAC AGTGCAGCAG
TACTACGGCT ACCGCTACGT CGTCCCGACA CATCAGGGGC GCGGCGCAGA GCACCTGATC
AGTCGCGCTG CCATTCGTCC CGGACAGTAT GTGCCCGGCA ATATGTACTT CACCACCACG
CGGCTGCACC AGGAACTGGC GGGCGGCATC TTCGTCGATG TCATTATCGA CGAAGCGCAC
GATCCGCAAC ACCTGCATCC TTTCAAGGGG AATGTCGATC TCGACAAACT GGAGGCGCTG
ATCCGGCGCG AAGGAGCGCA GAACATCGCC TACGTCAGTC TGGCGGGCAC GGTGAACATG
GCTGGCGGGC AACCGGTGAG TATGGCGAAT GTGCGGGCGC TGCGCGCATT GTGCGACCGG
TACGGCATCC GCATCTTCCT CGACGCAACG CGCATGGTCG AAAATGCCTT CTTCATCCAG
GAGCGCGAAG AAGGGTACGC GCAGACGCCA ATTGCCGCCA TCCTGAAGGA GTTCTGCTCC
TACACCGACG GCGCATGGAT GAGCGCCAAA AAGGACAGCC TGGTGAACAT CGGCGGATGG
GTGGCGGTCA ACGATGCCGA CCTGTTCGAT GAACTGCGCA ACCTGGTGGT AGTCTACGAA
GGGTTGCACA CCTACGGCGG GCTGGCGGGG CGCGACATGG AAGCGATCGC CGTGGGCATC
GAAGAGTCGG TGCAGGACGA CTACATCCGG GCGCGGATCG GGCAGGTGCG CTACCTGGGC
GAACTGCTGA CCGACTGGAA CATCCCGATT GTGCAACCGG TCGGCGGGCA TGCCATCTTT
CTCGATGCGC GCCGTTTCTA CCCGCATATT CCGCAGCACG AATTTCCAGC GCAAACGCTG
GCCGCCGAAC TCTACCTCGA TTCGGGCATC CGGGCGATGG AACGGGGGAT TGCCAGCGCC
GGGCGCGATC CGAAAACCGG CGATCACTAC TACCCGAAAC TGGAACTCAC GCGCCTCACC
ATCCCACGCC GCGTCTACAC CCAGGCGCAC ATGGATGTGG TCGCCGAAGC GGTGAAGGCG
GTCTACGATG CACGTGAACA GACGCGCGGG CTACGCATGG TGTACGAACC GAAGTATCTG
CGCTTCTTCC AGGCGCGCTT CGAGCGCCTG GCGTAG
 
Protein sequence
MTHPVPQTMG QQFGRRSWAE PWKIKMVEPL RVISREERER ALAEAGYNTF LLRSEDVYID 
LLTDSGTNAM SDRQWAAIML GDEAYAGSRN FYRLEATVQQ YYGYRYVVPT HQGRGAEHLI
SRAAIRPGQY VPGNMYFTTT RLHQELAGGI FVDVIIDEAH DPQHLHPFKG NVDLDKLEAL
IRREGAQNIA YVSLAGTVNM AGGQPVSMAN VRALRALCDR YGIRIFLDAT RMVENAFFIQ
EREEGYAQTP IAAILKEFCS YTDGAWMSAK KDSLVNIGGW VAVNDADLFD ELRNLVVVYE
GLHTYGGLAG RDMEAIAVGI EESVQDDYIR ARIGQVRYLG ELLTDWNIPI VQPVGGHAIF
LDARRFYPHI PQHEFPAQTL AAELYLDSGI RAMERGIASA GRDPKTGDHY YPKLELTRLT
IPRRVYTQAH MDVVAEAVKA VYDAREQTRG LRMVYEPKYL RFFQARFERL A