Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0320 |
Symbol | |
ID | 5207255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 407987 |
End bp | 409402 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640593946 |
Product | tyrosine phenol-lyase |
Protein accession | YP_001274702 |
Protein GI | 148654497 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02618] tyrosine phenol-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.666084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0468502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCACC CCGTTCCGCA AACCATGGGT CAGCAGTTCG GCAGGCGCTC CTGGGCTGAA CCGTGGAAGA TCAAGATGGT CGAGCCGCTG CGGGTGATCA GCCGAGAGGA GCGTGAGCGC GCCCTGGCGG AAGCCGGGTA CAACACGTTC CTGCTCCGTT CGGAGGATGT GTACATCGAT CTGCTGACCG ATAGCGGCAC CAACGCGATG AGCGATCGCC AGTGGGCCGC CATCATGCTG GGCGATGAAG CGTATGCTGG CAGTCGCAAT TTCTACCGCC TCGAAGCGAC AGTGCAGCAG TACTACGGCT ACCGCTACGT CGTCCCGACA CATCAGGGGC GCGGCGCAGA GCACCTGATC AGTCGCGCTG CCATTCGTCC CGGACAGTAT GTGCCCGGCA ATATGTACTT CACCACCACG CGGCTGCACC AGGAACTGGC GGGCGGCATC TTCGTCGATG TCATTATCGA CGAAGCGCAC GATCCGCAAC ACCTGCATCC TTTCAAGGGG AATGTCGATC TCGACAAACT GGAGGCGCTG ATCCGGCGCG AAGGAGCGCA GAACATCGCC TACGTCAGTC TGGCGGGCAC GGTGAACATG GCTGGCGGGC AACCGGTGAG TATGGCGAAT GTGCGGGCGC TGCGCGCATT GTGCGACCGG TACGGCATCC GCATCTTCCT CGACGCAACG CGCATGGTCG AAAATGCCTT CTTCATCCAG GAGCGCGAAG AAGGGTACGC GCAGACGCCA ATTGCCGCCA TCCTGAAGGA GTTCTGCTCC TACACCGACG GCGCATGGAT GAGCGCCAAA AAGGACAGCC TGGTGAACAT CGGCGGATGG GTGGCGGTCA ACGATGCCGA CCTGTTCGAT GAACTGCGCA ACCTGGTGGT AGTCTACGAA GGGTTGCACA CCTACGGCGG GCTGGCGGGG CGCGACATGG AAGCGATCGC CGTGGGCATC GAAGAGTCGG TGCAGGACGA CTACATCCGG GCGCGGATCG GGCAGGTGCG CTACCTGGGC GAACTGCTGA CCGACTGGAA CATCCCGATT GTGCAACCGG TCGGCGGGCA TGCCATCTTT CTCGATGCGC GCCGTTTCTA CCCGCATATT CCGCAGCACG AATTTCCAGC GCAAACGCTG GCCGCCGAAC TCTACCTCGA TTCGGGCATC CGGGCGATGG AACGGGGGAT TGCCAGCGCC GGGCGCGATC CGAAAACCGG CGATCACTAC TACCCGAAAC TGGAACTCAC GCGCCTCACC ATCCCACGCC GCGTCTACAC CCAGGCGCAC ATGGATGTGG TCGCCGAAGC GGTGAAGGCG GTCTACGATG CACGTGAACA GACGCGCGGG CTACGCATGG TGTACGAACC GAAGTATCTG CGCTTCTTCC AGGCGCGCTT CGAGCGCCTG GCGTAG
|
Protein sequence | MTHPVPQTMG QQFGRRSWAE PWKIKMVEPL RVISREERER ALAEAGYNTF LLRSEDVYID LLTDSGTNAM SDRQWAAIML GDEAYAGSRN FYRLEATVQQ YYGYRYVVPT HQGRGAEHLI SRAAIRPGQY VPGNMYFTTT RLHQELAGGI FVDVIIDEAH DPQHLHPFKG NVDLDKLEAL IRREGAQNIA YVSLAGTVNM AGGQPVSMAN VRALRALCDR YGIRIFLDAT RMVENAFFIQ EREEGYAQTP IAAILKEFCS YTDGAWMSAK KDSLVNIGGW VAVNDADLFD ELRNLVVVYE GLHTYGGLAG RDMEAIAVGI EESVQDDYIR ARIGQVRYLG ELLTDWNIPI VQPVGGHAIF LDARRFYPHI PQHEFPAQTL AAELYLDSGI RAMERGIASA GRDPKTGDHY YPKLELTRLT IPRRVYTQAH MDVVAEAVKA VYDAREQTRG LRMVYEPKYL RFFQARFERL A
|
| |