Gene Information |
Locus tag | Rcas_0866 |
Symbol | |
ID | 5538332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1133584 |
End bp | 1134999 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640893017 |
Product | tyrosine phenol-lyase |
Protein accession | YP_001431000 |
Protein GI | 156740871 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02618] tyrosine phenol-lyase |
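The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1133584, 1134999)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```

Under these assumptions both results agree with the Gene Length (1416 bp) and Protein Length (471 aa) rows.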
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000757235 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACCCACC CAACCCCGCA AACGATGGGA CAACAATTCG GCAGGCGCTC CTGGGCGGAA CCGTGGAAGA TCAAGATGGT TGAGCCATTG CGGGTTACCA CCCGTGAGGA GCGTGAACGC GCGCTGGCGG AAGCCGGGTA CAATACCTTT CTTCTGCGCT CTGAGGATGT CTATATCGAT CTGTTGACCG ATAGCGGCAC GAACGCCATG AGCGACCGTC AGTGGGCTGC GATCATGCTT GGCGATGAGG CGTATGCTGG CAGCCGCAAC TTCTATCGGC TCGAAGCGAC GATCCAGCAG TACTACGGCT ATCGCTACGT TGTGCCGACC CATCAGGGGC GCGGCGCCGA GCACCTGATC AGCCGCGCCG CTATCCGCCC CGGGCAGTAT GTTCCCGGCA ATATGTACTT CACCACCACT CGCCTGCACC AGGAACTGGC CGGCGGCATT TTTGTCGATG TCATCATCGA TGAAGCGCAC GATCCACAAT CACTGCACCC GTTCAAGGGG AATGTCGATC TGGACAAATT GGAAGCGCTG ATCCGACGTG AAGGCGCGCA GAATATCGCG TATGTGAGCC TGGCGGGCAC CGTCAATATG GCGGGCGGGC AGCCGGTGAG CATGGCGAAT GTGCGCGCGG TGCGCAGCCT GTGCGACCGG TATGGCGTGC GCATCTTTCT CGACGCGACG CGTATGATCG AAAATGCCTT CTTCATCCAG GAGCGTGAAG AAGGGTATGC GAACCGATCA ATTGCCGAAA TCCTGAAAGA GTTTTGCTCT TACACCGACG GCGCCTGGAT GAGCGCCAAG AAGGATGCCT TGGTCAATAT CGGCGGATGG CTGGCAGTTA ACGACGCCGA ATTGTTCGAC GAACTGCGCA ACCTGGTGGT GGTGTACGAA GGGTTGCACA CCTACGGCGG GCTGGCAGGG CGCGACATGG AAGCAATGGC AGTCGGCATC GAGGAGTCGG TGCAGGATGA CTACATACGC GCGCGGATCG GGCAGGTGCG CTACCTGGGA GAACTGCTAA CTGATTGGGA TATTCCGATT GTCCAACCGG TCGGCGGGCA TGCGATTTTC CTGGATGCGC GCCGCTTCTA CCCGCATATT CCGCAGCGCG AATTTCCGGC GCAGACGCTG GCGGCTGAAT TATATCTCGA CTCAGGCATT CGTTCGATGG AGCGCGGGAT TGCCAGCGCC GGACGCGATC CGAAGACCGG TGATCACTAC TATCCGAAAC TCGAACTCAC ACGGCTGACC ATTCCACGGC GTGTCTATAC CCAGGCGCAT ATGGACGTGG TTGCGGAAGC GGTGAAAGCC GTCTACGACG CGCGCGCTCA GACCCGCGGA CTGCGCATGG TGTACGAACC GAAGTACCTG CGCTTCTTTC AGGCGCGGTT CGAGCGATTG GCGTGA
Protein sequence | MTHPTPQTMG QQFGRRSWAE PWKIKMVEPL RVTTREERER ALAEAGYNTF LLRSEDVYID LLTDSGTNAM SDRQWAAIML GDEAYAGSRN FYRLEATIQQ YYGYRYVVPT HQGRGAEHLI SRAAIRPGQY VPGNMYFTTT RLHQELAGGI FVDVIIDEAH DPQSLHPFKG NVDLDKLEAL IRREGAQNIA YVSLAGTVNM AGGQPVSMAN VRAVRSLCDR YGVRIFLDAT RMIENAFFIQ EREEGYANRS IAEILKEFCS YTDGAWMSAK KDALVNIGGW LAVNDAELFD ELRNLVVVYE GLHTYGGLAG RDMEAMAVGI EESVQDDYIR ARIGQVRYLG ELLTDWDIPI VQPVGGHAIF LDARRFYPHI PQREFPAQTL AAELYLDSGI RSMERGIASA GRDPKTGDHY YPKLELTRLT IPRRVYTQAH MDVVAEAVKA VYDARAQTRG LRMVYEPKYL RFFQARFERL A
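The relationship between the two sequences can be illustrated by translating the gene sequence and comparing the result with the listed protein. The sketch below is an illustration, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand and that internal codons under translation table 11 are read with the standard codon assignments. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
# Table 11 is assumed to share these for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 36 bases of the gene sequence above.
head = "ATGACCCACC CAACCCCGCA AACGATGGGA"
tail = "CGCTTCTTTC AGGCGCGGTT CGAGCGATTG GCGTGA"

print(translate(head))  # MTHPTPQTMG
print(translate(tail))  # RFFQARFERLA
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content row.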