Gene Rcas_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0866 
Symbol 
ID5538332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1133584 
End bp1134999 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content59% 
IMG OID640893017 
Producttyrosine phenol-lyase 
Protein accessionYP_001431000 
Protein GI156740871 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID[TIGR02618] tyrosine phenol-lyase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000757235 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCACC CAACCCCGCA AACGATGGGA CAACAATTCG GCAGGCGCTC CTGGGCGGAA 
CCGTGGAAGA TCAAGATGGT TGAGCCATTG CGGGTTACCA CCCGTGAGGA GCGTGAACGC
GCGCTGGCGG AAGCCGGGTA CAATACCTTT CTTCTGCGCT CTGAGGATGT CTATATCGAT
CTGTTGACCG ATAGCGGCAC GAACGCCATG AGCGACCGTC AGTGGGCTGC GATCATGCTT
GGCGATGAGG CGTATGCTGG CAGCCGCAAC TTCTATCGGC TCGAAGCGAC GATCCAGCAG
TACTACGGCT ATCGCTACGT TGTGCCGACC CATCAGGGGC GCGGCGCCGA GCACCTGATC
AGCCGCGCCG CTATCCGCCC CGGGCAGTAT GTTCCCGGCA ATATGTACTT CACCACCACT
CGCCTGCACC AGGAACTGGC CGGCGGCATT TTTGTCGATG TCATCATCGA TGAAGCGCAC
GATCCACAAT CACTGCACCC GTTCAAGGGG AATGTCGATC TGGACAAATT GGAAGCGCTG
ATCCGACGTG AAGGCGCGCA GAATATCGCG TATGTGAGCC TGGCGGGCAC CGTCAATATG
GCGGGCGGGC AGCCGGTGAG CATGGCGAAT GTGCGCGCGG TGCGCAGCCT GTGCGACCGG
TATGGCGTGC GCATCTTTCT CGACGCGACG CGTATGATCG AAAATGCCTT CTTCATCCAG
GAGCGTGAAG AAGGGTATGC GAACCGATCA ATTGCCGAAA TCCTGAAAGA GTTTTGCTCT
TACACCGACG GCGCCTGGAT GAGCGCCAAG AAGGATGCCT TGGTCAATAT CGGCGGATGG
CTGGCAGTTA ACGACGCCGA ATTGTTCGAC GAACTGCGCA ACCTGGTGGT GGTGTACGAA
GGGTTGCACA CCTACGGCGG GCTGGCAGGG CGCGACATGG AAGCAATGGC AGTCGGCATC
GAGGAGTCGG TGCAGGATGA CTACATACGC GCGCGGATCG GGCAGGTGCG CTACCTGGGA
GAACTGCTAA CTGATTGGGA TATTCCGATT GTCCAACCGG TCGGCGGGCA TGCGATTTTC
CTGGATGCGC GCCGCTTCTA CCCGCATATT CCGCAGCGCG AATTTCCGGC GCAGACGCTG
GCGGCTGAAT TATATCTCGA CTCAGGCATT CGTTCGATGG AGCGCGGGAT TGCCAGCGCC
GGACGCGATC CGAAGACCGG TGATCACTAC TATCCGAAAC TCGAACTCAC ACGGCTGACC
ATTCCACGGC GTGTCTATAC CCAGGCGCAT ATGGACGTGG TTGCGGAAGC GGTGAAAGCC
GTCTACGACG CGCGCGCTCA GACCCGCGGA CTGCGCATGG TGTACGAACC GAAGTACCTG
CGCTTCTTTC AGGCGCGGTT CGAGCGATTG GCGTGA
 
Protein sequence
MTHPTPQTMG QQFGRRSWAE PWKIKMVEPL RVTTREERER ALAEAGYNTF LLRSEDVYID 
LLTDSGTNAM SDRQWAAIML GDEAYAGSRN FYRLEATIQQ YYGYRYVVPT HQGRGAEHLI
SRAAIRPGQY VPGNMYFTTT RLHQELAGGI FVDVIIDEAH DPQSLHPFKG NVDLDKLEAL
IRREGAQNIA YVSLAGTVNM AGGQPVSMAN VRAVRSLCDR YGVRIFLDAT RMIENAFFIQ
EREEGYANRS IAEILKEFCS YTDGAWMSAK KDALVNIGGW LAVNDAELFD ELRNLVVVYE
GLHTYGGLAG RDMEAMAVGI EESVQDDYIR ARIGQVRYLG ELLTDWDIPI VQPVGGHAIF
LDARRFYPHI PQREFPAQTL AAELYLDSGI RSMERGIASA GRDPKTGDHY YPKLELTRLT
IPRRVYTQAH MDVVAEAVKA VYDARAQTRG LRMVYEPKYL RFFQARFERL A