Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3438 |
Symbol | |
ID | 7269663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4177124 |
End bp | 4178548 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643568248 |
Product | tyrosine phenol-lyase |
Protein accession | YP_002464716 |
Protein GI | 219850283 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02618] tyrosine phenol-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00561885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.374113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATGG AACCAGACTT CCCACGCACG ATGGGTCAGC AGTTTGGCCG CCGGTCGTGG GCCGAGCCGT GGAAGATCAA GACGGTCGAG CCGCTGCGGA TCATTAGCCG GGCCGAGCGC GCGGCAGCGC TGAAAGCCGC CGGCTACAAC ACCTTTTTAC TCCGTTCGGA AGATGTTTAT ATCGATCTGC TCACCGATAG CGGCACCAAT GCTATGAGCG ACCGGCAGTG GGCAGCGTTG ATGATGGGTG ACGAGGCGTA TGCCGGTAGC CGCAGTTTCT ATCGGTTGGA AGCGGCAGTG CAGCAGGCTT ACGGCTATCG TCATGTCATT CCCACCCACC AAGGCCGTGG CGCCGAGCAT CTGATCAGCC GGATCGCTAT TCAACCCGGT CAGTATGTGC CCGGTAATAT GTATTTCACT ACTACCCGTC TCCACCAAGA ACTCGCCGGT GGCATATTTG TTGATGTGAT TATCGATGAA GCTCACGATC CGCAGAGCCA ATATCCCTTC AAGGGGAATG TCGATCTCGA TAAGCTGCAA ACGCTGATCA ATCAAGTTGG CGCAAAACAG ATTGCCTACG TCAGCCTCGC CGGTACGGTC AATATGGCCG GCGGCCAGCC GGTCAGTATG GCGAACGTGC GCGCTTTGCG CGAATTGTGC GACCGCTACG GCATCCGCAT TTTTCTCGAC GCGACCCGGT TGGTTGAGAA TGCCTTCTTC ATCAAAGAGC GCGAACCCGG CTACGCGAAT CATACTATCG CCGAAATCGT GCGCGAGTTT TGTAGCTATA CCGACGGTGC GTGGATGAGC GCCAAAAAGG ATAGTCTGGT CAACATCGGG GGCTGGCTGG CGCTGAACGA CGATCAGCTT GCCGATGAGG CGCGCAATCT GGTGGTGGTG TACGAGGGGT TGCATACCTA CGGCGGGATG GCCGGGCGCG ATATGGAAGC ATTGGCAGTT GGAATTGAGG AATCATTACA AGAGGACTAC ATTCGGGCGC GAATCGGCCA AGTGCGCTAC CTTGGCGAAC TGCTGCTCGA TTGGAATATC CCGATTGTCG TGCCGATCGG TGGGCACGCG ATTTTCCTCG ATGCTCGCCG TTTCTACCCA CACCTCCCCC AAGACCTCTT CCCTGCCCAA ACTTTAGCCG CTGAGCTGTA CCTCGATTCG GGGGTACGGG CAATGGAACG TGGTATCGCC AGTGCCGGAC GCGATCCTAA GACCGGGCAG CACCACTATC CCAAACTCGA ACTGACCCGC CTCACCATCC CACGCCGAGT CTATACCCAA GCCCACATGG ACGTAGTTGC CGAATCGGTG AAGTCGGTCT ATGACCAACG TGAACGCGCC CGTGGGCTGC GTATGGTTTA TGAGCCGCGT TACCTGCGCT TCTTCCAAGC CCGCTTTGAA CCGGTTACGG AGTGA
|
Protein sequence | MEMEPDFPRT MGQQFGRRSW AEPWKIKTVE PLRIISRAER AAALKAAGYN TFLLRSEDVY IDLLTDSGTN AMSDRQWAAL MMGDEAYAGS RSFYRLEAAV QQAYGYRHVI PTHQGRGAEH LISRIAIQPG QYVPGNMYFT TTRLHQELAG GIFVDVIIDE AHDPQSQYPF KGNVDLDKLQ TLINQVGAKQ IAYVSLAGTV NMAGGQPVSM ANVRALRELC DRYGIRIFLD ATRLVENAFF IKEREPGYAN HTIAEIVREF CSYTDGAWMS AKKDSLVNIG GWLALNDDQL ADEARNLVVV YEGLHTYGGM AGRDMEALAV GIEESLQEDY IRARIGQVRY LGELLLDWNI PIVVPIGGHA IFLDARRFYP HLPQDLFPAQ TLAAELYLDS GVRAMERGIA SAGRDPKTGQ HHYPKLELTR LTIPRRVYTQ AHMDVVAESV KSVYDQRERA RGLRMVYEPR YLRFFQARFE PVTE
|
| |