Gene Cagg_3438 details


Gene Information

Locus tag: Cagg_3438
Symbol:
ID: 7269663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4177124
End bp: 4178548
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 57%
IMG OID: 643568248
Product: tyrosine phenol-lyase
Protein accession: YP_002464716
Protein GI: 219850283
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02618] tyrosine phenol-lyase
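
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4177124, 4178548)
print(length, protein_length(length))  # 1425 474
```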


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00561885
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.374113
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATGG AACCAGACTT CCCACGCACG ATGGGTCAGC AGTTTGGCCG CCGGTCGTGG 
GCCGAGCCGT GGAAGATCAA GACGGTCGAG CCGCTGCGGA TCATTAGCCG GGCCGAGCGC
GCGGCAGCGC TGAAAGCCGC CGGCTACAAC ACCTTTTTAC TCCGTTCGGA AGATGTTTAT
ATCGATCTGC TCACCGATAG CGGCACCAAT GCTATGAGCG ACCGGCAGTG GGCAGCGTTG
ATGATGGGTG ACGAGGCGTA TGCCGGTAGC CGCAGTTTCT ATCGGTTGGA AGCGGCAGTG
CAGCAGGCTT ACGGCTATCG TCATGTCATT CCCACCCACC AAGGCCGTGG CGCCGAGCAT
CTGATCAGCC GGATCGCTAT TCAACCCGGT CAGTATGTGC CCGGTAATAT GTATTTCACT
ACTACCCGTC TCCACCAAGA ACTCGCCGGT GGCATATTTG TTGATGTGAT TATCGATGAA
GCTCACGATC CGCAGAGCCA ATATCCCTTC AAGGGGAATG TCGATCTCGA TAAGCTGCAA
ACGCTGATCA ATCAAGTTGG CGCAAAACAG ATTGCCTACG TCAGCCTCGC CGGTACGGTC
AATATGGCCG GCGGCCAGCC GGTCAGTATG GCGAACGTGC GCGCTTTGCG CGAATTGTGC
GACCGCTACG GCATCCGCAT TTTTCTCGAC GCGACCCGGT TGGTTGAGAA TGCCTTCTTC
ATCAAAGAGC GCGAACCCGG CTACGCGAAT CATACTATCG CCGAAATCGT GCGCGAGTTT
TGTAGCTATA CCGACGGTGC GTGGATGAGC GCCAAAAAGG ATAGTCTGGT CAACATCGGG
GGCTGGCTGG CGCTGAACGA CGATCAGCTT GCCGATGAGG CGCGCAATCT GGTGGTGGTG
TACGAGGGGT TGCATACCTA CGGCGGGATG GCCGGGCGCG ATATGGAAGC ATTGGCAGTT
GGAATTGAGG AATCATTACA AGAGGACTAC ATTCGGGCGC GAATCGGCCA AGTGCGCTAC
CTTGGCGAAC TGCTGCTCGA TTGGAATATC CCGATTGTCG TGCCGATCGG TGGGCACGCG
ATTTTCCTCG ATGCTCGCCG TTTCTACCCA CACCTCCCCC AAGACCTCTT CCCTGCCCAA
ACTTTAGCCG CTGAGCTGTA CCTCGATTCG GGGGTACGGG CAATGGAACG TGGTATCGCC
AGTGCCGGAC GCGATCCTAA GACCGGGCAG CACCACTATC CCAAACTCGA ACTGACCCGC
CTCACCATCC CACGCCGAGT CTATACCCAA GCCCACATGG ACGTAGTTGC CGAATCGGTG
AAGTCGGTCT ATGACCAACG TGAACGCGCC CGTGGGCTGC GTATGGTTTA TGAGCCGCGT
TACCTGCGCT TCTTCCAAGC CCGCTTTGAA CCGGTTACGG AGTGA
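
The gene sequence is laid out in blocks of ten bases, so the whitespace has to be stripped before any calculation. As a minimal sketch, the GC content field could be recomputed as the fraction of G and C bases; the helper names here are hypothetical, and how the database itself rounds the value is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of bases that are G or C in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used as a short demonstration.
print(f"{gc_content('ATGGAGATGG AACCAGACTT'):.0%}")  # 45%
```

Passing the full gene sequence to `gc_content` gives the value for the whole gene.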
 
Protein sequence
MEMEPDFPRT MGQQFGRRSW AEPWKIKTVE PLRIISRAER AAALKAAGYN TFLLRSEDVY 
IDLLTDSGTN AMSDRQWAAL MMGDEAYAGS RSFYRLEAAV QQAYGYRHVI PTHQGRGAEH
LISRIAIQPG QYVPGNMYFT TTRLHQELAG GIFVDVIIDE AHDPQSQYPF KGNVDLDKLQ
TLINQVGAKQ IAYVSLAGTV NMAGGQPVSM ANVRALRELC DRYGIRIFLD ATRLVENAFF
IKEREPGYAN HTIAEIVREF CSYTDGAWMS AKKDSLVNIG GWLALNDDQL ADEARNLVVV
YEGLHTYGGM AGRDMEALAV GIEESLQEDY IRARIGQVRY LGELLLDWNI PIVVPIGGHA
IFLDARRFYP HLPQDLFPAQ TLAAELYLDS GVRAMERGIA SAGRDPKTGQ HHYPKLELTR
LTIPRRVYTQ AHMDVVAESV KSVYDQRERA RGLRMVYEPR YLRFFQARFE PVTE
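
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard NCBI amino acid ordering, which translation table 11 shares with table 1; the two tables differ only in which start codons they permit, and this sketch does not handle alternative start codons (this gene begins with ATG). The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a whitespace-formatted coding sequence, dropping one trailing stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGAGATGG AACCAGACTT CCCACGCACG"))  # MEMEPDFPRT
```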