Gene Cagg_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2030 
Symbol 
ID7269189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2491619 
End bp2493211 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content56% 
IMG OID643566865 
Product4-hydroxyphenylacetate 3-hydroxylase 
Protein accessionYP_002463354 
Protein GI219848921 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAT CCGAAGTAAC GGCGGAGACC GCCATAGGCG GTGTGCGTCC AATGAACGGG 
CGTGAGTATC TCGAAAGCCT ACGCGATGAT CGAGTCGTCT ATTTTCAAGG GGAACGGGTT
AAGGACGTCA CGACTCACCC AGCCTTCCGC AATTCGGCGC GCATGGTAGC GCGATGGTAC
GACCGGCTCC ATGAACTGCA TCAGGAGGAT GTAGCCCGCG GTGATCCCGA TCAGTGGAAG
TGGACGATGC CGACCGATAC CGGTAGTGGT GGCTGGACCC ATCCCTTCTT TGTCGGGTCG
CGGACGGTTG AGGATTTGGT ACGGGCACGG GACACCATTG CCGAGTTGCA GCGGGCAGTT
TACGGCTGGA TGGGACGTGC GCCTGACTAC AAGGCAGCAT TTACCGGCAC ACTTGGAGCG
AATGCCGAGT TTTACGCGCC CTATCAGGAG AATGCGCGGC GCTGGTATCG GAAAACGCAG
GAGGAGTTGA TTTACTGGAA TCATGCGATT GTGAATCCGC CCATCGACCG CAATCGACCG
CCGGACGAAG TTGCAGACGT GTACATGCAT GTCGAACGGG AGACGGATGC GGGGCTGATT
GTATCCGGGG CGAAGGTGGT GGCGACCGGT AGTGCCTTAA CCCATGTGAA CTTTATCGCT
CACTATGGTC CACTGCCGAT CAAAGAGAAG CGATTTGCAC TTATTTTTGC GGTACCGATG
AACGCGCCCG GCGTGAAGTT AATTGCACGC ACCTCGTATG AATACAACGC GGCAGTCGTC
GGTAGCCCCT TCGATTACCC GTTATCAAGC CGGCTCGATG AGAACGACTC GATTTTGGTC
TTCGACCGCG TCCTGATCCC GTGGGAAAAC ATTTTTGTGT ACGGCGACAT TGAGAAGGTC
AACACCTTCT TCCCAATCTC AGGCTTCGGC CATCGCTTCC CGCTGCACGG TGGTACGCGC
TTTGCCGTTA AGCTCGATTT CATTACCGGG CTTATGCTCA AAGCGGTTGA GTCAACCGGC
GTCGCCGAAT TTCGCGGTGT ACAGGCACGA CTTGGTGAGA TCGTCACCTA TCGCAACCTC
TTCTGGCACT TGACCGAGGC GATGGTGCGT AATCCGATGC CGTGGGTAGA TGGGTACCTC
TTACCAAATC TCGAAGCCGC CTTCGCCTAC CGTGTGTTAG CGCCGGATGC CTACGTCAAG
ATCAAAGACC TGATCGAGAA AGATGTTGCG AGCGCGCTGA TCTATCTGCC ATCACATGCA
GCCGATCTGA AGAACCCTGA AGTACGCGCC TACCTCGACC GTTTCGTGCG CGGTTCAAAT
GGCACCAGCG CATTCGATCG GATTAAGCTG ATGAAGTTAC TCTGGGACGC AATCGGCACC
GAGTTTGGTG GTCGGCACGA ACTGTACGAG CGTAACTACG CCGGGAACCA CGAGAACATC
CGGATCGAGA CCTTGGGAGC AGCAATGGCG ATGGGGGTGA CAGCCAATCT GAAGGCATTC
GCCGAGCGGT GCATGGCCGA GTATGACCTC GATGGTTGGA CGGTGGACGA CTTGGTCAAT
CCGACCGATG TTAATGTCGT GATGAGCCGG TAA
 
Protein sequence
MTVSEVTAET AIGGVRPMNG REYLESLRDD RVVYFQGERV KDVTTHPAFR NSARMVARWY 
DRLHELHQED VARGDPDQWK WTMPTDTGSG GWTHPFFVGS RTVEDLVRAR DTIAELQRAV
YGWMGRAPDY KAAFTGTLGA NAEFYAPYQE NARRWYRKTQ EELIYWNHAI VNPPIDRNRP
PDEVADVYMH VERETDAGLI VSGAKVVATG SALTHVNFIA HYGPLPIKEK RFALIFAVPM
NAPGVKLIAR TSYEYNAAVV GSPFDYPLSS RLDENDSILV FDRVLIPWEN IFVYGDIEKV
NTFFPISGFG HRFPLHGGTR FAVKLDFITG LMLKAVESTG VAEFRGVQAR LGEIVTYRNL
FWHLTEAMVR NPMPWVDGYL LPNLEAAFAY RVLAPDAYVK IKDLIEKDVA SALIYLPSHA
ADLKNPEVRA YLDRFVRGSN GTSAFDRIKL MKLLWDAIGT EFGGRHELYE RNYAGNHENI
RIETLGAAMA MGVTANLKAF AERCMAEYDL DGWTVDDLVN PTDVNVVMSR