Gene Cagg_2815 details

Gene Information

Locus tag: Cagg_2815
Symbol:
ID: 7267521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3456847
End bp: 3457905
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 58%
IMG OID: 643567636
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002464113
Protein GI: 219849680
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
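
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3456847
end_bp = 3457905
gene_length_bp = 1059
protein_length_aa = 352


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3457905 - 3456847 + 1 gives 1059 bp, and 1059 / 3 - 1 gives 352 aa, which agrees with the listed gene and protein lengths.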


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.700951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGTCG TAATGGAAGC ACACGCGACG GTGGAGCAGA TCGAAGCTGT TTGCGCCGAG 
ATTCGGGCGA TGGGGTTTAC GCCACACCCA ATGCCCGGCC CGACCCGAAC TGCCATCGGG
ATTACCGGTA ACCAAGGCCC AATCGAGCAG GCCGGGCGGT TGCAGCGGTT GCCCGGTGTG
AGTCAGTTGA TACGGGTAAC CGCACCCTAC AAGCGCGTCA GTCGTGAGTT TAAAGAACTC
GATACGGTGG TGGAGGTCGG TGGTGTACCG ATCGGTGGGG CCGGTATTGC GATAATTGCC
GGTCCATGTA CGGTAGAAAG TCGAGAACAG ACTCTCAACG TTGCACGGGC AGTACGTGCG
GCCGGTGCGG TCATGCTACG CGGTGGAGCG TACAAGCCGC GTACCTCACC GTATTCTTTT
CAGGGCTTAG GCGAAGCCGG CTTACGCATA TTAGCCGAAG CGCGTGAACT GACCGGTCTG
CCGGTGGTGA CCGAGGTCAT GGATACCGAG ACGTTGCCGT TGGTGGTTGA ATATGCCGAC
ATGTTGCAGA TCGGTGCGCG CAATATGCAA AATTATTCGC TGTTGCGGGC AGTTGGACGC
ACTCAGCGAC CTGTCTTGCT GAAACGTGGA TTTGCCGCCA CGGTGAAAGA TTTGCTCTTG
GCGGCAGAAT ACATTTTGGC CGAGGGGAAT CCAAACGTCG TACTGTGTGA GCGAGGTATT
CGTACCTTCG ACGATAGTTT GCGCTTTACC CTTGATCTGG GGGCCGTACC GTTGATCAAA
CAGCTCTCGC ATCTACCGGT GATCGTCGAT CCATCGCACG CGAGTGGGCG GGCCGATCTT
GTCATTCCCA TGGCGCGTGC CGCGTTAGCA GCCGGCGCCG ATGGTTTGAT CGTTGAAGTA
CACGATAATC CGGCCTACGC AGTTTGTGAT GGGACGCAGG CGCTTGTACC GGACAGCTTT
GCTGCGATGA TGCATCAGCT TGCACGCATA GCGGCAGCAG TGGAACGTCC GTTGCTGAGT
CGGGTTGAGG TGAACGGTGG ACACACGACG TTGGCGTGA
 
Protein sequence
MLVVMEAHAT VEQIEAVCAE IRAMGFTPHP MPGPTRTAIG ITGNQGPIEQ AGRLQRLPGV 
SQLIRVTAPY KRVSREFKEL DTVVEVGGVP IGGAGIAIIA GPCTVESREQ TLNVARAVRA
AGAVMLRGGA YKPRTSPYSF QGLGEAGLRI LAEARELTGL PVVTEVMDTE TLPLVVEYAD
MLQIGARNMQ NYSLLRAVGR TQRPVLLKRG FAATVKDLLL AAEYILAEGN PNVVLCERGI
RTFDDSLRFT LDLGAVPLIK QLSHLPVIVD PSHASGRADL VIPMARAALA AGADGLIVEV
HDNPAYAVCD GTQALVPDSF AAMMHQLARI AAAVERPLLS RVEVNGGHTT LA