Gene Cagg_1819 details

Gene Information

Locus tag: Cagg_1819
Symbol:
ID: 7267731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2231114
End bp: 2232169
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 57%
IMG OID: 643566657
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002463152
Protein GI: 219848719
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
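
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2231114, 2232169)
print(length)                     # 1056
print(protein_length_aa(length))  # 351
```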


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0334818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGA TACCATTAGC CAATCTACAC GTTCGTGAAC TATCACCGCT TCAACCACCA 
CGTGTCCTAA AAGCCGAACT TCCGATCAGC CCGGCTGCGG CCCATACGGT TGCCGAAGCC
CGGGCCGCGA TTCGGCGGAT TTTACGTGGT GAAGATCACC GCCGGATCAT GGTGGTTGGG
CCATGCTCGA TCCATGATCC CGAAGCCGCA CTTGATTATG CTCGTCGCCT CCAAGCGCTG
CAACGCCCGC TTGGCGATCA GTTGCTGATC GTGATGCGCA CCTATCTCGA AAAGCCGCGC
ACTACCGTCG GCTGGCGTGG CTTGATCAAT GACCCCCATC TCGACGGCTC GTTCGATATG
GCCGCCGGTC TGCGTATTGC CCGTCAATTG CTTCTGGCGA TCAACGAACT CGGCGTGCCG
GTCGCGACCG AGATGCTTGA TCCGATTAGT CCGCAATACC TCGACGACCA AATCAGCCTC
GCGACGATCG GCGCCCGCAC GAGCGAAGCG CAAACGCACC GAGCGTTGGC CAGTGGGGTT
TCGATGCCGG TTGGTTTTAA GAATGGCACT GATGGCGGTA TTCAGATCGC TGTCAATGCC
TGTGTTTCGG CAGCGGCACC ACACAGCTTT CTCGGTATCG ATGAAGATGG GCGCAGTGCA
GTGGTACGTA CCACCGGTAA TCCTGATAGT TTCGTTATTT TGCGTGGTGG CCGCTACGGC
CCCAATTATC ATCTCGAGTA TATCGTGCAG GCAACGCGGT TGATGCGCGA AGCCGAACGA
ACTCCGGCAG TGATGGTCGA TTGCAGTCAC GCCAACTCCG GTGGCGATTT TCGCCGTCAA
GAAGCGGTTT GGCAAACGGT ACTCGGCTAT ATGGTCGAAG AAGAGTTGCC GATCATCGGG
ATGATGTTGG AGAGTAATTT GTTTGAAGGG AAGCAACCAC TTCTTGCCGA CCGCAGCCTG
CTGAAGTACG GTGTTTCACT GACCGACGGT TGTGTTGGGT GGGACACAAC CGAACGCTTG
TTACACGAGG CCCATCTGGC GCTGAGCAGG CGCTAA
 
Protein sequence
MNQIPLANLH VRELSPLQPP RVLKAELPIS PAAAHTVAEA RAAIRRILRG EDHRRIMVVG 
PCSIHDPEAA LDYARRLQAL QRPLGDQLLI VMRTYLEKPR TTVGWRGLIN DPHLDGSFDM
AAGLRIARQL LLAINELGVP VATEMLDPIS PQYLDDQISL ATIGARTSEA QTHRALASGV
SMPVGFKNGT DGGIQIAVNA CVSAAAPHSF LGIDEDGRSA VVRTTGNPDS FVILRGGRYG
PNYHLEYIVQ ATRLMREAER TPAVMVDCSH ANSGGDFRRQ EAVWQTVLGY MVEEELPIIG
MMLESNLFEG KQPLLADRSL LKYGVSLTDG CVGWDTTERL LHEAHLALSR R
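
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the tool that produced this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in how codons map to amino acids). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = clean("ATGAACCAGA TACCATTAGC CAATCTACAC")
print(translate(first_block))  # MNQIPLANLH
```

Passing the full gene sequence through `clean` and `translate` should reproduce the 351-residue protein, and `gc_content` on the same cleaned string gives a value to compare with the GC content field.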