Gene Cagg_1562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1562 
Symbol 
ID7267339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1910937 
End bp1912094 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content60% 
IMG OID643566404 
Product3-dehydroquinate synthase 
Protein accessionYP_002462900 
Protein GI219848467 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.171072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGCAC CGCTGAAAGA TAGTTCAATT GTTATGACAA CTACGCTCAC CGTCACTACC 
AGCACGACCC AGTATCCGGT GATTGTCGGC GCCGGCGTAC TGGCGACCCT CGGCGACCGG
CTCACCGAGC TAGGACTACG CGGTACGCTC TGGCTGGTCG CCGACGAACA TCTAGCAGCC
GTCGCCGAGC AGACTACGAC AATGCTACAG GCCGCCGGTT ATCGCGTCCA CACCATCACC
GTCCCTTCTG GAGAAACGAG CAAATCGTTC ACCGAACTAC ACCGGCTCTA CGATTGGATG
ATCGAGAACG GCATCGAACG ACGTGACGCC GTGCTTGCGC TTGGTGGTGG TGTGATCGGC
GATCTGGCCG GCTTTGCTGC GGCTACCATC TTGCGCGGTG TGGCTCTTGT ACAATTACCG
AGCACTCTTT TGGCGATGGT CGATGCTGCG GTCGGCGGCA AAACCGGAAT TAATCACCCA
TTGGGCAAAA ACCTGATCGG TGCGTTTCAC CAACCCCGGC TGGTGCTGGC CGACACCAAC
CTGCTGGCGA CACTGCCGCC CCGTGAGTTA CGCGCCGGTT GGGCAGAGGT GATCAAACAC
GGGGTCATTC GCGACGCCAG CCTGTTTACC GCCCTCGAAG ATCTTGCCGC TACCCGCGGA
TGGAACGCCG CGCATCCCGC CGGATGGAAC GCTGCCGATG CAGAACTCAC CACTTATCTG
ACCGAGATCA TTGCTCGTGC CGTCGCGGTG AAAGTTGCTG TGGTCTCGAA CGATGAGTTC
GAGCGCGGTG AACGGATCAC GCTCAACTAT GGGCATACCA TCGGCCACGC TATCGAACAA
CTGCTCGGCT ACCGCCTACT GCACGGCGAA TGCGTCGCGA TTGGGATGGA TGCAGCAGCG
CGGATTGCCG TCGCTCTCGG TCTGTGTCCA CCCGCATTGG TAGAACGACA GCGCGCCCTG
CTTGCAGCCT ACGGCCTCAC CGTTACGATA CCGGACGAGA CTGACCACAC TGCGATTCTG
CGTCTCATCA CGCGCGACAA GAAGGTACAG GCCGGGAAAG TACGGTGGGT CTTGCCGACG
ACCATCGGGC AGGTGGTTGT ACGCAGCGAC GTACCTATCG AGGTGATCGA ACAGGTATTA
TCATCGTCGG CGGGATAG
 
Protein sequence
MIAPLKDSSI VMTTTLTVTT STTQYPVIVG AGVLATLGDR LTELGLRGTL WLVADEHLAA 
VAEQTTTMLQ AAGYRVHTIT VPSGETSKSF TELHRLYDWM IENGIERRDA VLALGGGVIG
DLAGFAAATI LRGVALVQLP STLLAMVDAA VGGKTGINHP LGKNLIGAFH QPRLVLADTN
LLATLPPREL RAGWAEVIKH GVIRDASLFT ALEDLAATRG WNAAHPAGWN AADAELTTYL
TEIIARAVAV KVAVVSNDEF ERGERITLNY GHTIGHAIEQ LLGYRLLHGE CVAIGMDAAA
RIAVALGLCP PALVERQRAL LAAYGLTVTI PDETDHTAIL RLITRDKKVQ AGKVRWVLPT
TIGQVVVRSD VPIEVIEQVL SSSAG