Gene Cagg_0662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0662 
Symbol 
ID7266913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp815587 
End bp817632 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content58% 
IMG OID643565523 
ProductShikimate/quinate 5-dehydrogenase 
Protein accessionYP_002462033 
Protein GI219847600 
COG category[R] General function prediction only 
COG ID[COG5322] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0104064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000472149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGAG TAGTTAGCAT TAGTCTCGGT TCGGCGCAGC GTGACTACCA AATCACGGCA 
ACGGTGCTTG GCCGGCAGGT TGAGGTGCGC CGGATTGGCA CTAACGGTGA TGTAGCCCAG
GCGATGGCCT TGATCCGGGA ATTTGATGGC AACGTTGACG CAATCGGTCT CGGTGGGTTG
ACGCCGGTGT TTCGGATTGG TCGTGCCCGT TATCCGCATC AAGAAGCGAT CCATATCGCA
GCGCAGGCAC GGCGGACGCC GGTAGTTGAT GGTGGTGTCG TCAAGGCAAT CTTGGAACGA
TGGGCGATTG CGCAGGCAGT GCGTCAGATC CCTTCATTAG TGCGGTACAA GCGCGTATTG
ATTGCCAGTG GCGTCGAACG CTATCAATTG GCAGCCGCAA TCGCTCAGTA CGAACCAGAA
TTGCGTTTTG CCGATCCGAT TATCCACGCC GGTCTTCCCT TCTTACCGCC ACCGCGTTCG
CTTGAACAAC TCGAATTGTA CGCGGCTACT GCGCTACCGC TGCTCGCTCT CCTCCCTTAC
CGTTTTATCC ACCCGGTCGC GCTCGGTCAA GAAGGTTATG ACCCACGTGC TGCCGCTCTC
TTTCAATGGG CTGATGTCAT TGCCGGCGAT TTTGCCTTCA TCCGTCGCTT TGCCCCGGCC
GACCTGACCC GCAAAGCAGT TATCACCGAT GATCCGTCTC CGGCGGAAAT CGAGGACTTG
CGCCGGCGCG GAGTGACGAC CTTGGTGACG ATGACGCCAC CCCTGAGTGA CGAACGTCCC
TTTCTGGCGG CTGATGCGAT CGAGGCGATC ATTACGGCGA TTACCGAGAG TACGCGCCAG
CCCGGTGATG CCGAAGTCAT CGATTTTATT ACCGCTGCCG GCTGGGGACC GACGGTGCAA
GACCTTAATC CGCGCCCGAA GCCGCGCTTT GCCTTTGTCA TCCATCCGTT GCGGACCGAA
CTGATTGCCA ATCACCGCTG GTTCCGTTGG ACGCGCTACC TGCCGCCGCG TTTGGTGGAG
CTAGTTGCTG CCGAGTTTCC ACCGCTCTAC CTGTCGCGGA TCCGTGGGAT TCGCTCGAAA
GCAACCGGTG AAGAGGTCGA GGGTATCCTC CTCACCCTCG GCACGACTCC GCGCGAGATG
ATGCGTCGAC CACCGAGTTT TACTTATCGC CGGTTGATCA AAGCGGCGCG GATGGCCGAA
CGGATGGGGG CGCAGATTAT GGGCTTGGGC GCATTCACCT CTGTCGTCGG TGATGCCGGG
ATTACCGTAG CCCAGAAGTC CAACATCGGC ATCACTTCAG GTAATTCGTT GACGGTGGCC
GCAACGCTTG AAGCGGCCAA GCAGGCAGTG TTACTGATGA AGGGGGGCAA ACCGGAACAT
GTGCGGGCCG TCGTGATTGG GGCAACCGGT TCGATTGGCG CCGTCTGTGC CCGCTTGCTG
GCACAGGCAG TACACGATGT CGTACTGGTT GCACCGCGTG CCGAACGGTT GATCGCGCTT
AAGAAACAGA TCGAATCCGA GACGCCGGGA GCGCGGGTTG TGGCCGCGAC CTATGCTGAT
GCCTATCTCG GTGACGCCGA TTTGATTATC ACTACGACCA GTGCTCTGAC CGGTAAAGTC
ATTAATGTCG ATAAACTCAA ACCCGGAGCA GTGGTGTGCG ATGTGGCTCG CCCACCTGAT
GTAAAAGAGG AAGATGCACG GCGACGGCCC GATGTACTGG TGATTGAGAG TGGTGAGATC
GTGTTACCCG GTGAGCCGGA TTTTGGCTTT GATATCGATA TGCCACCCGG TACGGCCTAC
GCCTGTCTCG CCGAAACGGC GCTACTGGCA ATGGAAGGCA AGTTTGAAGA TTATACCCTT
GGTCGCAATA TCGAAATCGA GCGGGTAAAA GAGGTTTACC GACTTTGGAA AAAACACGGC
CTCGAACTCG CTCGTCTGCG CTCGTTTGGG GTGTATGTAA CCGACGAGAT GATCGCCGAG
AAGCGGCGGT TAGCCGAAGA ACGACGGCGT CAGTTGGGCT TGCCGGCGGA TAAGGTGTGT
GAGTAG
 
Protein sequence
MKRVVSISLG SAQRDYQITA TVLGRQVEVR RIGTNGDVAQ AMALIREFDG NVDAIGLGGL 
TPVFRIGRAR YPHQEAIHIA AQARRTPVVD GGVVKAILER WAIAQAVRQI PSLVRYKRVL
IASGVERYQL AAAIAQYEPE LRFADPIIHA GLPFLPPPRS LEQLELYAAT ALPLLALLPY
RFIHPVALGQ EGYDPRAAAL FQWADVIAGD FAFIRRFAPA DLTRKAVITD DPSPAEIEDL
RRRGVTTLVT MTPPLSDERP FLAADAIEAI ITAITESTRQ PGDAEVIDFI TAAGWGPTVQ
DLNPRPKPRF AFVIHPLRTE LIANHRWFRW TRYLPPRLVE LVAAEFPPLY LSRIRGIRSK
ATGEEVEGIL LTLGTTPREM MRRPPSFTYR RLIKAARMAE RMGAQIMGLG AFTSVVGDAG
ITVAQKSNIG ITSGNSLTVA ATLEAAKQAV LLMKGGKPEH VRAVVIGATG SIGAVCARLL
AQAVHDVVLV APRAERLIAL KKQIESETPG ARVVAATYAD AYLGDADLII TTTSALTGKV
INVDKLKPGA VVCDVARPPD VKEEDARRRP DVLVIESGEI VLPGEPDFGF DIDMPPGTAY
ACLAETALLA MEGKFEDYTL GRNIEIERVK EVYRLWKKHG LELARLRSFG VYVTDEMIAE
KRRLAEERRR QLGLPADKVC E