Gene Cagg_1110 details

Gene Information

Locus tag: Cagg_1110
Symbol:
ID: 7268563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1367540
End bp: 1368880
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 643565952
Product: nucleotide sugar dehydrogenase
Protein accession: YP_002462456
Protein GI: 219848023
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
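
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The helper names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1367540
END_BP = 1368880
REPORTED_GENE_LENGTH_BP = 1341
REPORTED_PROTEIN_LENGTH_AA = 446


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp == REPORTED_GENE_LENGTH_BP)
    print(protein_length(length_bp) == REPORTED_PROTEIN_LENGTH_AA)
```

Under those assumptions, 1368880 - 1367540 + 1 gives 1341 bp, and 1341 / 3 - 1 gives 446 aa, matching the reported values.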


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.6072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000437012
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
GTGAAAAACA TCTGTGTTGT TGGTACCGGA TACGTTGGCC TGACGACCGG CGTTTGTTTC 
GCCGATCTTG GTCATTCGGT CACGTGTATC GAAATTGATC TCCAAAAGCT GGAACTGCTG
CGCAGTGGCA AATCACCGAT CTATGAACCC GGCCTGGAAG AGTTACAGGA GCGCAATATG
CGCGCCGGGC GGTTGCGCTT TACCGATGAC TATGCGGTCG GCATTCCTGA GGCCGAATTT
ATCTTTATCA CCGTCGGTAC GCCGATGAGT GAAGATGGTT CGGCCGATCT GACGTATGTA
AAAGCGGCTG CGCGCAGTAT CGGCAAGTAT CTGCGCTCCG GCTCGATCAT TATCGACAAG
AGTACGGTGC CCGTAGGTAC CGGTGATATG GTCGAGAACA TCATCGCCGA ACACGCCGGT
CCTGATGTCA AGTTTGATGT CGTCTCGAAC CCCGAATTTC TGCGCGAGGG CAGTGCGTTA
AGCGACTTTT TCAAGCCTGA CCGGATAGTA TTAGGGGCGA AAAATCGTGA AGCAGCACAG
CGGGTAGCTG CGTTGCACGA GACGCTTGGC GCACCGATTA TCATCACCGA TCTGCGTACC
GCCGAGATGA TTAAGTACGC CTCAAATGCC TTCTTGGCGA CCCGTATTTC GTTTATCAAC
GAGATTGCTC AAATCTGCGA GCGGTTGGGT GCTGATGTGC GAGAGGTGGC GCGCGGTATG
GGCGCCGATA AGCGGATCGG GCCTCATTTT CTTGAAGCGG GTGTTGGCTA CGGCGGCTCC
TGCTTCCCGA AAGATGTGCT GGCCCTGTAC CATATGGCCG CTTCGGCGGG TTGTCACCCG
CAACTGTTGC AAGCGGTGAT GGATATTAAC AGCGATGCGC GGAAGCGATT TGTGAAGAAA
GTCGAGACGG TACTCGGTGA TCTGACCGGT CGCTTGATCG GTGTGTTGGG TCTGTCGTTT
AAGCCAAACA CCGATGATAT GCGTGAAGCG CCGAGCGTTG ACATTATCAA CGCACTGCTG
AAGAAAGGGG CGCGGGTAAA GGCTTACGAC CCGGTCGCAA TGCCACGGGC AGAAGAGTTG
TTGCCAACCG TAACGTTTAC CGCCACCGCC TACGATGTCG CAAAAGATGC CGACGCTCTG
CTGCTCGTTA CCGAATGGAA TGAGTTTAAG CAACTCGACT GGCAACGGAT CAAACGCTAT
ATGCGCCAAC CGGTAGTGAT CGATGGACGC AACCTCTACG ACCCGCGTGA GATGCGGAGC
CTTGGCTTCA TCTACTGGGG TGTAGGCCGT GGCGAAGCGC CGGTGCCGTT GTGGGAAGAA
GCAACGAATA TTGGTGATTA A
 
Protein sequence
MKNICVVGTG YVGLTTGVCF ADLGHSVTCI EIDLQKLELL RSGKSPIYEP GLEELQERNM 
RAGRLRFTDD YAVGIPEAEF IFITVGTPMS EDGSADLTYV KAAARSIGKY LRSGSIIIDK
STVPVGTGDM VENIIAEHAG PDVKFDVVSN PEFLREGSAL SDFFKPDRIV LGAKNREAAQ
RVAALHETLG APIIITDLRT AEMIKYASNA FLATRISFIN EIAQICERLG ADVREVARGM
GADKRIGPHF LEAGVGYGGS CFPKDVLALY HMAASAGCHP QLLQAVMDIN SDARKRFVKK
VETVLGDLTG RLIGVLGLSF KPNTDDMREA PSVDIINALL KKGARVKAYD PVAMPRAEEL
LPTVTFTATA YDVAKDADAL LLVTEWNEFK QLDWQRIKRY MRQPVVIDGR NLYDPREMRS
LGFIYWGVGR GEAPVPLWEE ATNIGD
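
The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below illustrates this on the first 30 nucleotides of the gene sequence only. The codon dictionary is deliberately partial (just the codons that occur in that stretch), the start-codon set is a subset of those allowed by table 11, and `translate_prefix` is a hypothetical helper, not a library function.

```python
# First three 10-nt blocks of the gene sequence in this record.
FIRST_BLOCKS = "GTGAAAAACA TCTGTGTTGT TGGTACCGGA"

# Partial codon table: only the codons present in FIRST_BLOCKS.
CODON_TO_AA = {
    "GTG": "V", "AAA": "K", "AAC": "N", "ATC": "I", "TGT": "C",
    "GTT": "V", "GGT": "G", "ACC": "T", "GGA": "G",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix; a recognised start codon is read as M."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator methionine
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


if __name__ == "__main__":
    print(translate_prefix(FIRST_BLOCKS))
```

The result can be compared against the first ten residues of the protein sequence above (MKNICVVGTG). A full check would need the complete codon table for translation table 11.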