Gene Cagg_3079 details


Gene Information

Locus tag: Cagg_3079
Symbol:
ID: 7269496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3741088
End bp: 3742116
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 54%
IMG OID: 643567899
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_002464373
Protein GI: 219849940
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
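
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(3741088, 3742116)
print(length_bp)                  # 1029
print(protein_length(length_bp))  # 342
```

Both results agree with the Gene Length and Protein Length fields listed in the record.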


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.544275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000030667
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCGAAATC TTCTCGTGAC CGGTGGAGCC GGCTTTATTG GTAGCAACTT CGTGCATTAT 
ATGCTCGGCA AATACGACGA TTATCGCATC GTCGTGTACG ATAAGCTGAC GTATGCCGGT
AATCTTGCTA ATCTGGCACC GGTTGCCAAC GATCCGCGGT TTGTTTTTGT GCGTGGCGAC
ATCTGTGATA TCGATGCAGT GCGGGAAACG GTGCGCACGT ATGATATCGA TACCATCATC
AATTTTGCTG CCGAGACGCA CGTCGATCGC TCAATCATGG CGCCCGATGC CGTAGTGCGC
ACCAATGTAA ACGGTACGTG GGCATTACTG GAAGTGGCAC GTGAACTGAA ACTCGAACGT
TTTCACCAGA TTAGTACCGA CGAAGTGTAC GGCGCTATTC CGGCCCCGCG CCGTTCGCGT
GAGGGTGATC CGCTCGAACC ACGCAGTCCC TATTCGGCCA GCAAAGCCGG AGCCGAACAT
CTCGTCTACG CTTACTACAT CACCTACGGT GTACCGATCA CGATTACTCG CGGCTCGAAT
AACATCGGTC CCTATCATTA TCCCGAAAAG GCGGTACCCC TCTTCACCAC CAACGCCATC
GATAATCTAC CCTTGCCGAT CTACGGTGAT GGTCTCCAGG TACGCGATTA TCAGTACGTG
CTCGATCATT GTGAAGCCAT CGATGTCGTG CTGCACAAAG GCCAGATCGG TGAGGTCTAC
AACGTAGGGA CCGAGGTCGA GACGCCGAAT ATCGAGATGG CGCGCAAGAT TCTCGATATT
CTCGGCAGGC CGCATAGTCT CATTCAGCAC GTTGCCGACC GTGCCGGTCA TGATCGCCGC
TATGCCCTCG ATTGCTCGAA ACTGCGCGCG CTTGGGTGGC GTTCACGCCA TACCTTCGAT
GAAGCGCTGG AAAAGACGGT ACGCTGGTTT GTTGAAAATG AAGCGTGGTG GCGCCCGATC
AAGTCAGGTG AGTATATGGA ATACTACCGT CGCCAGTATC TTGAACGCAG TGGGTATCCG
GTGGTGTAG
 
Protein sequence
MRNLLVTGGA GFIGSNFVHY MLGKYDDYRI VVYDKLTYAG NLANLAPVAN DPRFVFVRGD 
ICDIDAVRET VRTYDIDTII NFAAETHVDR SIMAPDAVVR TNVNGTWALL EVARELKLER
FHQISTDEVY GAIPAPRRSR EGDPLEPRSP YSASKAGAEH LVYAYYITYG VPITITRGSN
NIGPYHYPEK AVPLFTTNAI DNLPLPIYGD GLQVRDYQYV LDHCEAIDVV LHKGQIGEVY
NVGTEVETPN IEMARKILDI LGRPHSLIQH VADRAGHDRR YALDCSKLRA LGWRSRHTFD
EALEKTVRWF VENEAWWRPI KSGEYMEYYR RQYLERSGYP VV
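
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequences are stored in whitespace-separated blocks as shown above, and it uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First display line of the gene sequence above.
first_line = "ATGCGAAATC TTCTCGTGAC CGGTGGAGCC GGCTTTATTG GTAGCAACTT CGTGCATTAT"
print(translate(first_line))  # MRNLLVTGGAGFIGSNFVHY
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the whole protein and the reported GC content to be compared against the record.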