Gene Cagg_1238 details


Gene Information

Locus tag: Cagg_1238
Symbol:
ID: 7266224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1514324
End bp: 1515373
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 61%
IMG OID: 643566080
Product: glycosyl transferase group 1
Protein accession: YP_002462582
Protein GI: 219848149
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
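
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1514324
end_bp = 1515373

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```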


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0206606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTACA CGATAGGTTT TCTCACCGTT GGCGATCCCC ACCGGCTCAC CGGTGGCTAT 
CTCTACCATC GCGAAGTCTT CCGGCGCTGG CGTGCGCAAG GCCACCGCGT GGATGAAATC
GTCTTAGGAC CGGCCGATGT CGACGGGCAA CTAGCCGCCG CACCACAGGC CGGTTCACGG
ATCAGCGTTG ACCGGTACGA TGCGATCATC GTTGACGCTT TGGCCCGTGC GGTCGTGGCG
CCATGGCTTG AGTATTGGCG CACCCGGCGG CCACTACTCG CACTGATCCA TGAATTACCG
GCGGTAGCCG GCGCAAGCGA TCCGCGGGAA CGGGAATGGG AACAGGCACT GCTGCGCGCT
CATATACTCG TCACGGTAAG CGATGATGGA GCGGCAACCC TGATCGCCCG TGGCGCAGAT
CCGAACCGAC TAGTTGTAGC TTCCGGCGGA TGTGATCGTC TGCTACCCTT GCTACCGGTC
AACTGCACCC GCGAGCCACT GATCATCGCC GTTGGCCAAT GGATTCCGCG CAAGAATCTA
GCCAACCTGG TACGAGCATG GGGACAAGCC GCAGTGAACG GCTGGCATTT AGCACTGATC
GGCGAGACTG AAGCCGATCC AACTTACGCC GCCGAAGTCT GGCAAGCGGT GCGCAACTGC
TCGGCCCCGG TACTGGTGCG AGGCACGCTT AGCGATGATG AGTTAGCCCA CCTCTACGCA
CGCGCCAGCG TCTTTGCCCT CCCCTCCCGC TTTGAAGGCT ACGGATTAGT CTTTGCCGAA
GCACTCGCAT GCGGTCTACC GGTGATTGCC GGCGCGGTCG GACCCGTGCC GGCCCTGGTC
GGTAATGGTG GGCTCCTCGT ACCACCTGAC GACGAACCGG CATTGGCAGC GGCCCTACAC
CGCATTGCTA TCGACACTAC ACTCCGTCAA CGTCTCAGCG CTGCTGCTCG TCAACGCGCC
CATACTCTGC CACGTTGGGA TGACACTGCG CAACGTTTGT TAGCTGCCGT AATCCATGCA
TGCACATCCT CGGCAAGCGC TCAAACTTGA
 
Protein sequence
MSYTIGFLTV GDPHRLTGGY LYHREVFRRW RAQGHRVDEI VLGPADVDGQ LAAAPQAGSR 
ISVDRYDAII VDALARAVVA PWLEYWRTRR PLLALIHELP AVAGASDPRE REWEQALLRA
HILVTVSDDG AATLIARGAD PNRLVVASGG CDRLLPLLPV NCTREPLIIA VGQWIPRKNL
ANLVRAWGQA AVNGWHLALI GETEADPTYA AEVWQAVRNC SAPVLVRGTL SDDELAHLYA
RASVFALPSR FEGYGLVFAE ALACGLPVIA GAVGPVPALV GNGGLLVPPD DEPALAAALH
RIAIDTTLRQ RLSAAARQRA HTLPRWDDTA QRLLAAVIHA CTSSASAQT
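
The sequences above are printed in whitespace-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the blocks, compute GC content, and run a rough coding-sequence sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon at the end.

    Simplification: only ATG is accepted as a start codon here.
    """
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# Toy input for illustration only; this is not the gene sequence above.
example = clean_sequence("ATGGCCGGTA AATGA")
print(len(example), round(gc_fraction(example), 3), looks_like_cds(example))
```

To apply this to the record, paste the full gene sequence into `clean_sequence` and compare the result of `gc_fraction` with the GC content reported in the Gene Information section.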