Gene Cagg_1546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1546 
Symbol 
ID7267323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1892854 
End bp1894140 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content57% 
IMG OID643566388 
Productglycosyl transferase group 1 
Protein accessionYP_002462884 
Protein GI219848451 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTAG TTCACGTTAG TACGAACGAC ATCAGCGGTG GTGCGGCACG GGCAGCGTAT 
CGGCTCCATC AGGGCTTATT GCAGTTGGGG TGTGATTCAC GAATGGTGGT TGCCCATCGG
TGGAGTGATG ATCCGACGGT GCAGGAGTTG GCGTCGAAAC CCGTTCTCAT CGGTACGTGG
CAACGACGTT GGCGTGGTTG GCGGATCCGA CGCGATATGC AACCGTACCT CACAACCCGC
CCGCCCGGCC TCGAACCATT CAGCGATGAT CGCAGTCGGT ATGGGTACGA ATTACCAAGA
GCATTGCCGG CTTGCGATGT GGTCACTTTA CATTGGGTTG CCGGTTTGCT CGATTACGGC
AGCTTCTTTC GGACCGTACC GCAACGAGTA CCGGTCGTGT GGCGGCTTTC CGACCAGCAG
CCCTTTACCG GCGGTTGCCA TTACGATGAA GGCTGTGGAC GCTACACGGC GACCTGTGGG
GCATGTCCGC AGCTCGGTTC GCGCGATGAT CACGATCTTT CCCACCGGAT TTGGTTACGC
AAACGGGCTG CCCTCGCTGC CGTGCCACCC GGTCATCTCC ACATCGTTGC GCTCAACCGT
TGGATGGCTG CCGAAGTACA CCGTAGCTCG CTGTTCGGGC ATTTACCGGT GCATATCATT
CCTAACGGTC TTGATACCAC CGTCTTTGCA CCGTATGATC GGGCCTACGC GCGGGCAATA
CTCGGCTTAC CCCAGCAAGC AAAGATCGTC CTGTTTGTCG CGGTTTCGGT CAATAATCGT
CGGAAAGGAT TTGCTCAATT AGCAGCGGCA TTGGCCGGTC TGTATGATGA ACCCGACTTA
TTGCTGGTCT CGGTCGGTAA ACATCCGCCT ACCCTGAATA TCCCCATCGC GCATCATCCT
CTCGGTACGG TTGATGAAGA TACCCGGCTC GCCTTAGCTT ATAGCGCTGC TGATCTTTTT
GTTATTCCGT CGTTGCAAGA CAATATGCCG AGTACGGTGC TCGAAGCACT GGCATGTGGC
ACGCCGGTCG TCGGTTTTGA TACCGGTGGT ATTAGCGAAT TGGTGCGCCC CGGCCAAACC
GGTTGGTTGG CGCCGGTCGG TGATGTCGAC GGGTTGCGTG AGGCCATTCG GCATGCGCTC
CACAACGATG ATGAGCGCGT ATGGTTGGGA CGCCGCTGCC GAGAAATTGC CCTTGCTGAG
TATCGGCAAG AAATACAGGC GCAACGCTAT CTCGACCTCT ATCAACAAAT TACGACTACC
GCCAATGCTA CAGTACGGGT GGGATGA
 
Protein sequence
MRVVHVSTND ISGGAARAAY RLHQGLLQLG CDSRMVVAHR WSDDPTVQEL ASKPVLIGTW 
QRRWRGWRIR RDMQPYLTTR PPGLEPFSDD RSRYGYELPR ALPACDVVTL HWVAGLLDYG
SFFRTVPQRV PVVWRLSDQQ PFTGGCHYDE GCGRYTATCG ACPQLGSRDD HDLSHRIWLR
KRAALAAVPP GHLHIVALNR WMAAEVHRSS LFGHLPVHII PNGLDTTVFA PYDRAYARAI
LGLPQQAKIV LFVAVSVNNR RKGFAQLAAA LAGLYDEPDL LLVSVGKHPP TLNIPIAHHP
LGTVDEDTRL ALAYSAADLF VIPSLQDNMP STVLEALACG TPVVGFDTGG ISELVRPGQT
GWLAPVGDVD GLREAIRHAL HNDDERVWLG RRCREIALAE YRQEIQAQRY LDLYQQITTT
ANATVRVG