Gene Cagg_1534 details


Gene Information

Locus tag: Cagg_1534
Symbol:
ID: 7267311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1878592
End bp: 1879815
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 57%
IMG OID: 643566376
Product: glycosyl transferase group 1
Protein accession: YP_002462872
Protein GI: 219848439
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
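
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; both assumptions are consistent with the values in this record, but neither is stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1878592
end_bp = 1879815

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1224 407
```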


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.994246
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACATTC TCTATCTGCA TCAATACTTT ACTACCCGCG CCGGGAGTGG CGGAACCCGC 
TCGTATGAGT TTGCTCGCTA TTTTGTTCAA AACGGGCATC GGGTGACTAT CGTGACGGCT
GCCGATCCGC AGACACCGTG GGCAGGCGGA TGGTGGCGGC AACGTGTGGT AGACGGGATT
AACGTAGTCG AGGTACGGGC CGGTGATACC GATTATCGTC GAAAGACGGC CCTTGGTTAC
GGTCAGCGAA TGGTCGCGTT TCTCCTCTTC GCCCTTGCCA GCGTCATCGC CGTACTGCGC
GTCGCACGAC CCGATGTAGT CTTCGCGACC AGCACACCGC TTACCATCGG CATTCCCGGT
ATAATCGCTA GCCGTTGGCA TCGCGTTCCA CTGGTATTCG AGGTGCGCGA TCTGTGGCCG
GAAGCGCCGT TACAGATGGG CGCATTGCGC CATCCGGCCT TGATACTGGC CGCACGCTGG
CTCGAACGCA CTATCTACCG CCACTCGCGC CATATTATCG CACTCTCGCC CGGTATGCGG
CAGGGCATCT TAGATACCGG CGTACCACCG GAAAAGGTTA GCGTGATTCC CAATGCCGCC
GATCTTGATC TCTTCCATCC GTTACGTGAT GGTCGGTGTT GGCGTGAACG ACTAGGTCAT
CCCCCATTTC TGGCGCTCTA CTTTGGCACG ATGGGTGAGG CTAACGATCT ACAGCAGGTG
ATCGAGGCTG CACGAATATT GCAGTCACAA GGCCGTGACG ACATCCTGAT TGTGCTGGCG
GGGCAAGGCC GGCAACGTCC GCAGCTTGAG GAAAAAACTC GTGACTACCA ACTGCGCAAT
GTGCGTTTTC TCGATCCGTT ACCCAAAACC GAGGTCGCCG ATCTGGTCGC TGCCGCCGAT
GTCTGTCTGA CCATCTTCAA AGCCATACCG GTGCTGGCAA CGTGCTCACC AAACAAACTC
TTCGACGCAC TGGCCGCCGG TAAAGCGGTT ATCGTCAATA CACCGGGTTG GTTACAACAG
TTGGTCGAAA CGCATCAATG TGGGCGCTAT GCACGTGCCG GCGATCCCGC CGATCTAGCA
GCCCAGATCG CCTATTTATG TGACCATCCG GCCTTCACCA AGCACGCCGG TCAACAGGCT
CGGTATCTGG CCGAGCAACA ATTTGATCGC CGGCAATTGG CAGCCGCAGC ACTGACCATC
CTGCAAACGT GTACAACAAA CTAA
 
Protein sequence
MHILYLHQYF TTRAGSGGTR SYEFARYFVQ NGHRVTIVTA ADPQTPWAGG WWRQRVVDGI 
NVVEVRAGDT DYRRKTALGY GQRMVAFLLF ALASVIAVLR VARPDVVFAT STPLTIGIPG
IIASRWHRVP LVFEVRDLWP EAPLQMGALR HPALILAARW LERTIYRHSR HIIALSPGMR
QGILDTGVPP EKVSVIPNAA DLDLFHPLRD GRCWRERLGH PPFLALYFGT MGEANDLQQV
IEAARILQSQ GRDDILIVLA GQGRQRPQLE EKTRDYQLRN VRFLDPLPKT EVADLVAAAD
VCLTIFKAIP VLATCSPNKL FDALAAGKAV IVNTPGWLQQ LVETHQCGRY ARAGDPADLA
AQIAYLCDHP AFTKHAGQQA RYLAEQQFDR RQLAAAALTI LQTCTTN
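
As a spot check that the gene and protein sequences agree, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: the helper names `clean` and `translate` are hypothetical, the codon table is the standard one (translation table 11 shares its codon-to-amino-acid assignments and differs in the permitted start codons), and only the bases A, C, G and T are handled.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: this is sufficient for table 11, which uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block_text):
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(block_text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; a stop codon is shown as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record above.
# The last line is in frame because 1200 bases (a multiple of 3) precede it.
first_line = "ATGCACATTC TCTATCTGCA TCAATACTTT ACTACCCGCG CCGGGAGTGG CGGAACCCGC"
last_line = "CTGCAAACGT GTACAACAAA CTAA"

print(translate(clean(first_line)))  # MHILYLHQYFTTRAGSGGTR
print(translate(clean(last_line)))   # LQTCTTN*
```

The two outputs match the first 20 residues and the last 7 residues of the protein sequence, with the trailing `*` marking the TAA stop codon.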