Gene Cagg_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0334 
Symbol 
ID7268435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp416040 
End bp417275 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content56% 
IMG OID643565202 
Productglycosyl transferase group 1 
Protein accessionYP_002461716 
Protein GI219847283 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.145244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTACA CCATTTTGAT GATTGCCTCG ACCAGCTTCT TCTCCGATTA CGGTAATCAT 
GTTCGTATTC TCGAAGAGAT TCGCGCATTA CAACGGCGTG GTCATCGGAT TGTACTGGTC
ACCTACCACA ACGGCGATGA TATTCCCGGC ATTACTATCT ATCGCTCGTG GGATGTACCG
TGGATCAAAC GTGCCGTAGT CGGTTCTTCG CGTCATAAAC TGTACCTTGA TGTCGGATTA
GGCTGGCGTG CCCTCCGTAC TGCCCTTGCA CTCCGCCCCG ACATTATCCA CGCCCATACG
CACGAAGCCG CAGCCATCGG GCTGCCGTTA CAGAAGCTGC TACGCCGACC GCTTATCCTC
GATTATCAGG GCAGTATGAC GAGCGAAATG CTCGACCACG GCTTTATTCG GCAGACGAAC
CCGCTCTTTC TGCCACTGAC GCGGCTCGAA CGAATGCTGA ATCGTTCTGC CGATGCAGTG
ATGACTTCTA CGCACAACGC TGCGAATCTG TTGCGGCGCG ATGGGTCGGT GCCGGAAGAC
CGCCTTTTTA CAATAACCGA TGGGGTGGAT ACCGAACGAT TCCGGCCCTA CGACGGTTCG
CCGGCTTGGG CTGCACAGCG CGCTGAATTA CGCGCACAAC TCGGCATCCC GCCTGACCGG
CGGATCGTTG TCTATATCGG CTTGCTCGCG CCCTATCAGG GTACCAACTT GTTGCTCGAA
GTTGCCCGGC ATTTGTGTCA AAAATACGAT GATCTTCATT TTCTGATTAT GGGCCATCCT
GACCCACAAA GCTACCGTAA CCTTGCGGCG AGCTTGGGCA TTGCCGACCG GGTCACGTTG
CCGGGGCGAA TCATCTACCG CGATCTGCAT AGTTATTTAG CGTTGGGCGA AGTAGCCGTC
GCCCCAAAGA TGAGTCTGAC TGAAGGGAAC GGCAAAATCG GTAACTACAT GGCGATGGGG
CTACCGACAG TAGCATTTGA CACGCCGGTC AATCGTGAGA TTCTAGGGCC GTATGGGTTC
TACGCCAAAC GCGGCAGTGC CGAAGATTTT GCGGCGCAGT TGGAGCTGGC CCTTACCAAT
CGCGAGCTAG CCGCCGAGCG TGCCGCCGGA GCGCGCGCAC GAGCGGTGGC TGAACTATCG
TGGGAACGGG CAGCGATTGC AATTGAGGCA ATCTATGCGG CAGCGATTGC TCGCCGCCGC
CAAAGTCTGA CCGACGAAGA GAACGTAGAG GCTTAA
 
Protein sequence
MPYTILMIAS TSFFSDYGNH VRILEEIRAL QRRGHRIVLV TYHNGDDIPG ITIYRSWDVP 
WIKRAVVGSS RHKLYLDVGL GWRALRTALA LRPDIIHAHT HEAAAIGLPL QKLLRRPLIL
DYQGSMTSEM LDHGFIRQTN PLFLPLTRLE RMLNRSADAV MTSTHNAANL LRRDGSVPED
RLFTITDGVD TERFRPYDGS PAWAAQRAEL RAQLGIPPDR RIVVYIGLLA PYQGTNLLLE
VARHLCQKYD DLHFLIMGHP DPQSYRNLAA SLGIADRVTL PGRIIYRDLH SYLALGEVAV
APKMSLTEGN GKIGNYMAMG LPTVAFDTPV NREILGPYGF YAKRGSAEDF AAQLELALTN
RELAAERAAG ARARAVAELS WERAAIAIEA IYAAAIARRR QSLTDEENVE A