Gene Cagg_2959 details


Gene Information

Locus tag: Cagg_2959
Symbol:
ID: 7268832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3628477
End bp: 3629673
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 59%
IMG OID: 643567781
Product: glycosyl transferase group 1
Protein accession: YP_002464255
Protein GI: 219849822
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
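
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3628477
end_bp = 3629673
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span // 3
expected_protein = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein == protein_length_aa)
```

Under these assumptions the three fields agree: 1197 bp is 399 codons, or 398 residues plus a stop codon.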


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000811696
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGCATTC TCTATTTAGC CTCTGGCATT CCGGTACCGG GGACATTGGG CGGCAGCATC 
CACACCTTGG AAGTTGCGCG TGGGCTGGCT CAACGCGGTC ATGACGTTCA CGTGGTCGCA
GCGAGCCGTG AGCTGCCGCT CAGTTATGTT CGGCTCCGGC CAATGCGTCA ACTCACCGCG
CAGTCGTGGA ACGGTTTTAC CCTCTACCAC CAAGACATTC CCAAAGCCCT TAGCCTACTG
GGAACGGCTG CAATTATCAA GCTCACCCGG CAACTCCGGC CCGATCTGAT AATGGAGCGC
TACTACAACT TTGCCGGCGC CGGCCTCATC GCCGCCCGTC GACTCGGCAT CCCGACCCTG
TTGGAAGTAA ACGCCTTGAT CGTTGATCCA CCGGAGATTC TCAAACGACG GATCGATGAC
GCGCTCGGTG GGCCATTTCG ACGCTGGGCA GAACAACAGT GTCGTTGGGC GAGTCGAATT
GTGACGCCGC TGCATACGAC GGTTCCGGCA GGCATTCCGC GCGACAAGAT CATCGAGCTA
CCTTGGGGAG CGAATGTAGA GACCTTCACC CCACCACCTA CCCCACCGCC CGGACCGCCC
AAGGTGATCT TTATGGGTTC CTTCCGCGCA TGGCATGGAG TGAGCGATTT TGTCTACGCG
GCCCGCTTAC TTATCGAGCG GGGGCACCCC GCTCACTTCG TGCTCATCGG TGATGGACCT
GAACGGGCCG CTGCCGAATC CTTAGCTGCA CCCTACCGGG ATCGGTTCAC TTTTACCGGC
GCAGTACCAC ACCAACAGAT TCCTACCTTG CTCGGCCAAG GCCATCTGGG TGTGGCACCC
TTCAACACCG CGCCCCATCC GGCCCTACGC GCCGCCGGCT TTTTCTGGTC ACCCCTCAAA
ATCTACGAAT ACATGGCCGC CGGTCTGCCG GTCGTTACTG CCGCGATCCC TCCGCTCACC
ACGATTATTC GTGAGGGAAT TGAAGGGGCA CTCTTTCGCG AAGGTGATGT ACATGACCTG
GCAGCGGCGA TTGAACGGGT CTTAGTCAAC CCTGCGGCTG CCTTTGCAAT GGGGCAACGT
GCCCGCGCGC GCGTCGTCGC CGAGTTTTCG TGGCAACGAC ATTGTGCCGA GCTAGAGCAC
ATTGGAGAAT CTTTGATCAA AACAAACTCT CACCACATTT TAAGAAAAAT CCAATAA
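
The GC content field in the Gene Information section can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases in the gene sequence; the function name `gc_content` is hypothetical, and the record does not state how its own figure was derived or rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this record)
    is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy usage example; paste the full gene sequence in place of this string.
print(gc_content("GGCC AATT"))
```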
 
Protein sequence
MRILYLASGI PVPGTLGGSI HTLEVARGLA QRGHDVHVVA ASRELPLSYV RLRPMRQLTA 
QSWNGFTLYH QDIPKALSLL GTAAIIKLTR QLRPDLIMER YYNFAGAGLI AARRLGIPTL
LEVNALIVDP PEILKRRIDD ALGGPFRRWA EQQCRWASRI VTPLHTTVPA GIPRDKIIEL
PWGANVETFT PPPTPPPGPP KVIFMGSFRA WHGVSDFVYA ARLLIERGHP AHFVLIGDGP
ERAAAESLAA PYRDRFTFTG AVPHQQIPTL LGQGHLGVAP FNTAPHPALR AAGFFWSPLK
IYEYMAAGLP VVTAAIPPLT TIIREGIEGA LFREGDVHDL AAAIERVLVN PAAAFAMGQR
ARARVVAEFS WQRHCAELEH IGESLIKTNS HHILRKIQ