Gene Cagg_0880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0880 
Symbol 
ID7268333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1100759 
End bp1101889 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content59% 
IMG OID643565728 
Productglycosyl transferase group 1 
Protein accessionYP_002462235 
Protein GI219847802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0960108 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATTG CGCTTTACAA TCTCACTACC ACCACCAAAA TCGGTGGCGT CGAGAGTTTT 
GTATGGGATT TGGCGCGCGA GTTGGTGAAA CGCGGCCATG CGGTAGATAT TCTGGGCGGG
GTGGGAACGC GGCGCGAGAC TACCGGGGCG CGCGTCTTCA CGTTCCCCTT CATCTCACGC
CGGTTTTGGC AGGCGCTACC GCCGCTACGT CGGGCGTATG CTGAAGCGAA ACTGCTCGAA
CGGCTAACGA TGGCGATTGC AGCCTTACCG ACACTGGTGC GCGGCAACTA CCAGATCATT
CACTTACAAA AACCCTATGA TCTCTTGCCG GCACTTCTCG CCCGCCGATT GAGTGGGGCC
AAGGTTATTC TCGGCTGTCA TGGCGAGGAT TTCTACCGTG GTGACCGCTG GCTAGCGCGA
CATGTTGACG CCGCTGTCTC GTGCTCGCGC TTTAATGCGC AGACTATCGC CGGACGCTAC
GGGTTCACAC CGGAAGTTGT CTTCAACGGG ATCGACACTT CTCTCTTTCG TCCGCAGCCG
CCCGATCCGA CACGGCGGAC ACGGTGGGGA TTACCGACCG ATCGGCCATT GCTGCTCTTT
GTTGGTCGTT TGCAACCGTG GAAAGGCGTA GAGACGGCCA TTCGCGCATT ATCGTACATT
CCGCATACCC ACCTCATTAT CGCCGGCGAC GGTGAAGATC GCGAGCGACT AGCGACAATC
GCTACCGAAC TCGGCTTACA CGAGCGTGTA ACCTTTTTGG GTAGTGTCCC GCGTCAACAG
TTACCGGACC TGTACGCAGC AGTTGACATA TTAGTAGCCA CTAGCTACGC GAGCGAAACC
TTTGGCATTG GACCGGTTGA GGCTCAAGCC TGTGGCTTAC CGGTCGTTGC CAGTCGGTTT
GGCGGGTTTC CCGAAGTTGT TGCCGACGGT CATACCGGCT TATTAGTCCC GCCGCGCGAC
CCGCCGGCGT TAGCCGAAGC CATAAATACC TTGCTGCGCG ATCCGGATCG TCGGGCAGCC
ATGGCCGCAG CGGCCCCGGC GTGGGCAGCG CAATTTGCGT GGCCGGCGGT GGTGGATCGG
ATTGAGGCCG TGTATCGGGC AGTGGTGGGG GAGACCGTAA GGGCGCGGTA G
 
Protein sequence
MRIALYNLTT TTKIGGVESF VWDLARELVK RGHAVDILGG VGTRRETTGA RVFTFPFISR 
RFWQALPPLR RAYAEAKLLE RLTMAIAALP TLVRGNYQII HLQKPYDLLP ALLARRLSGA
KVILGCHGED FYRGDRWLAR HVDAAVSCSR FNAQTIAGRY GFTPEVVFNG IDTSLFRPQP
PDPTRRTRWG LPTDRPLLLF VGRLQPWKGV ETAIRALSYI PHTHLIIAGD GEDRERLATI
ATELGLHERV TFLGSVPRQQ LPDLYAAVDI LVATSYASET FGIGPVEAQA CGLPVVASRF
GGFPEVVADG HTGLLVPPRD PPALAEAINT LLRDPDRRAA MAAAAPAWAA QFAWPAVVDR
IEAVYRAVVG ETVRAR