Gene Cagg_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0139 
Symbol 
ID7266878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp186235 
End bp187389 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content58% 
IMG OID643565011 
Productglycosyl transferase group 1 
Protein accessionYP_002461526 
Protein GI219847093 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00161479 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTATCG GTGTCGACTT TACAGCAGGT ATTTGGCAAG GAGCAGGGAT TGGTCGCTAC 
ACGCGAGAAC TGGTGCGTGC TGCTGCCCAA GCCGGGCCTG ATCTGACGTT TCACCTCTTC
TACGCGGCCG GTGGGATCAG GCCAAATAAT CCCTTTGCCC ATTATGCACA GGAACTGGCG
GCTACCTATC CAAATGTCAC GTTACGGCCG CTACCGATCA GTCCGCGCCT GCTCACGATC
ATCTGGCAAC GGCTACGATT GCCACTGCGG ATCGAGTGGT TTATCGGACC GATGGATGTG
GTGCATGCAC CGGATTTTGT CTTACCGCCG ACACAGTCGC GCACCCTGCT GACTATTCAC
GACCTCACGT TTCTGGTCGA ACCGGGTTGT GCCGAACCCG GCTTGCGACG CTATTTGAGT
GAGGCAGTAC CCCGTTCACT CCGACGGGCC GATCTCATTG TTGTCGACTC ACAGTCTACG
GCGAACGATT TGGGGCGGCT CTATGGGATA CCGAGTCGGC GTGTACGTCT GCTCTATCCG
GCCGTGGATG CACGTTTTCG ACCATTACCG CCGGACGAAC TCGCCACGGT GCGCACAAGG
CTAGCGCTAC CGGATCGATT CCTGCTCTTT GTCGGAACGC TTGAACCGCG TAAAAACCTT
GTCCGCCTGT TACATGCCTT TTCCCTGGTA CAATCTGACT ATCCCGACTT GCAGCTCGTT
ATTGCCGGGC GACGTGGTTG GTTGTACGAC GAGATTTTTG CTGCGGTAAC GCAGTATCAG
GTGGCCGACC GAGTACGTTT CCTCGATTTT GTTGCCGACG ACGATCTACC GGCATTGTAT
AATTTAGCCG AAGCCTTCGT TTACCCATCG TTGTACGAAG GGTTTGGCTT TCCGGTACTC
GAGGCGCTCG CCTGCGGAAC GCCGGTTGTC ACGACTAAAG TGGCGAGCTT ACCAGAAGTG
GCCGGATCGG CCGCCATTAT GGTCGATCCG CTAGAGGTCG AAGATATTGC TGCTGGTATC
CACGCTGCGC TCGCCGATCC GGCGCCGCTC CGCGCTGCCG GGCCACCGCA GGCTGCGACC
TTTCGCTGGG AACAGACCGG ACAGGCGCTG GTGGCAATCT ACCGTGAGCT CGCGGCAAAA
GCCGCTGCCA CTTGA
 
Protein sequence
MRIGVDFTAG IWQGAGIGRY TRELVRAAAQ AGPDLTFHLF YAAGGIRPNN PFAHYAQELA 
ATYPNVTLRP LPISPRLLTI IWQRLRLPLR IEWFIGPMDV VHAPDFVLPP TQSRTLLTIH
DLTFLVEPGC AEPGLRRYLS EAVPRSLRRA DLIVVDSQST ANDLGRLYGI PSRRVRLLYP
AVDARFRPLP PDELATVRTR LALPDRFLLF VGTLEPRKNL VRLLHAFSLV QSDYPDLQLV
IAGRRGWLYD EIFAAVTQYQ VADRVRFLDF VADDDLPALY NLAEAFVYPS LYEGFGFPVL
EALACGTPVV TTKVASLPEV AGSAAIMVDP LEVEDIAAGI HAALADPAPL RAAGPPQAAT
FRWEQTGQAL VAIYRELAAK AAAT