Gene Cagg_2079 details

Gene Information

Locus tag: Cagg_2079
Symbol:
ID: 7266980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2542866
End bp: 2544071
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 58%
IMG OID: 643566914
Product: glycosyl transferase family 2
Protein accession: YP_002463403
Protein GI: 219848970
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
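
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2542866, 2544071)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Both values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields of the record.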


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.959735
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGCGC TCGTCTGGTT GGTGCTGACG GTCGTGTACT GGGTTGCCGG CATTGTGCTG 
GCGTTGATCG TGGGCTATCT GCTGTTACTG ACGGGAGCTG CGTTGTTTGC CCGCCGCACA
ACACCGCTGC GCGCGCAACC GACTACGCGC TTCGTGATTA TGATTCCGGC GCATAATGAA
GAGCGCCTGT TGCCCGATCT GCTAACCAAT CTCAACCAAC TCGATTATCC ACGTGACCTG
TACAGCATTC ACGTTGTTGC CGACAACTGC ACCGACCGCA CGGCTGCTGT GGCGATGGCC
CATGGTGCGA TTGCCTATGA GCGATTTGAC CAGACGCTGC GTGGGAAGGG ATACGCGCTC
GAATGGCTGT TACAGCAGAT TTGGGCACGC AACGAACCGC ACGACGCCGT TGTTATTCTT
GATGCCGACT CGGTTGTCTC ACCGACCTTT CTGCGCGTGA TGGATGCTCG CCTTGCGCGG
GGCGAGCGGG TGATACAGGC CTATTACGCG GTACGTCAGC CGGAAGGGGC GTGGAGTGCG
GGGATACGGG CGGTGGCGTT GATCGTCCTT CACTACCTGC GTCCGCTAGG GCGCATGGTT
TTGGGTGGTT CGACCGGTTT GAAGGGCAAT GGCATGGTCT TTGCCGCCGA TATTTTGCGG
CGCTACCGCT GGACGGCATC ACTCACCGAG GACATTGAAT ATCACATGAC CCTGATTCTT
GCCGGTGAGC GCGCAATGTT TGCACCTGAT GCAGTGGTAT GGGCCGAGAT GCCCGATAGT
CTCCGGGCGG CCCAGAGCCA AAATGAGCGA TGGGAAAGGG GCCGGCTGGA GATGGTGCGT
CGGTATGTAC CGCAATTGCT GCGCGAGGGA TTGCGCCGAC GCAGCTTTTT GCTGATCGAT
GCAGCGATTG AGCAACTGAT CCCGCCATTT TCGGTGGTCA CCGGTATGAG TATTCTGGTG
GCGTTGGTAG CGATCGTACT ACGCGAACCG GCAGCACTGG CACTGGCCGG TTTCATCATT
GGTGGGCAAG TAGTATATGT CCTCAGTGGG TTGCTGCTAG TACGTGCGCC GTGGTCGATC
TACCGGTCGT TGTTGTTTAC CCCCTTCTTT TTAGGGTGGA AGCTCTGGCT CTACATTCGC
TTGTTACTCG GCGTTAAACC GCGCGATTGG ATTCGCACGG CTCGTAATCG GGCGCAACGT
CCATAG
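
The gene sequence is printed in blocks of 10 bases, six blocks per line, so the whitespace has to be removed before the sequence is used. A minimal sketch of that cleanup, plus a GC-content calculation that can be compared against the GC content field of the record, is shown here. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks and the final line of the listing, joined into one string.
fragment = clean_sequence("ATGATAGCGC TCGTCTGGTT\nCCATAG")
print(fragment)
print(len(fragment))
```

Passing the full listing through `clean_sequence` should give a string of 1206 bases if the record is internally consistent, and `gc_percent` of that string can be compared with the reported 58%.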
 
Protein sequence
MIALVWLVLT VVYWVAGIVL ALIVGYLLLL TGAALFARRT TPLRAQPTTR FVIMIPAHNE 
ERLLPDLLTN LNQLDYPRDL YSIHVVADNC TDRTAAVAMA HGAIAYERFD QTLRGKGYAL
EWLLQQIWAR NEPHDAVVIL DADSVVSPTF LRVMDARLAR GERVIQAYYA VRQPEGAWSA
GIRAVALIVL HYLRPLGRMV LGGSTGLKGN GMVFAADILR RYRWTASLTE DIEYHMTLIL
AGERAMFAPD AVVWAEMPDS LRAAQSQNER WERGRLEMVR RYVPQLLREG LRRRSFLLID
AAIEQLIPPF SVVTGMSILV ALVAIVLREP AALALAGFII GGQVVYVLSG LLLVRAPWSI
YRSLLFTPFF LGWKLWLYIR LLLGVKPRDW IRTARNRAQR P
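
The protein sequence can be reproduced from the gene sequence by codon translation. The record specifies translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in which start codons are permitted). The sketch below builds that codon table and translates the first seven codons and the last three codons of the gene; the `translate` function is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATAGCGC TCGTCTGGTT G"))  # MIALVWL
print(translate("CGTCCATAG"))                # RP
```

The two outputs match the start (MIALVWL) and end (RP) of the protein sequence listed above.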