Gene Cagg_1781 details

Gene Information

Locus tag: Cagg_1781
Symbol:
ID: 7267693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2187529
End bp: 2188509
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 61%
IMG OID: 643566622
Product: glycosyl transferase family 2
Protein accession: YP_002463117
Protein GI: 219848684
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
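
The start, end, gene length, and protein length values above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2187529
end_bp = 2188509
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # span matches the listed gene length
print(span_bp % 3 == 0)                              # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # matches the listed protein length
```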


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.456914
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.601286
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAAC ACACCGTTTC GATTATCTGC ACCGTGCGTG ACGAAGCCGA TAACATTGCC 
GCGCTGCTTG ATTCAATGTT AATGCAGACC CGCACCGCCG ACGAGATCGT GATTAACGAT
TGCCAGAGTG TTGATGAGAC ACCGGCCATC GTTGCCGCCT ACGCCGCGCG CTACCCGCAG
ATCAAGCTGG TGCGCGGCGG GCACAATATT TCGTCGGGCC GCAATAATGC CATTCGCCAC
GCGCGCGGCC CGCTCATCGC CAGTACCGAT GCCGGTCTGA TCCTCGATCC GCACTGGCTC
GCCCGCATTA TCGCCCCGCT CGAAACCGGC GATGCCGATC TGGTTGGCGG CTTCTTCCAT
CCCACACCGC GCTCACTGTT TGCGCTGGCG CTGGGCGAAA CCAACTATCG TCGCAGCAGC
GAGATCGATC CGCTCGCATT CTTACCCTTC GGTAAATCAA TGGCCTTTCG CAAAGAGGTG
TGGGAAGCAG TAGGCGGCTT CCCGGAATGG GCCAGCCACT GCGAAGACTT GCTCTTCGAT
CTGGCCGTTG AGCGAGCCGG CTTTCGCCGC GTTTTTGTCC CAGAAGCGGT GGTACACTTT
GCACCGCGTT CCACCCTCCG AGCCTTTATT CGCCAGTATT ACCTCTACGC TCGCGGAGAT
GGTCGGGCCG GGTTGTGGTC ACAACGTCAC GCGCTGCGGT ACGCCGTCTA TCTGACGCTC
AGTGGGCTAA TGGGGATTGC CCTCAACCAA CCGCGCCTAC GAGCACCGAT TGGAGCGTTG
ATCGGGCTAG GGGTCGCTGC GTATACCCGT GGTCCTTATC GCCGACTTTG GCCGAAACTC
CGCGGCCGAC CACTCGGTGA ACGGCTCTTC GCGCTGGCAT TGGTCCCCCT GATCCGGCTG
GTCGGCGACG TGGCCAAGAT GGTTGGCTAT CCGGTTGGTT TGTGGCGACG GCTTCAGCAT
AACGGGAGGG CCGCAGGATA A
 
Protein sequence
MQQHTVSIIC TVRDEADNIA ALLDSMLMQT RTADEIVIND CQSVDETPAI VAAYAARYPQ 
IKLVRGGHNI SSGRNNAIRH ARGPLIASTD AGLILDPHWL ARIIAPLETG DADLVGGFFH
PTPRSLFALA LGETNYRRSS EIDPLAFLPF GKSMAFRKEV WEAVGGFPEW ASHCEDLLFD
LAVERAGFRR VFVPEAVVHF APRSTLRAFI RQYYLYARGD GRAGLWSQRH ALRYAVYLTL
SGLMGIALNQ PRLRAPIGAL IGLGVAAYTR GPYRRLWPKL RGRPLGERLF ALALVPLIRL
VGDVAKMVGY PVGLWRRLQH NGRAAG
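
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that without external libraries. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which start codons they allow), and it assumes the sequence is given on the coding strand. The helper names `clean`, `translate`, and `gc_content` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above; both happen to start in frame.
first_line = "ATGCAACAAC ACACCGTTTC GATTATCTGC ACCGTGCGTG ACGAAGCCGA TAACATTGCC"
last_line = "AACGGGAGGG CCGCAGGATA A"

print(translate(first_line))  # MQQHTVSIICTVRDEADNIA
print(translate(last_line))   # NGRAAG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to it) checks the whole record the same way, and `gc_content` on the full gene sequence can be compared against the listed GC content.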