Gene Cagg_0885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0885 
Symbol 
ID7268338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1109352 
End bp1112516 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content60% 
IMG OID643565733 
Producthypothetical protein 
Protein accessionYP_002462240 
Protein GI219847807 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00133597 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0493729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAGA TTATTATTGG CACGTTACTA CTTCCATTGA TTATCTACGC ACCGGGATGG 
GCGTTGACAC GGGTGATCCC GGAGCTAGAG ACCAACGACG GCCTTGAACG CCATTTCGAG
CGCTGCTTGA TCGGCGCGCT CTGGAGCGGT TGGCTAGCGT TGGTGTTGGC CAGTTTCGGC
ATTTTTTCAA TTTGGCTGCA CCTTGGCATT ACGTTGGCGG TGAGCGCCGG CTTGATCTGG
CTGGGACAAC GGCATTCGCC GCCGCGCACT GCCGTGCCAC GCACTGCCGT GCGCGGGTAC
GCCGTTACGT TGTTGGTGCT GGCGCTGTTG GTTGCCCGGC CCTTCGAGGT TATTCTCGGT
GTGCGCGACG CCGGCGTCTA TGCGGTCACC GGGTTTGCCA TCGCCCGCAC CGGCAGCATC
GTCCAAACCG ATGCGGTCGT CGCCGAGCTA GGTCAAGCCG CGCAAAGCAG TGATCCGGCA
GTGCGTGAGC CGGCTGAGCA GGCGATCAGT AATCTGATGA TTGGGCAGGC CCGTGATCGC
TATATCGCGA CCCGCTTGCG AGCAGCCGGT TTTTTGATTA ACGAAGGTGA ACTTGCGCAG
GGGCGGATTG TTCCCCAAGG GTTCCACCTC TTCCCGGCAT GGATCGGATT GTTGACCGCT
GCCGGTGGGC CGCTGTTCGG CCTGTTTGCT ACCGGTCTGC TCGGTTTGCT TGGGGTGTGG
AGCGTCGGCA TGATCGGTCG CCGCCTTGCC GGACCATGGG TGGGGTGGCT GGGGATGGTC
TTGCTCGGCT TAAATGCGGT ACAGGTCTGG TTTTCCCGTT ATTCAACCGC CGAAACTACC
GCCCAGTTTC TGACGTGGGG TGGTCTGTAT CTGTTTGCAA AGTTCAACGA TCATGGGTTG
CCGGCCCGCG CACGTATCGC GTATGCGGCG CTGGCCGGAA TTGCGTTCGG GCAGGTAGCA
CTCGCCCGTA TCGACTTTTT CTTGCTGGCA CCGGCAATTC TCTACCTCGG CTATTGTTGG
CTCACCCGAC GCTGGCAGCA CGTCCAAACG GCGCTGGCGC TGGGAATGTT GGTGATGCTC
GTTCATGCCG CGCTCCATAT CATCTTTATT GCCCGCGCCT ATTTCTTTGA TACCGGCTTT
GCCCGTTTTC AAGATTTCGC ACTGACCTCA CTGATTGCGT TGCCCTTCCT CACGCCTGCC
GTGCGCGAGT CGTATTTCAC CTCGAAGTTT AGCTCGCTCG GTGATCCGGG ACGGATTTGG
ATTGAATTGG CCCTGATCGG GTTGATCATC GTGGCGCTGA TCATGATCCG CCGGAGTGGC
CGGCTGTTGC ATATCGAGCG TCAGATGGTG TCCCGCCGGC AGAGCTGGCA AAACGCGATC
GCCCTCGGTC TGGTGCTGCT GGCAGGTTGG GCATATTTTG TACGTCCGCA GATCATCGAT
GCCGATCTGC TCTTTAATAC CCGCGGCGGC TGGAACGATC CGTTGACCCG CGATCCGAAC
CTCGTGGCAG GTGATGTGCG CAGCGGTGTG ATGACTCCCA CCGAGGCCCG TTTGCAAGCC
GGTGTGGTGC TGACCGGACG ACCATGGGAA GCCGAACCCG ATCTGGCTGC GACCGAGGCG
TTGCGCACCG AGCTGGCCGC CACACGTGGG CCGTGGCAAG GTCCGTTCTC GAACCAAACG
CTCAATTGGC TCCGGATTCA AGGTTATGTC GGCGCACCAA TTCGGCTGCC GCTGGTGCTC
TATTATCAGG AATACAACGG CATGAGTTGG TGGCAACGTA TGCTGGCCGA TCCCAGTACC
TTTACCAGCG AACCGGCGCC TGTCCAACCC AAAGAATTGA TCCCGTTGGC CGGTTTGGTC
CGAGTTGGCT GGTATCTATC ACCACTCGGC GTGGTGTTGG CCGTGATCGG GTTTGCTCTC
TGGTGGCGGC GCGGACTGAG CGCGGCCAGT TGGCTGATGC TCACCGTCGG GTTTCTGGGC
AGCTTCTTTT ATCTTCGCCA GACCTACGGC ACGTCTGAGC AGACGTACAT CTATATTTTG
CGCCGGTTTG TGCCGATCGC TTACCCGATC TTCGCTCTCA GTGCAGCCTA TGCACTTGCC
GCCTTGGCCG GAGCGTGGCA ATTCCGGCCT CACGCCGCCC GCTGGCGCCA AGGGTTGGCC
GCCGCTCTCG CCGGACTGTT GATCCTGTTC CTCGGCTGGA CGGGACGGTA CTATTTTGTC
CATACCGAAT ATGCCGGCGC GTTAGCCCAA GTCGAAGCGA TTGCGAAACA CTTTACGCCT
GACCGCGATA TTGTGCTCTT ACGCGGTGGC GCGCCGATCT ACAGCGATGC CCGTGATATT
CCAGATTTGT TGGCAACACC GTTGCGTTTT GCCTACGACA TCAATGCCTT TACGGTCAAG
AGTGTGTCTA CCGCGCCATA CGCTACGCTA CTGGCCGAAC AGGTCAAGCG CTGGCAAGCT
GCTGGACGCA CTGTCTATCT GATGCTGAGC GCGAGCGGCG GTAATCTCGC GCTACCTGGA
TTTCATCTCA CCTTCATCAC CGAAGTTGCG CTCGATCTGG CCGAATTCGA GCAGTTGACC
GATCAGAAGC CGCAGAATGT TTCCCGCCTG ACTCTGCCCT TTGCCATCTA CCGGCTCGAT
CCGGTCGAGT CGCCGGTGGT CGATACCGCC CCCCCACCGC TCACTCCCAC TTCGTTCGCC
GCGCAGGTCA GTGGGTTTTA CCGCCCCGAA CAGAGTAAAG ACGGCTGGCA GTACAGTTGG
AGCAATGGCG AGGCGGTTCT CCGGCTGCCT TGGCCGGCAG ACGCAGTGCA ACAGACGGTG
GCGATCGAAG TGGCCGGTGG CCTTCGCCCA GACCATCTCG GCCCGGCCAC CCTCTGCATT
GCAGCCCAGC GCGAAGACAC CTTGTGGCCT ACTACCAGCT CTCCCCTCGT TGAACTCGGT
TGCCATCAGA TCGGGCAAGA ACCGACGCTC GTGCGCGTCA CCCTCGATCC CACCCAACTT
CCCCCGACAA CCTCCGGTGC GCTCCTGCTC CATCTCAGCG GGCCGGCGTG GATTCCGGCC
AACGAAGACC CGCGCCTCAC CGACCGGCGG GTGTTGCACG TGCAAATCGG GCACATATGG
ATACACCACC CGTCGGCGCA CGGCACACCG TCCCCCACGC TGTAG
 
Protein sequence
MTQIIIGTLL LPLIIYAPGW ALTRVIPELE TNDGLERHFE RCLIGALWSG WLALVLASFG 
IFSIWLHLGI TLAVSAGLIW LGQRHSPPRT AVPRTAVRGY AVTLLVLALL VARPFEVILG
VRDAGVYAVT GFAIARTGSI VQTDAVVAEL GQAAQSSDPA VREPAEQAIS NLMIGQARDR
YIATRLRAAG FLINEGELAQ GRIVPQGFHL FPAWIGLLTA AGGPLFGLFA TGLLGLLGVW
SVGMIGRRLA GPWVGWLGMV LLGLNAVQVW FSRYSTAETT AQFLTWGGLY LFAKFNDHGL
PARARIAYAA LAGIAFGQVA LARIDFFLLA PAILYLGYCW LTRRWQHVQT ALALGMLVML
VHAALHIIFI ARAYFFDTGF ARFQDFALTS LIALPFLTPA VRESYFTSKF SSLGDPGRIW
IELALIGLII VALIMIRRSG RLLHIERQMV SRRQSWQNAI ALGLVLLAGW AYFVRPQIID
ADLLFNTRGG WNDPLTRDPN LVAGDVRSGV MTPTEARLQA GVVLTGRPWE AEPDLAATEA
LRTELAATRG PWQGPFSNQT LNWLRIQGYV GAPIRLPLVL YYQEYNGMSW WQRMLADPST
FTSEPAPVQP KELIPLAGLV RVGWYLSPLG VVLAVIGFAL WWRRGLSAAS WLMLTVGFLG
SFFYLRQTYG TSEQTYIYIL RRFVPIAYPI FALSAAYALA ALAGAWQFRP HAARWRQGLA
AALAGLLILF LGWTGRYYFV HTEYAGALAQ VEAIAKHFTP DRDIVLLRGG APIYSDARDI
PDLLATPLRF AYDINAFTVK SVSTAPYATL LAEQVKRWQA AGRTVYLMLS ASGGNLALPG
FHLTFITEVA LDLAEFEQLT DQKPQNVSRL TLPFAIYRLD PVESPVVDTA PPPLTPTSFA
AQVSGFYRPE QSKDGWQYSW SNGEAVLRLP WPADAVQQTV AIEVAGGLRP DHLGPATLCI
AAQREDTLWP TTSSPLVELG CHQIGQEPTL VRVTLDPTQL PPTTSGALLL HLSGPAWIPA
NEDPRLTDRR VLHVQIGHIW IHHPSAHGTP SPTL