Gene Cagg_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0079 
Symbol 
ID7266817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp110523 
End bp112742 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content59% 
IMG OID643564952 
Productglycoside hydrolase family 35 
Protein accessionYP_002461468 
Protein GI219847035 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.579135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC TAACCGTTCA CAACCAACAA TTCTGGCTCG ATGAACGTCC ACTTTTACTG 
CAAGCGGGTG AATTCCACTA CTTCCGCACC CCCGCCGATC AATGGGAACG CCGGCTCAGC
CTGATTGTAC AAGCCGGGTT TAACGCAGTT GCTTGCTACA TCCCGTGGCT CTGGCACCAA
CCGCAACCGG AGCTGGTCGA TCTGGACGGC ACAAGCCATC CTATGCGCGA CCTTGCTGGA
TTCCTCGATC TAGCCCAACG TATGGGTCTC TATGTGATCG CCCGTCCCGG CCCGTACATT
ATGGCCGAAA CGATCAACGA AGGCATCCCG CCGTGGGTTT TTGAACGCCA CCCGCAAATC
GCACTGATTA ACCAGCGCGG CGAAACCGAA AACATCGCCA GCTACATGCA TCCCGATTTT
CTGAGTTGTG TCGCTGAGTG GTACCGCGCT GTCTTTGCCG TGCTGGCATC GCGCCAGATC
ACCCGCGGCG GCCCGATCGT GCTGGTGCAG CTCGATAATG AAATGGGTAT GCTGCACTGG
GTGCGCAACA GTTTCGACCT CAACCCGGTG GCGATGGAGC ATTTCGCTGC GTGGCGAGAG
GCAATGTACG GCGCTGACCA AACCAGCAAT CCGGCGATGC TAGCCGAACG CTTGCGTTAC
GCCGATGGCG CAGAAGGCGC GCAACTAGTG GCCGATTATC GCCGTTTCTA TCGTACCTAT
TTGCGCGACT ACACAAGCTG GATGCTGGCA ACGGCCCGCG CGCACGGTCT TGAGGTGCCG
GCGGTGATCA ATATTCACGG CTTTGCGAAC GGTGGTAAGA CCTTCCCAAT CGGATTATCG
CAATTGGCCG ATGTGTTACG CATGCCCGAC GTGATCAGTG CCATCGATGT CTATCCGAGC
CAGATAGGCG AGGGCACCTT TCACCAACTG GTGCTGGTCA ACGCCATGAC CGCTGCTATC
CAAAACCCCG CTCAACCACT CTTTTCGATT GAATTTCAGG CCGGCGGCAA CCTCGACTTT
AGCAATATGT CGAGTTCTTT CTACGATCTC CATACCCGCC TGTGCCTATC GAACGGTATG
CGCGCCATCA ACCACTACCT CTTTTTCGAT GGCGAAAACG ACCCGCTCCT TAGTCCGGTC
AAGCGGCACG ACTGGGGCCA CCCGGTGCGC AAAGATGGCA CATTGCGCAG CCACTACCAC
CGCTACCCGT TGCTGTCGCG CACCTTGGTC AGCTACGGCG AAGCACTCAC CTTGGCACGA
CCAGAAACGA TAACGGCTAT CGGCTTCCGC TTAGATGATT TTATGACCGA GGTCAACCAA
CCTTGCACCC AAGCAGCAAC CAACATCATC ACCCATCAGC GCGAGGTTAT CCTGTTCGAC
TTCATCGCGC GCGGGTTGGC GCTGAGCCAT CGCGACTTCA CCGCGGTTGA CCTGCAACAG
GCCACGCTCG ATCCGGGCCG CATGCCGCTC ATGTGGGCCA TGCTCGATAC CACCAGCGAC
CCGGCCACCG AGCGCAAGCT AGTGGACTAT GCTCGTGCTG GCGGGAAGGT GGTGATCGTT
GGCCGCCTGC GCGCAGAGGG TGAGCTGGCG CAGGCGCTAG CGGTGCAGAT CACGAGTGAC
CCGCCCTTCT CCCCGCGCCG GGTGCAGGTC TTTGACATTG CTGATATTCC GGCCAGTTTC
GTGCAGACCT ACCACGGCGA ACTCGGCACG GTCTTTGCGA CGGCTGATGG CCTGCCGATA
GGATTCCGCA AACCGATTGG CAACGGTGAA ATCATCGTGC TCGGCGCAGC CTTCCCGATC
ATCGCCCTTG ATGATCTACG CGCCTTCACC GCGCTGGCGG ATTGGGCCGG CTGTCCAGCG
CCATTCACCC TCAGCCACTG GGCCGATGTG CGGCTCAGTC GTGGCCCGAA CGGCGACTTC
CTCTTCATCA ACCACTACGG CGACGACCCG CTGGCAACCG AGATCAGGTA TCGCGGCACG
CAACTCTTCG ATGCCCACCC GATCCATCTG CCGGCCCGCA GCGGAGCGAT CTTACCGCTC
AACTGGCGCA TACGTCCCGA TCTCACCATT CGCTACGCCA CCGCCGAAGT CCGATCAATC
ACCGAGGAAC AAGGCCAGAT CGTCGTCACA TTCGCCCAAC CAAGCGGGCA TGTTTGTGTA
GAAAAGAATG GTATTACTCA AACTATGCAA TTCACCAACG GCGGCGCCAC CCTCCCGTAG
 
Protein sequence
MPKLTVHNQQ FWLDERPLLL QAGEFHYFRT PADQWERRLS LIVQAGFNAV ACYIPWLWHQ 
PQPELVDLDG TSHPMRDLAG FLDLAQRMGL YVIARPGPYI MAETINEGIP PWVFERHPQI
ALINQRGETE NIASYMHPDF LSCVAEWYRA VFAVLASRQI TRGGPIVLVQ LDNEMGMLHW
VRNSFDLNPV AMEHFAAWRE AMYGADQTSN PAMLAERLRY ADGAEGAQLV ADYRRFYRTY
LRDYTSWMLA TARAHGLEVP AVINIHGFAN GGKTFPIGLS QLADVLRMPD VISAIDVYPS
QIGEGTFHQL VLVNAMTAAI QNPAQPLFSI EFQAGGNLDF SNMSSSFYDL HTRLCLSNGM
RAINHYLFFD GENDPLLSPV KRHDWGHPVR KDGTLRSHYH RYPLLSRTLV SYGEALTLAR
PETITAIGFR LDDFMTEVNQ PCTQAATNII THQREVILFD FIARGLALSH RDFTAVDLQQ
ATLDPGRMPL MWAMLDTTSD PATERKLVDY ARAGGKVVIV GRLRAEGELA QALAVQITSD
PPFSPRRVQV FDIADIPASF VQTYHGELGT VFATADGLPI GFRKPIGNGE IIVLGAAFPI
IALDDLRAFT ALADWAGCPA PFTLSHWADV RLSRGPNGDF LFINHYGDDP LATEIRYRGT
QLFDAHPIHL PARSGAILPL NWRIRPDLTI RYATAEVRSI TEEQGQIVVT FAQPSGHVCV
EKNGITQTMQ FTNGGATLP