Gene Cagg_0086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0086 
Symbol 
ID7266824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp120910 
End bp122952 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content57% 
IMG OID643564959 
ProductNHL repeat-containing protein 
Protein accessionYP_002461475 
Protein GI219847042 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTGA TGCGTACCCG CAAACGAAGA GGGTTTTTGA TCGGCGCGAT CATCGGGTTG 
GCGCTAGGAT TGGCGTTGCT CACGCCATCG GCGCGGGCCG ACACGCCCTA CGTGACGTGG
ACTCCCGGTC CCGGCGGTAA GCTGTATATG ACCCAAGACG CCTACATCCC GACGGATGAG
ATTAACTTAC CGGTGAACGC ACCGGAAGAC CTCTTTGTGA CGCCAGATGG CGTGATCTAT
CTTGCCGATA CCGGCAACGG GCGGGTGGTG CGGCTCGACG CGACGCTGGC CGTGGCGGCG
GAATATGGCA AAGGGGTGCT GAACAAGCCC ACCGGCGTGT TTGTCGATGA TGAAGGGACG
GTCTACGTCG CCGATGCCGG CCTCAATCAG ATCGTGGTCT TTGCCGCCGA CGGCACGTTG
CGCCATCAAT TTGGCCGGCC CAGCGAACCG CTCTTCGGCA AGAACCGCAC CTTCTTGCCG
CGTAAAGTGG CAGTTGACCG GCGCAAAAAT CTCTATGTGA TTAGCGAAGG CTCGGTGCAA
GGGGTCATCC AGCTCAACCC CGACGGACGC TTCATCGGCA ACTTCGCCGC CAACACTGCC
CAGATGTCGC TGCGCATGAT CTTGCAGCGC ATGTTTCTGA GCGAAGAGCA ACTGGCTCAA
CTGGTACGCA ATGAAGCTGC CTCACCTTCC AACCTTGCCA TTGATCGCCA ATCAATGCTC
TACACTCTCA CTGCCAGCAC CTTCGCCGAT CAAAGTATTC GCAAATTTAC GGTGGCAGGG
AGGAATATCT TCCCAACCAT CTACGGATCG ACCTCGTTCC GCGACATCTA TGTCGATACC
GAAGGGTTGC TGGTGGTGGT TGACGGTGAA GGGCGTATCT TCGAGTATGA CCAAAACGGC
ACCCTGCTGT TTATGTTCAA TGCCCATGAC AACGGCGACC AACGCCGCGG TACACTCATT
AATCCGACCG GTATTGCCCG TTACGGCGAT ACAATCTATG TGCTTGACAA AGAGAAGAAT
GCTCTGATCC GTTACCAAAC GACTGCCTTC GCCGAGACCG TGCATCGCGC GATGCGGCTC
TATTTGGCCG GCTTCTACCG TGAAGCGAAA CCCTACTTTG AACAAGTGCT CAACTACAAC
GGCTCGTTCA TCATGGCCTA TCAGGGGTTG GCCGACGCCT ATTTCAAAGC GGGAGACTAC
CCTGCTGCGC TGGCCGCTTA CCGCTACGCC GAAGACCGGA ACGGCTATTC GGAAGCCTTC
TGGGAGCTGC GCAATGCGGT GTTGCAGCAG TACCTCGGCC CGTTCATCGT GGCGGTGACG
CTGGCCGCAC TTGGGCAGAG CGTCTTCCGC CGGTTCGAGC GCCGGCACGG CTGGCTAACG
CCATTGCGTG ACGGGATGCG CCGGCTGCGT CGGTACCGGC TGGTCGATGA CGCGATCTTC
TTGCTGCGCT TCGTGCGCCA CCCGGTTGAT AGCTTTTACT ACATCAAGGC CAGCCAGCGC
GGCAGCCTGC GCTTTGCCCT GCTGATCTAC CTGTGGGTGA TCGCAGTGCA TGTATCCTCG
CTCTACCTGA TCGGCTTCCC ATTCAACCCG TATGCCTATC CCGGCCAGAT TCGGGTCGAG
AACGAGATCG CGCTGTTGGT GGCGCTGTTT GGCTTGTGGA ACGCCGCTAA CTATCTGGTT
TCAACCATCA GCGACGGCGA AGGGCGGGTG CGCGATGTCG TGATCGGCAG CGCCTACAGT
CTCGTTCCCT ATGCACTGTT CATGCCGGTG GTGATTGCGG CCTCGAATGT GCTGACACTG
AACGAAGTCT TTTTGGTCAG CTTTTCGCAG CAGGTCATCT TGGCCTGGAC TGGATTGATG
CTTTTCATTA TGGTGCGGGA AATTCATAAC TACACCATTT CCGAAACGAC GAGCAATGTG
CTCAAGACGC TGTTCACGAT GGCGATGCTG GCGTTGACCG CCTACATCTT GTCCTTACTG
TTTGGGCAAC TGTTCGACTT TGTGAGCGCG GTCTGGCAAG AGATCGGGTT ACGTGGTCTG
TAA
 
Protein sequence
MTLMRTRKRR GFLIGAIIGL ALGLALLTPS ARADTPYVTW TPGPGGKLYM TQDAYIPTDE 
INLPVNAPED LFVTPDGVIY LADTGNGRVV RLDATLAVAA EYGKGVLNKP TGVFVDDEGT
VYVADAGLNQ IVVFAADGTL RHQFGRPSEP LFGKNRTFLP RKVAVDRRKN LYVISEGSVQ
GVIQLNPDGR FIGNFAANTA QMSLRMILQR MFLSEEQLAQ LVRNEAASPS NLAIDRQSML
YTLTASTFAD QSIRKFTVAG RNIFPTIYGS TSFRDIYVDT EGLLVVVDGE GRIFEYDQNG
TLLFMFNAHD NGDQRRGTLI NPTGIARYGD TIYVLDKEKN ALIRYQTTAF AETVHRAMRL
YLAGFYREAK PYFEQVLNYN GSFIMAYQGL ADAYFKAGDY PAALAAYRYA EDRNGYSEAF
WELRNAVLQQ YLGPFIVAVT LAALGQSVFR RFERRHGWLT PLRDGMRRLR RYRLVDDAIF
LLRFVRHPVD SFYYIKASQR GSLRFALLIY LWVIAVHVSS LYLIGFPFNP YAYPGQIRVE
NEIALLVALF GLWNAANYLV STISDGEGRV RDVVIGSAYS LVPYALFMPV VIAASNVLTL
NEVFLVSFSQ QVILAWTGLM LFIMVREIHN YTISETTSNV LKTLFTMAML ALTAYILSLL
FGQLFDFVSA VWQEIGLRGL