Gene Information |
Locus tag | Cagg_0086 |
Symbol | |
ID | 7266824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 120910 |
End bp | 122952 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643564959 |
Product | NHL repeat-containing protein |
Protein accession | YP_002461475 |
Protein GI | 219847042 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
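The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 120910
end_bp = 122952
protein_length_aa = 680

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is a stop and is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp, codon_count, expected_protein_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (2043 bp) and Protein Length (680 aa).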
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTGA TGCGTACCCG CAAACGAAGA GGGTTTTTGA TCGGCGCGAT CATCGGGTTG GCGCTAGGAT TGGCGTTGCT CACGCCATCG GCGCGGGCCG ACACGCCCTA CGTGACGTGG ACTCCCGGTC CCGGCGGTAA GCTGTATATG ACCCAAGACG CCTACATCCC GACGGATGAG ATTAACTTAC CGGTGAACGC ACCGGAAGAC CTCTTTGTGA CGCCAGATGG CGTGATCTAT CTTGCCGATA CCGGCAACGG GCGGGTGGTG CGGCTCGACG CGACGCTGGC CGTGGCGGCG GAATATGGCA AAGGGGTGCT GAACAAGCCC ACCGGCGTGT TTGTCGATGA TGAAGGGACG GTCTACGTCG CCGATGCCGG CCTCAATCAG ATCGTGGTCT TTGCCGCCGA CGGCACGTTG CGCCATCAAT TTGGCCGGCC CAGCGAACCG CTCTTCGGCA AGAACCGCAC CTTCTTGCCG CGTAAAGTGG CAGTTGACCG GCGCAAAAAT CTCTATGTGA TTAGCGAAGG CTCGGTGCAA GGGGTCATCC AGCTCAACCC CGACGGACGC TTCATCGGCA ACTTCGCCGC CAACACTGCC CAGATGTCGC TGCGCATGAT CTTGCAGCGC ATGTTTCTGA GCGAAGAGCA ACTGGCTCAA CTGGTACGCA ATGAAGCTGC CTCACCTTCC AACCTTGCCA TTGATCGCCA ATCAATGCTC TACACTCTCA CTGCCAGCAC CTTCGCCGAT CAAAGTATTC GCAAATTTAC GGTGGCAGGG AGGAATATCT TCCCAACCAT CTACGGATCG ACCTCGTTCC GCGACATCTA TGTCGATACC GAAGGGTTGC TGGTGGTGGT TGACGGTGAA GGGCGTATCT TCGAGTATGA CCAAAACGGC ACCCTGCTGT TTATGTTCAA TGCCCATGAC AACGGCGACC AACGCCGCGG TACACTCATT AATCCGACCG GTATTGCCCG TTACGGCGAT ACAATCTATG TGCTTGACAA AGAGAAGAAT GCTCTGATCC GTTACCAAAC GACTGCCTTC GCCGAGACCG TGCATCGCGC GATGCGGCTC TATTTGGCCG GCTTCTACCG TGAAGCGAAA CCCTACTTTG AACAAGTGCT CAACTACAAC GGCTCGTTCA TCATGGCCTA TCAGGGGTTG GCCGACGCCT ATTTCAAAGC GGGAGACTAC CCTGCTGCGC TGGCCGCTTA CCGCTACGCC GAAGACCGGA ACGGCTATTC GGAAGCCTTC TGGGAGCTGC GCAATGCGGT GTTGCAGCAG TACCTCGGCC CGTTCATCGT GGCGGTGACG CTGGCCGCAC TTGGGCAGAG CGTCTTCCGC CGGTTCGAGC GCCGGCACGG CTGGCTAACG CCATTGCGTG ACGGGATGCG CCGGCTGCGT CGGTACCGGC TGGTCGATGA CGCGATCTTC TTGCTGCGCT TCGTGCGCCA CCCGGTTGAT AGCTTTTACT ACATCAAGGC CAGCCAGCGC GGCAGCCTGC GCTTTGCCCT GCTGATCTAC CTGTGGGTGA TCGCAGTGCA TGTATCCTCG CTCTACCTGA TCGGCTTCCC ATTCAACCCG TATGCCTATC CCGGCCAGAT TCGGGTCGAG AACGAGATCG CGCTGTTGGT GGCGCTGTTT GGCTTGTGGA ACGCCGCTAA CTATCTGGTT TCAACCATCA GCGACGGCGA AGGGCGGGTG CGCGATGTCG TGATCGGCAG CGCCTACAGT CTCGTTCCCT ATGCACTGTT CATGCCGGTG GTGATTGCGG CCTCGAATGT GCTGACACTG 
AACGAAGTCT TTTTGGTCAG CTTTTCGCAG CAGGTCATCT TGGCCTGGAC TGGATTGATG CTTTTCATTA TGGTGCGGGA AATTCATAAC TACACCATTT CCGAAACGAC GAGCAATGTG CTCAAGACGC TGTTCACGAT GGCGATGCTG GCGTTGACCG CCTACATCTT GTCCTTACTG TTTGGGCAAC TGTTCGACTT TGTGAGCGCG GTCTGGCAAG AGATCGGGTT ACGTGGTCTG TAA
|
Protein sequence | MTLMRTRKRR GFLIGAIIGL ALGLALLTPS ARADTPYVTW TPGPGGKLYM TQDAYIPTDE INLPVNAPED LFVTPDGVIY LADTGNGRVV RLDATLAVAA EYGKGVLNKP TGVFVDDEGT VYVADAGLNQ IVVFAADGTL RHQFGRPSEP LFGKNRTFLP RKVAVDRRKN LYVISEGSVQ GVIQLNPDGR FIGNFAANTA QMSLRMILQR MFLSEEQLAQ LVRNEAASPS NLAIDRQSML YTLTASTFAD QSIRKFTVAG RNIFPTIYGS TSFRDIYVDT EGLLVVVDGE GRIFEYDQNG TLLFMFNAHD NGDQRRGTLI NPTGIARYGD TIYVLDKEKN ALIRYQTTAF AETVHRAMRL YLAGFYREAK PYFEQVLNYN GSFIMAYQGL ADAYFKAGDY PAALAAYRYA EDRNGYSEAF WELRNAVLQQ YLGPFIVAVT LAALGQSVFR RFERRHGWLT PLRDGMRRLR RYRLVDDAIF LLRFVRHPVD SFYYIKASQR GSLRFALLIY LWVIAVHVSS LYLIGFPFNP YAYPGQIRVE NEIALLVALF GLWNAANYLV STISDGEGRV RDVVIGSAYS LVPYALFMPV VIAASNVLTL NEVFLVSFSQ QVILAWTGLM LFIMVREIHN YTISETTSNV LKTLFTMAML ALTAYILSLL FGQLFDFVSA VWQEIGLRGL
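The sequences above are displayed in space-separated blocks of 10 residues, so they need to be joined before any computation. The following is a minimal sketch, not the method used to produce the GC content field: `join_blocks` and `gc_fraction` are hypothetical helper names, and the demo input is only the first three blocks of the gene sequence, so its result is not the whole-gene GC content.

```python
def join_blocks(grouped: str) -> str:
    """Remove the spaces and line breaks used to group residues into blocks."""
    return "".join(grouped.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above, used only as a short demo input.
prefix = join_blocks("ATGACACTGA TGCGTACCCG CAAACGAAGA")

print(len(prefix), prefix[:3], gc_fraction(prefix))
```

Applying the same two helpers to the full gene sequence would give a value comparable to the GC content field. The displayed gene sequence begins with ATG and ends with TAA even though the gene is on the minus strand, which suggests it is already shown in the coding orientation.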