Gene Information |
Locus tag | Cagg_0406 |
Symbol | |
ID | 7266574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 501673 |
End bp | 502959 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643565273 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002461787 |
Protein GI | 219847354 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
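
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustration: the helper name `cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates, an unspliced CDS (as stated in the record), and a single terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon.
    """
    # Inclusive coordinates: both end points belong to the gene.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(501673, 502959))  # (1287, 428)
```

The result matches the Gene Length (1287 bp) and Protein Length (428 aa) fields. The strand does not affect the arithmetic, since the coordinates are given on the replicon regardless of orientation.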
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00219397 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGCAA TGAAGAGTGC CCTGATCGCT ATGTTTCTCG CCGTAGCCAT GATGCTTGCG GCATGTGGCG GTGGTCAAAC CGCACAACCG ACTACTGCGC CGGCGGAGCA ACCGACTACT GCGCCGGCGG AGCAACCGAC TACTGCGCCT GCTGCCCAAC AGGTGACGAT TAAGATTTGG CACCAGTGGG ATGGTGCCTA CCTGACCGCA ATCGAGCAAG CGTTCCGTGA TTACGAGGCT GCTCACCCGA ATGTCAAGAT CGACCTCTCG AAGCCGGAAG ATGTGAGCAA CGCGCTCAAT GTGGCCATCC CGGCCGGTGA AGGTCCCGAT ATTATCGGTT GGGCCAACGA CCAGATCGGC CAGCAGGCGC TGGTGGGTAA CATCGTCGCG CTCAACGATT ATGGGATCAC CGAAGAATTC CTGCGCAGCA CCTACGAGCC GGCGGCTGTG AATGGCGTCA TCTGGCAGGG CAAGATCTGG GCGTTGCCGG AGACGCAGGA AGGTATTGCC TTGATCTACA ACAAGGCCGT CATTGGTGAC ATGCAGTTGC CGACCAACCT CGATGAGCTG CTGGAAATGG CGACGAAGTT CCGCGCTGAG AACCCTGATA AGACGCTTGT CTGCAATCAG GGATTCGGTG GGAACGATGC TTATCACGTC GCTCCGATCT ACTTCGGGTA TGGCGTGCCG AGCTACGTGG ACGATCAGGG CAATGTCTAC GTCAATACGC CGGAGATGAT CAAGGGCGGT GAGTGGCTGG CCGCGATGAG CAAAGTCTCG TTCAGCGAGC AGAGCTACGA TATCTGCAAG GCAGCATTGG CCGAGGGTAA GGCTGCCATG TGGTGGACCG GCCCGTGGGC GATTGCCGGT ATCGAGCAGG ATGGCGTTGA TTACGGTATT CTGCCGTTGG GCAAGCCCTT CGTCGGTATC AAGACCTTGA TGCTGACCCG CAACGCGGTT GAGCGTGGCA ATGCCGAGGT CGCGTTGGAC ATTATGAAGT ACTTCACCAG TGCGGAGGTG CAGACCAAAC TGGCGCTGAC CAACAAGACG GTGCCGGCGG CGACGGCTGC ACTCAAGAAT CCAGAGGTGG CTGCCCTTCC GACCCTGGCC GGGTTTGGTG CTGCCCTGAA TGCGGGTGTA CCGATGGCGA ATACACCTTA CGCTTCGGCC CAGTGGGGTC CGGTGGGTGA GGCCAGCGTT GCGATTTGGA CCGGTGCGCA GACGCCTGCT GATGCGCTAG CTGCCGCTGC TAAAGCGATT GAAGAAGCCA TCATGCAGAT GAAGTAG
|
Protein sequence | MKAMKSALIA MFLAVAMMLA ACGGGQTAQP TTAPAEQPTT APAEQPTTAP AAQQVTIKIW HQWDGAYLTA IEQAFRDYEA AHPNVKIDLS KPEDVSNALN VAIPAGEGPD IIGWANDQIG QQALVGNIVA LNDYGITEEF LRSTYEPAAV NGVIWQGKIW ALPETQEGIA LIYNKAVIGD MQLPTNLDEL LEMATKFRAE NPDKTLVCNQ GFGGNDAYHV APIYFGYGVP SYVDDQGNVY VNTPEMIKGG EWLAAMSKVS FSEQSYDICK AALAEGKAAM WWTGPWAIAG IEQDGVDYGI LPLGKPFVGI KTLMLTRNAV ERGNAEVALD IMKYFTSAEV QTKLALTNKT VPAATAALKN PEVAALPTLA GFGAALNAGV PMANTPYASA QWGPVGEASV AIWTGAQTPA DALAAAAKAI EEAIMQMK
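
The relationship between the two sequences can be sketched in Python. The gene sequence starts with GTG, yet the protein starts with M, because an alternative start codon is read as methionine at the initiation position. The sketch below is a minimal illustration, not the database's own pipeline: the names `clean`, `translate_cds`, and `ALT_STARTS` are hypothetical, the codon table is built from the standard amino-acid string (which translation table 11 shares for elongation), and `ALT_STARTS` lists only a few of the initiation codons that table 11 allows.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the start codons permitted by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the 10-base block spacing used in the record."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence given in its 5'->3' coding orientation."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            # The initiation codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAAGCAA TGAAGAGTGC CCTGATCGCT"))  # MKAMKSALIA
```

The output agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` is the natural way to compare against the complete 428 aa protein.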