Gene Information |
Locus tag | Cagg_2195 |
Symbol | |
ID | 7266768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2689993 |
End bp | 2691345 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643567026 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002463514 |
Protein GI | 219849081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
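The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that `Start bp` and `End bp` are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2689993, 2691345)
print(length, protein_length_aa(length))  # prints "1353 450"
```

Both values match the `Gene Length` (1353 bp) and `Protein Length` (450 aa) fields reported for Cagg_2195.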
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATGA AACGTACATT CACCCGGCGC GAGTTGTTGC GCCTGATGGT GGCCGGGAGC GGTGCGGCGG TATTGGCGGC GTGTGGTACG CAAGGCGGGC AGACGGGAAC GCAGGCCACC CAAGCACCGG CTGTGGTCAG CCAGCCCGGA TCGAAGGTCA AGATTACCTA CTGGGGTTCG TTCAGTGGGA ATCTGGGCGA AGCTGAGCAG GCGATGGTCA AGGCGTTTAA CGAGGAGCAG GATGAGGTCG AGGTTGAGTA TCAGTTTCAA GGCAGTTACG AAGAGACGGC ACAGAAGTTC ACCGCTGCTT TGCAAGCTAA TACCACGCCT GATGTCATCC TGCTCTCGGA TGTCTGGTGG TTTGGCTTTT ATCTGGCCGG TGCGATTACA GCACTCGATG ACCTCGCCAG GCAGGTGAAT CTCGATTTCA ATGATTACGA ACCGGTGTTG CTCAATGAAG GTGTGCGCAA AGGTGTCCAT TACTGGATCC CATTTGCGCG CAGCACACCG CTCTTCTACT ACAACAAAGA CATTTGGGCC GAGGCGGGTC TGCCCGATCG CGCCCCAGAG ACGTGGGCCG AGTTTAGCGA GTGGGCGCCG AAGTTGGTCA AGAGCGATGG CAGCCGGTCC GCGTTTGGTC ACCCTAACGG TGCGAGCTAC ATTGCGTGGC TCTTCCAGGG GGTGGTGTGG CAGTTTGGCG GTCAGTACTC GCAACCTGAC TTCACCATGA CGATGACCGA TCCGAATACG TTGCGCGCGG CTCAGTTCTA CCAGGATACG GTGGTCAAAA ATAAGTGGGC TATCTTGTCG CCCAACCTTA ATCAAGATTT CATCGGTGGG GCGATTGCCT CGATGATGGC CTCAACCGGT TCATTAGCCG GGATTCAGGC TAACGCTACC TTCCCGGTGG GAGTCGGCTT CTTGCCGCGA GAGACCAACT TTGGTTGCCC GACCGGTGGC GCCGGTTTGG CGATTGTCAG CCGTGCTCCT GCCGAGAAGC AACTGGCGGC GATGAAGTAT ATCGCGTTTG CGACCAACCC TACCAGCGCC GGTGTGTGGT CGCGGAGCAC GGGATATATG CCGGTACGGA TTAGCACCAA GCAGACGCCG GAGATGATCG AGTTCTTCAA ACAAAACCCC AACTTCAAGA CGGCGGTTGA TCAATTGCCT AAGACTCGTG CGCAAGATGC GGCACGTGTG TTTGTGCGCA ACGGTGACCA AATTATCGGT AAGGGACTCG AGCGGATCAT CGTCAACGGT GAAGCACCGA GTGCTGTGTT TGCCGAGGTT AATAACGAGC TGACCGAGGG CGCCAAGCCG ATCCTGGAGG ATCTCAAAGC ACGCGAAGGC TGA
|
Protein sequence | MSMKRTFTRR ELLRLMVAGS GAAVLAACGT QGGQTGTQAT QAPAVVSQPG SKVKITYWGS FSGNLGEAEQ AMVKAFNEEQ DEVEVEYQFQ GSYEETAQKF TAALQANTTP DVILLSDVWW FGFYLAGAIT ALDDLARQVN LDFNDYEPVL LNEGVRKGVH YWIPFARSTP LFYYNKDIWA EAGLPDRAPE TWAEFSEWAP KLVKSDGSRS AFGHPNGASY IAWLFQGVVW QFGGQYSQPD FTMTMTDPNT LRAAQFYQDT VVKNKWAILS PNLNQDFIGG AIASMMASTG SLAGIQANAT FPVGVGFLPR ETNFGCPTGG AGLAIVSRAP AEKQLAAMKY IAFATNPTSA GVWSRSTGYM PVRISTKQTP EMIEFFKQNP NFKTAVDQLP KTRAQDAARV FVRNGDQIIG KGLERIIVNG EAPSAVFAEV NNELTEGAKP ILEDLKAREG
|
| |
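The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one possible way to strip the spacing, compute GC content, and run a basic sanity check on a coding sequence. The helper names are hypothetical, and the start codon set is an illustrative subset of the initiation codons allowed by translation table 11, not the full list.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that separates the display blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(
    dna: str,
    start_codons=("ATG", "GTG", "TTG"),  # assumption: common bacterial starts only
    stop_codons=("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    dna = clean_sequence(dna)
    return (
        len(dna) % 3 == 0
        and dna[:3] in start_codons
        and dna[-3:] in stop_codons
    )


# Toy inputs for illustration; pass the full Gene sequence field in practice.
print(clean_sequence("ATGAGTATGA AACGTACATT"))  # first two display blocks, joined
print(gc_percent("ATGC"))
print(looks_like_complete_cds("ATG AAA TGA"))
```

Applied to the full `Gene sequence` field, `gc_percent` can be compared with the reported `GC content` value, and `looks_like_complete_cds` should agree with the record's ATG start and TGA stop.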