Gene Information |
Locus tag | Cagg_2267 |
Symbol | |
ID | 7266680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2768001 |
End bp | 2769338 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643567098 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002463583 |
Protein GI | 219849150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
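The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the record does not state either convention, but both are consistent with its values. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2768001
END_BP = 2769338
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1338
print(coding_codons(GENE_LENGTH_BP))   # 445
```

Under these assumptions, 2769338 - 2768001 + 1 gives the 1338 bp gene length, and 1338 / 3 - 1 gives the 445 aa protein length.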
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00064551 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCGCC CGATCAAATT CCTCATGCTT CTGGCCGTAC TGGCATTGGT GCTGAGCGCC TGTGGTCAGA CTACAACCCA ACAACCGGCA TCGACACCAC AAACGATACG GGAAACGGTG ACCGTTAAAG AGACGGTGAC CGTCAAAGAG ACGGTTGTCG TTCCGGCCGA AAAAGTGACC GTCCGCTTGT CAGGGTGGGC TTCAAGCCCG GCAGAGACCG CACTGCTCGA GTCGCTGCTC TATAAGTTCT CGGTGGAGAA CCCCGATATT CTGGTCAAGT ACGAGCCGAT CACCGGCGAC TATAAGCAGG TGCTCTTGAC CTCGATTGCA TCGGGGACGG AACCCGATAT TTTCTATGTT GACATCTTCT GGTGGTTGGA GCTGGCCGCC AACGATGCGT TGTTGCCTCT CGATGACTTG ATGGCAAGCA GCGGAGTGTC GCGTGACGAT TTTATTCCGG CACTGATCGA TGCCTTTACC TACAATGGTA AAACCTATGG TATTCCCAAG GACTTTAACA CCCTTGGTAT GTTCTACAAC AAGGCGCTAT TTGATAAAGC CGGTCTAGCT TATCCAACCG ACGATTGGAC ATGGGACGAT CTGCGTAATG CAGCAGCAGC GCTGACCGAT CTGAGCGATC CGAACAAGCC GATCTACGGC TTCTGCACTC CGCCCGATCC AGGCCGGTTC CCGGTCTTCA TCTTCCAAAA CGGCGGCATG GTGATGAACC CCGATTATAC CGACACCATG CTCGATAGCG ATCCGGCGGT GAAAGCAGCC GAATTCTATA CCTCGTTCCG CACCAATCAG ATCGGCGCGC TGCCGTCGGA TCTCGGCGAG GGTTGGCAAG GCACGCTCTT CGGCAAAGGC CAGTGCGCGA TGGTCTATGA AGGCGGCTGG CTGATCCCCT ATCTGCGCGA TCAGTTCCCC AACACTCAGT ACGGTGTGGT CATGCCGCCG GCCGGCCCGG GTGGCGAGGG TAACTTGATC TTCACGGTGG CGTGGGGTAT CTCGGCCAAC ACCAAGAACC CGGAAGCGGC GTGGAAGGTG GTCAACTTCC TTACTAGCGA AGCCAGCCAG AAGACGGTGC TTGAGAGTGG CTTTGCATTG CCGTCGCGTC AGTCGTTGCA AAACAGCGAC TATCTGAAAA ACAATCCGAA CTCGGCAGCT ATCTTCAACG GTTCGTTCTT CGGTGCGAAG CCCTTCTTCT GGGGCGCCGT CGGCTCCGAT GTGAACGATC AGATGTCGAA GGCGCTTGAG CGGATGTTTA AAGAGAACCA GCCAGCGCCA GAGGCTATGA AGCAGGCGGC TGAAGCCATC CGTAAGGCAA TGAAGTAA
|
Protein sequence | MNRPIKFLML LAVLALVLSA CGQTTTQQPA STPQTIRETV TVKETVTVKE TVVVPAEKVT VRLSGWASSP AETALLESLL YKFSVENPDI LVKYEPITGD YKQVLLTSIA SGTEPDIFYV DIFWWLELAA NDALLPLDDL MASSGVSRDD FIPALIDAFT YNGKTYGIPK DFNTLGMFYN KALFDKAGLA YPTDDWTWDD LRNAAAALTD LSDPNKPIYG FCTPPDPGRF PVFIFQNGGM VMNPDYTDTM LDSDPAVKAA EFYTSFRTNQ IGALPSDLGE GWQGTLFGKG QCAMVYEGGW LIPYLRDQFP NTQYGVVMPP AGPGGEGNLI FTVAWGISAN TKNPEAAWKV VNFLTSEASQ KTVLESGFAL PSRQSLQNSD YLKNNPNSAA IFNGSFFGAK PFFWGAVGSD VNDQMSKALE RMFKENQPAP EAMKQAAEAI RKAMK
|
| |
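The gene and protein sequences above can be related with a short, dependency-free sketch. It assumes the gene sequence is shown in the coding orientation (it begins with ATG and ends with TAA), and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code; alternative start codons are not handled. The helper names (`clean`, `translate`, `gc_fraction`, `reverse_complement`) are hypothetical, and only the first 30 bases of the record are used as input.

```python
# Codon table built in the conventional TCAG order.
# Assumption: table 11 amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


prefix = "ATGAACCGCC CGATCAAATT CCTCATGCTT"  # first 30 bases of the gene sequence
print(translate(prefix))  # MNRPIKFLML
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields. Because the gene lies on the minus strand, `reverse_complement` gives the corresponding sequence as it would read on the opposite strand of the replicon.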