Gene Information |
Locus tag | Cagg_0089 |
Symbol | |
ID | 7266827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 124799 |
End bp | 127654 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643564962 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002461478 |
Protein GI | 219847045 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
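The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cagg_0089.
length = gene_length(124799, 127654)
print(length)                  # 2856
print(protein_length(length))  # 951
```

Both results agree with the Gene Length (2856 bp) and Protein Length (951 aa) rows of the record.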
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTGGC ACCGGCAGAT CACAACCGCT ATCCGCCTCC TCATGGTACT TTGGCTGGCG CTCGCTAGCC TGCTCTCCGG CGCACTGCCG GTGCGAGCCG AGGGCGGTAA CACGCCGGTC ATTGAAGTGC GCGCCGATAT GCTCGGTCTG CCGGTGGGGA CCCATCCGTT GGCCGCTAGT TATCCCAAGC CGAGTGTCAT CATGCGTGCC GGCGATCAGG TCTCGTTTAC CGTCACCGTG CCGCAAGCCG GCGCGTATGC GTTAGCGATT GACGCCGCCG TGCCGGAAGC GGTCACCTTG ATCCCGCCTG AACTGCTCGT GCAGTTCGAG GGCCAAACTG AGCGCTGGCG GCTGATTGTG CCGCTTTTCT ATCACGATAC TGCCGATCAG TTCCCGCTTG ACCGCTACGG CAACGAAACC CTGATCCCGC AGGCCCGGTT GGCCCGCTGG AGCCGCGTCT TCTTGCGCGA CCCCAACTTT AGCCGGCCGT ACCCGATCGC GCTCACGCTG CCGGCGGGCC ACCAGCGCAT TTCACTCAGC CTCGGCGAAG GCGAGCTGGC GCTGGGCAGC TTCTATCTGT TGCCGGCGAT CGACTCGTAC CCATCTTACG CTGTCGACCA CGCTAGCCGT CAAGCACCGA GCAGCAGCGG AGTGCTGATC GAACTCGAAG CCGAGCGGCC AAGTTTTAAG AATGATACCA GTGTGCGCCC GATCAGCCGC CGTAATCTGG AAGTAACGCC GTATGATCCG TACCATCTGC GGATGAATGC CCTCGGCGGC GAGAGCTGGA GCCGCAGCGG TACTGCGGTC TACTACGAGT TTACCGTGCC GCAGAGCGGC TGGTATGCGA TCACCCTGCG CGCCATCCAA GATTACAAGA ATAATTTTAC CGTGTATCGC CGCATTCTAC TTGATGATCA CGTGCTCTTT GCTGAGTTGA ATGCAGTGCC GTTTGGCTAC ACTACGAATT GGCGCAACTA CACGCTGGGC GGATCCACCC CATACCAGAT TTATCTTGAG CAAGGTCGGC ACGTCTTGGG CATCGAAGCC ACAACCGCGC CTTATCACGA CTCAATCGAG CGGGTGCGAG TCGGTTTAAA AGCGATTGCC GATCTGGCCT TCGCCATTAA GCGCCTGACC GGTAACCAGA TCGATATTTA CAAAGAGTGG GAGATCGCCG ATTACATTCC CGATATTCGC GAACGACTGG CGGCGCTGGT CAGCCAACTG CGCGCCGACC AACAGGCCTT GCTGGCCGTC AACCAGACCC CGGCCTCACC AGAGGTGCTC GCTTACCAGA TGGCGATCGA CAACCTCGAG GTGCTGGCCC AAGACCCCAA CCGGATTCCA ACCCGTATGA GCCGGTTATC AGAAGGGGCC GGTTCGGCGG CGCACCTGCT CGGTAGCATT TTGCCCTCGT TGCAGAGCCA ACCGCTGGCG CTTGATAAAA TCTATATCCA TTCGCCGGAC GTCATTCCTC CTGAACCGAA CATCACCACC GGCGCAATCG TGACCGATTG GTTCCAGCGC TTCCTCGGCT CATTCCGCAG CAATCCCTAC CAGAGCATCG GCGCGGCGCC GGATGAGCTG GAAGTGTGGG TGAATCGCCC GCAACAATAT GTCAATTTAC TTCAGCGCAT GACCGACGAG CGCTTCACAC CACAAAGCGG GATCAAGGTC AAGTTTTCGA TCATGCCCAA CGAATCGAAG CTGATCTTGG CCTGCGCCGC CGGCACCCAA CCCGATATTG CGCTCGGTGT CAGCACCAAC ATCCCGTATG AGCTGGCGAT CCGCAATGCG 
CTCTACGATT TGCGCAGCTT CCCTGACTTT GACCACTTCA TCCGCATCTA TGCCCCCGGC TCGCTCTTGA GCTACATCAT TAACGACTCG GTATACGCCA TTCCTGAAAC GCAAGATTTT TGGGTGACGT TCTATCGCAA AGATATTCTG GAAACACTGA ATTTGCCGGT GCCGCAAACG TGGAATGAAG TATTAGAGAT TTTGCCCGAA TTACAGCGTT TCGGCATGAA CTACAACACG CCGCTCTCAA GCGGCGGCGG CATGAAGGGC TATCTGGTCA CGGCCCCTTA CCTCTTCAAC TACGGCGCGT CGCTCTACAC ACCCGATGGC ATGTCGGGCT TGGGGTCGGA CGAAGCGATT CAGGCGATCC GGTTTATGGC CGAGAGTTTC ACCATCTACG GCATGCCGCT GACCACGGCC AGTTTCTACG ACAGTTTTCG CGCCGGTGAA ACGCCGGTCG GCATCTCGAA CTTTGAAACC TACCTCAAAT TGCTCACTGC GGCGCCGGAG ATCGATGGGT TGTGGGATAT CGCCCTCTAC CCAGCCACCG TCTTGCCTGA TGGCAGGCAG TTGCGCTATG CGACCGGCTC GGCGCAGGCG GCGATGATGT TTGCCAACAC CGATAAGCCG CAACAGGGCT GGGCCTTCCT CAAGTGGTGG ATGTCAACCG AGACTCAGGT CGCCTTCCAA CAAGAATTAA TTATGAATTT TGGGTTGGAA TATTTGTGGA ATTCAGCGAA CCTCGAGGCG TTTCGCTTTA CCCCGATTTC GGCGACGCAC CGCGACGTGA TTTTGCAGCA GTGGCAATGG CTGCAAGAGC CGATCAAGCT GCCGGGCAGC TATATGCAAG AGCGCGAGTT GAGCAACGTC TGGAACCGGA TCGTCTTTCA GGGAGCTAAC CCGCGCGTCG CGATCGACAA TGCGGTGACG GTGATCAACC GCGAGATCGT GCGCAAGATG ACCGAGTTCG GCTACATCCG CAATGGCGAG CGGGTGCGCA CGATGACGAT CCCGACTATC GAGACGGTGA AGGAGTGGAT GGCGCATGCA AACTGA
Protein sequence | MGWHRQITTA IRLLMVLWLA LASLLSGALP VRAEGGNTPV IEVRADMLGL PVGTHPLAAS YPKPSVIMRA GDQVSFTVTV PQAGAYALAI DAAVPEAVTL IPPELLVQFE GQTERWRLIV PLFYHDTADQ FPLDRYGNET LIPQARLARW SRVFLRDPNF SRPYPIALTL PAGHQRISLS LGEGELALGS FYLLPAIDSY PSYAVDHASR QAPSSSGVLI ELEAERPSFK NDTSVRPISR RNLEVTPYDP YHLRMNALGG ESWSRSGTAV YYEFTVPQSG WYAITLRAIQ DYKNNFTVYR RILLDDHVLF AELNAVPFGY TTNWRNYTLG GSTPYQIYLE QGRHVLGIEA TTAPYHDSIE RVRVGLKAIA DLAFAIKRLT GNQIDIYKEW EIADYIPDIR ERLAALVSQL RADQQALLAV NQTPASPEVL AYQMAIDNLE VLAQDPNRIP TRMSRLSEGA GSAAHLLGSI LPSLQSQPLA LDKIYIHSPD VIPPEPNITT GAIVTDWFQR FLGSFRSNPY QSIGAAPDEL EVWVNRPQQY VNLLQRMTDE RFTPQSGIKV KFSIMPNESK LILACAAGTQ PDIALGVSTN IPYELAIRNA LYDLRSFPDF DHFIRIYAPG SLLSYIINDS VYAIPETQDF WVTFYRKDIL ETLNLPVPQT WNEVLEILPE LQRFGMNYNT PLSSGGGMKG YLVTAPYLFN YGASLYTPDG MSGLGSDEAI QAIRFMAESF TIYGMPLTTA SFYDSFRAGE TPVGISNFET YLKLLTAAPE IDGLWDIALY PATVLPDGRQ LRYATGSAQA AMMFANTDKP QQGWAFLKWW MSTETQVAFQ QELIMNFGLE YLWNSANLEA FRFTPISATH RDVILQQWQW LQEPIKLPGS YMQERELSNV WNRIVFQGAN PRVAIDNAVT VINREIVRKM TEFGYIRNGE RVRTMTIPTI ETVKEWMAHA N
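Both sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal Python sketch, not a definitive pipeline: the helper names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA). The demo input is only the first two blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq_blocks: str) -> str:
    """Remove all whitespace between display blocks and uppercase the result."""
    return "".join(seq_blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Demo on the first two display blocks of the gene sequence only.
demo = clean("ATGGGTTGGC ACCGGCAGAT")
print(demo)               # ATGGGTTGGCACCGGCAGAT
print(gc_fraction(demo))  # 0.6
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content row, and `len(clean(...))` against the Gene Length and Protein Length rows.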