Gene Information |
Locus tag | Cagg_1410 |
Symbol | |
ID | 7269242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1735984 |
End bp | 1737831 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566253 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002462753 |
Protein GI | 219848320 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
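The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cagg_1410
length = gene_length_bp(1735984, 1737831)
print(length)                     # 1848
print(protein_length_aa(length))  # 615
```

Both results agree with the Gene Length (1848 bp) and Protein Length (615 aa) fields.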
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCACA AACGTCTTTG GGCATGGATG ACCCTCATGG CAGTAGCGGC AATGATCCTG GCAGCCTGCG GCGCGGGTCA ACAGCAGACC CAACCAACTA CTGCGCCGGC TGCGCAGGCA ACCACTGCCC CGCAGCCAAC CGCTGCCCCG CAGCCAACCG CTGCCCCGCA GCCAACCGCT GCCCCGCAGC CAACCGCTGC CCCGACTGAA TCACCGGCAC CACAGAAGGG TGGCAAACTG ACGATTCTCT ACTGGCAAGC AGTAACTACC TTGAACCCAC ACCTTGCCAC CGGTACGAAA GACTTCGACG GCGCGACCGT CATCCTTGAG CCGCTTGCCC GCTACAACGA AAAAGACGAG CTGGTACCTT TCCTCGCCGC CGAAATTCCG ACCGTTGAGA ATGGCGGTGT TGCAGCCGAC GGCACCAGCG TGACCTGGAA GATCAAGCCG GGTCTCAAGT GGTCGGACGG CACAGACTTC ACAGTTGAAG ACATTATCTT CACGTGGAAA TACTGCGCCG ACCCGGCGAC GGCCTGCACG ACCAAGGCGG CCTTCGATCC GATTGCTAAC ATCGAGAAGA TCGATGACCT GACGATCAAG ATCACGTGGA AAGAGCCGAC CGCCGACCCC TACATCGCTT TTGTCGGTCC GTTTGGAATG ATCTTACAGC AGAAGCAGTT CGGCAACTGC ATCGGTGCAG CAGCCAGCAC CAGTGCCGAG TGCCAGGCGG CTAACCTTGC CCCTATCGGT ACCAATGCGT GGAAGCTGCG TGAGCTCCGC CCCGGCGATA CCGTCATCTA CGAGCGCAAC CCCTTCTTCC GCGACGCCGA TAAGGTCTTC TTCGACGAAG TCGAGATCAA GGGTGGTGGT GATGCTACCT CGGCGGCACG TGCAGTCTGC GAGACCGGTG AGGTCGACTT TGCCTGGAAC TTGCAGATTC CCAAGGCGGT GCTCGAACCG ATCTTAGCTG CCGGGAAGTG CGATGCGATT GCCGGTGGTT CGTTTGGCGT TGAGCGCATT GTAGTCAACT TCGCCAACCC GGACCCCGCC TTGGGAGACA AGCGCAGCGA GCCTGATCAG CCCCACCCGT TCTTGACCGA CCCCGCAGTA CGGCGAGCAA TCGCGCTGGC AATCGACCGC AAGGCGATCG CCGAGCAGCT CTACGGTCCG ACCGGCGAAC CGACGTGTAA CATTCTGGTG GTGCTGGCAG CCGTTAACTC GCCCAATACT ACTTGCGAGC GCAATGTTGA GGAAGCCAAG CGGCTCCTTG AGGAAGCCGG TTGGAAGCTC AATGGGTCGG TACGCGAGAA GAATGGCGTG AAGTTGATCG TCAGCTTCCA GACCAGCATC AACACCCTGC GGCAAGGTGA GCAGGCGATC ATCAAGTCGA ATCTGGCCGA GATCGGCATC CAGGTCAACG TTAAAGCAAT CGATGCTGCC GTCTTCTTTG GTGGTGATGC CGGTAATCCT GATACGCTGA ACAAGTTCTA CGCCGACCTG CAAATGTACA CCAACGGCCC CAACTCGGCC GACCCACAGC AGTATCTGCA AGGCTGGATT TGTGCCGAGA TGTCATCGTC GGCCAACCAG TGGAATGGCA GCAATGATGG TCGGTACTGC AACCCTGAAT ACGACGCGCT CTTTGAGCAG TTGAAGACTG AGCTTGATCC GACACGCCGC GCACAACTGG CGATCCAGAT GAATGATCTG CTGGTAAACG ATGTGGCAGT GATCCCGTTG ATCAATCGCC GCACACCCAA CGCGAAGCTG AAGAATCTCG AAGGTCCAAC CTTCAACACG 
TTCGACAGCA GCATTTGGAA TATTGCAACG TGGCGGCGCG TACCGTAA
|
Protein sequence | MSHKRLWAWM TLMAVAAMIL AACGAGQQQT QPTTAPAAQA TTAPQPTAAP QPTAAPQPTA APQPTAAPTE SPAPQKGGKL TILYWQAVTT LNPHLATGTK DFDGATVILE PLARYNEKDE LVPFLAAEIP TVENGGVAAD GTSVTWKIKP GLKWSDGTDF TVEDIIFTWK YCADPATACT TKAAFDPIAN IEKIDDLTIK ITWKEPTADP YIAFVGPFGM ILQQKQFGNC IGAAASTSAE CQAANLAPIG TNAWKLRELR PGDTVIYERN PFFRDADKVF FDEVEIKGGG DATSAARAVC ETGEVDFAWN LQIPKAVLEP ILAAGKCDAI AGGSFGVERI VVNFANPDPA LGDKRSEPDQ PHPFLTDPAV RRAIALAIDR KAIAEQLYGP TGEPTCNILV VLAAVNSPNT TCERNVEEAK RLLEEAGWKL NGSVREKNGV KLIVSFQTSI NTLRQGEQAI IKSNLAEIGI QVNVKAIDAA VFFGGDAGNP DTLNKFYADL QMYTNGPNSA DPQQYLQGWI CAEMSSSANQ WNGSNDGRYC NPEYDALFEQ LKTELDPTRR AQLAIQMNDL LVNDVAVIPL INRRTPNAKL KNLEGPTFNT FDSSIWNIAT WRRVP
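The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11, GTG is one of the permitted initiation codons, and an initiation codon is read as methionine. The following is a minimal pure-Python sketch of that rule, not the pipeline that produced this record; the function name `translate` and its `at_cds_start` parameter are illustrative assumptions.

```python
# Codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, at_cds_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated blocks of 10 used in
    this record can be pasted directly. If at_cds_start is True and the
    first codon is a table 11 initiation codon, it is read as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and at_cds_start and codon in TABLE11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence (trailing partial codon is dropped)
print(translate("GTGTCGCACA AACGTCTTTG"))                    # MSHKRL
# Last 18 bases of the gene sequence, which are in frame
print(translate("TGGCGGCGCG TACCGTAA", at_cds_start=False))  # WRRVP
```

The two fragments reproduce the first six and last five residues of the protein sequence listed above.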