Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3624 |
Symbol | |
ID | 7269768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4403783 |
End bp | 4405060 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643568431 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002464897 |
Protein GI | 219850464 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.541774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAAAAC GGTTCGTATT CCGTCTTCTA CTATTGGGCT GGTTGCTGGC CTTCACGGCA TGTGCTGCAT CACCGGCGAG CGTACCGCCG GCTCAAGCAG GCCGAACGGT GTTGCGCCTG TGGCATGCGT GGCCTTCAAC CGAAGGACGT GTGCTGCAAA CGCTTGTCGA ACAGTTTAAC CAAGCCCATC CGCAGTGGCA AATTGTCGTC CAAGCTCGTC CGGCAGTGTC TCTACCTGCC GATCTGATGA CGGCCGTGCA AGAAGGTGGT GGCCCGCATT TGGCGATTGT CCAAAGCCAT ACCCTTGGCA CTTTGGTCGA TGCCGGGGTT GTTCGCCCGC TCGATGATGT GATCGCAGCC GGTGAATTGT CTAGCCTGTT GCGGGCTGCC GTCGGGTCGG CCCAAGTCAC CGTTGCCGGT CAACCAACAC TCGTTGGCGT ACCTATCAGC TTTGATACAT TGGCTCTCTA CTACAACCGT GCTAACGTCT TGCAACCACC AACTACGATC GAAGAGCTGT TGCAGACCGG GCGAGCTTTG ACCGATCGCA ATCGGGTGCC ACCGGTGTGG GGATTGGCCT ACAATCTGTC ATTAGATCGC ACGATTGGTT ATCTCTACGC CTTCGGTGGG CGTGTTTTTG ATGAAAATGG CACGTTAGTG CTTGGCGATA GTGGGCGGGA AGGCACAGAG CGTTGGCTGG CATGGCTCGG GCAGTTATAC CGTGATGAAC AATTGTTAGC CACACTCGAT GGTGTGGTGG TGGATCGGGT ACTCCAATCA CGTGACGCAA TTATGGCGAT CGATTGGGCG CATGCCCAAG CTGAATATCG TGCAATTTGG AACGATCAAC TAGGTGTCGT GCCTTTACCA CGGTTAGGGG CAACCGATCG TCTTCCGCAA CCTTACGTGC AAGCCGATGT TATTGTGATG AACGCCCGGC TTACCGATCA GGCCGAACAA ACGGCCGCTC AAGCGTTTAT GCGTTTTATG ATTGAGCCAT CTAGCCAACG GGTGTTGCTG GCTGTCAACC GCCAGCCTAC CCAACTTGCG CTTCTGCTTA GTGATACCGA TCTCGATGAT CAAATCCAGT TGGCTGCGGC ACGAGCGTTT CGGGCACAGG CTCAGCACGG TTTGCCGATG CCATCTGATC GACTTGCCAA CGAATTCGTC TGGACAACCC TGGCCGATAT GCATCTCAGT GCGGTGCGTG GGTTGCTTAC TCCTGAACAG GCAGTCTCAC AGGCCGTCGA GATCTTGCAT AGTCGCTTCA CACCCTAG
|
Protein sequence | MQKRFVFRLL LLGWLLAFTA CAASPASVPP AQAGRTVLRL WHAWPSTEGR VLQTLVEQFN QAHPQWQIVV QARPAVSLPA DLMTAVQEGG GPHLAIVQSH TLGTLVDAGV VRPLDDVIAA GELSSLLRAA VGSAQVTVAG QPTLVGVPIS FDTLALYYNR ANVLQPPTTI EELLQTGRAL TDRNRVPPVW GLAYNLSLDR TIGYLYAFGG RVFDENGTLV LGDSGREGTE RWLAWLGQLY RDEQLLATLD GVVVDRVLQS RDAIMAIDWA HAQAEYRAIW NDQLGVVPLP RLGATDRLPQ PYVQADVIVM NARLTDQAEQ TAAQAFMRFM IEPSSQRVLL AVNRQPTQLA LLLSDTDLDD QIQLAAARAF RAQAQHGLPM PSDRLANEFV WTTLADMHLS AVRGLLTPEQ AVSQAVEILH SRFTP
|
| |