Gene Information |
Locus tag | Cagg_0090 |
Symbol | |
ID | 7266828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 127748 |
End bp | 129217 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643564963 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002461479 |
Protein GI | 219847046 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
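The coordinates and lengths listed above agree with each other. The arithmetic is sketched below, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 127748
end_bp = 129217

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1470
print(protein_length)   # 489
```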
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.271567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCG CTGCCAAAGG TGGCTTCCTG CTCGTGATAC TGATGCTGCT CGCGGCCATG GCCGGCTGTA CGCCAGCGGC TATACCACCG ACTCCCCAAG CGCCAACCGC TGCGCCACAG ACACCCCCTG AGCAACCGAC CGCCGCACCG ACCGCCGCAC CGACCGCTGC ACCGACTGCT GCTCCCACTG CTGCCGCCAA GCAACCGGTG ACCCTGCGCT ACGCCAACTG GAACCTCGGC ACTGAGGAAG AGAACAACAT TCAGCGCCGC TTGGTCAAGG CGTATACTGA GATGAACCCG CACGTGACGA TCGAGTTCGT TGATATGTCG GGTGGTGGCT GGGACGATAT GCTCAACACC TATGCAGCCC GCGGTGAGCT ACCCGATGTC TTTATGGCCA ACAACATGCC GCTCTACGTT AAGAACGGCT GGCTGGCCGA TTTGACCGAG CTGGTGGCGA ATGATCCCGA TTGGGCGCTC ATCCCGCAAG TGCTGCGGTC AGGTGTCACC TATAACGGCA AGGTGATGGG TTTGCCGGCG GCGCAGTTCA TTATGGGCTA TTTCGTCAAC CGCGATCTCT ACGAAGCGGC TAACCTTGAT GCGCCTGAGT ACGGTTTCAC GCTCGACGAG TTCAACGCGG CGGTGACCGG CTTACACAAC CCGTCCCGAG GCGTTCTCGG TCTCGACGAG ATGGAGTTCG TGATGGGCTG GTACCCGCAC GTGCTCGACA ACAAGTTGCA GTGGTTCAGC TTCGATGGCG TTCACATGAA CTACAACTCA CCGGCGTTCA AAGACACGGT GGCGCGGGTG GCCGAGCTGA AGCCCTACAC ATGGCAGGGC TTGACCGATG AGCAGAAGGT CAACTTCAAA TCGGCCGGAC CGTGGGAGCT GTTCCTGAAC CAAGAAGTCG GCATGCGCTG GGAAGGCGGT TGGGCCATTC CGCAGATTGC GCAGAACGCT ACCTTTGACT GGGACTTCGT CGGCATCCCC GGCGGTAATC AGGCGATTGT GATGGACATC ATCGCTGTCT CGAAGACGGC GCCGAATCTG GAGGAAGCCT ACCAATTCGC GCGCTGGATG ACCTTTGCCC GCGCCGCTTA CGCCAAAGAG GTGGAACTGG CCCGCGAGAT GGGTAGCGTG CCAAGCAAAA TGCCGGTCGC GATTGACACT GAGTCGCTGG CGCTCTACCG CCAATTCTTC GACAAGCCGG GTCTCAATGC AGCCCTTGAG AATCTGAACA ATAGCCTTGT CGAGTCACTG GCCAAACTCG TACCGGGTTA TATCCAGGCA CGCTGGGAAG GCAAACCCGG CATCGACATC GGCGAAGATA AAGATGTGAA CATGTGGTTC ATGTTCGCCC ATGCCGGCGA TGGCATCTAC AAGTACGAGG ATTACGCACC GAAATTAGAG ACGTTCGCCA ACAATATCCT CGATACGGCG CGGGCCGAGG TTGACGCCGC CTTGCGATAG
|
Protein sequence | MKPAAKGGFL LVILMLLAAM AGCTPAAIPP TPQAPTAAPQ TPPEQPTAAP TAAPTAAPTA APTAAAKQPV TLRYANWNLG TEEENNIQRR LVKAYTEMNP HVTIEFVDMS GGGWDDMLNT YAARGELPDV FMANNMPLYV KNGWLADLTE LVANDPDWAL IPQVLRSGVT YNGKVMGLPA AQFIMGYFVN RDLYEAANLD APEYGFTLDE FNAAVTGLHN PSRGVLGLDE MEFVMGWYPH VLDNKLQWFS FDGVHMNYNS PAFKDTVARV AELKPYTWQG LTDEQKVNFK SAGPWELFLN QEVGMRWEGG WAIPQIAQNA TFDWDFVGIP GGNQAIVMDI IAVSKTAPNL EEAYQFARWM TFARAAYAKE VELAREMGSV PSKMPVAIDT ESLALYRQFF DKPGLNAALE NLNNSLVESL AKLVPGYIQA RWEGKPGIDI GEDKDVNMWF MFAHAGDGIY KYEDYAPKLE TFANNILDTA RAEVDAALR
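The relation between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino-acid assignments as the standard code (it differs in start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAACCCG CTGCCAAAGG TGGCTTCCTG"))  # MKPAAKGGFL

# Last 15 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("GCCGCCTTGCGATAG"))  # AALR
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the listed protein sequence after removing the spaces between its 10-residue blocks.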