Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2944 |
Symbol | |
ID | 7268817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3606873 |
End bp | 3608636 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643567766 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002464240 |
Protein GI | 219849807 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0305195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTT TCCTAATAGT TCTCGTGGCC CTGTGGGGAT TGGTAACATT TGCCGGATGT CGGGTCGAGT TAACCAATCC AGAAAATACG CCGGTCGTGA TGGCAACACC GACGCCGATA GTAACGCTAC CCACCGAGCG AATACTACGG ATTGGATTGG TGGCCGAACC TCCTGATCTA TTGCCGTACC ACAGTACTCC AACTGATGAG CGGTTGAGTG GGGCGATTAC CGAGTTGATC TTTCCTTCAC CACTTTTACC GGTGAATTTT ACGCTGCAAC CTACCGGTGT ATTAACTGAC TTGCCCAGTT TGGTCAACGG TGGTGTCGTT ACAACAACGG TGACCGTTTT TCGTGATGCA TTGGGCCAGA TTACCGATAC ACCGACTGAA CAGACCGATG AAGTAACCCA AGTTAGTGTG ACTTTTCATT GGAACGCTGA ACTGCGTTGG TCTGATGGTA CACCGGTGAC GGCTGCCGAT TCGGTCTTTG CCTACGAGCT AGCACAGAGC GCCAATCTCG GTCAAGCGGC TGCCAGTCGG TTGGCGATGA TTGCCCGCTA TGAGCAAGTT GATGAGCATA CGACCCGTGC TGTGCTCAAT CCTGATGTTA CCGATCCGGC CTATCTGACC AGTTTCTGGA CGCCACTACC TCGTCATTTG CTGAGCAAGG TCGATCCGAA ACAAATCTGG CAAAGTGATT TTGCACGCCG TCCTGTCGGC TATGGCCCTT ATATGATCCG CAGTTTTGAG GGCGGTTCGT TAGAGTTAGT GCGTAATCCC TATTACACCG GGCCGAAAGC TCCATTTGAT ACGATACTCT TCGTGTTTCG TAACGATCCG GCGCAGTTGA TCGAATTGGT CAACAGTGGC AGCCTCGATC TTGTCTTCAT TGAGCAGCCG ATGCCGGAAC TCTTAACCAG CCTATTGCAG AGCGATAACC GTGGGTTTCA GGTCAGCACA TCACCGAACC CAATTTGGGA GCATCTCGAT TTCAACCTTG ATGTACCGCT GCTGCAAGAT ATTCGTGTGC GGCGAGCGAT TGCGTATGCG ATTAACCGCC CGTTGCTCGT GGAGCAGTTG TTAGGTGGGA AGAGTAACGT CCTCGAGAGC TGGATATTGC CCGGTCAGCT TGGGTATCCA CCGCTCGATC AGATTACGCG CTACCCGTAT AATCCCGATG AGGCACGTCG TCTGCTTGAT GAAGCAAAGT TAGTCGACAC CGATGGGGAT GGTTGGCGGG AGTATGAAGG TTTGCCGTTG TTACTTTCGT TGGTAACAAG CGCCAACTCG CCGCTACGTC AAGCAGTCGC CGAGCGGATT GCCAACGATC TGGCACAGGT CGGGATTCAA GTTGAAGTGA CGTCGTTGCC GGTGAGCGAG TTGTATAGTG TCGATGGCCC ACTATACCGG CGCACCTTCC AACTGGCGTT GTTTGGTTGG ATTGCCGGGC CACACCCTCG TGGTTGGGAG CTGTGGAGCT GTGCCGGTGT GCCGGGCGAG GCGAACAATT GGACGGGCAA TAATTTTGCG GGCTGGTGTT TCTTTGAGGC GAATGAGGCG ATTAATACGG CAACGACGGC GCTTGATCTA GAAACACAAA AGGCGGCGTA TTTACGCCAA CAGCAGTTGT TTACCCAAGA GTTGCCGGTG CTGCCGCTCT TTCAGCGGAT TGATGTGCTC GTGGCGCGTG AGGGTTTGTC GGGTTGGCGC CTTGATCCGA TTGCACCGTT TACGTGGAAC ATTAGCGAGT GGCAGCTTCG GTAA
|
Protein sequence | MKRFLIVLVA LWGLVTFAGC RVELTNPENT PVVMATPTPI VTLPTERILR IGLVAEPPDL LPYHSTPTDE RLSGAITELI FPSPLLPVNF TLQPTGVLTD LPSLVNGGVV TTTVTVFRDA LGQITDTPTE QTDEVTQVSV TFHWNAELRW SDGTPVTAAD SVFAYELAQS ANLGQAAASR LAMIARYEQV DEHTTRAVLN PDVTDPAYLT SFWTPLPRHL LSKVDPKQIW QSDFARRPVG YGPYMIRSFE GGSLELVRNP YYTGPKAPFD TILFVFRNDP AQLIELVNSG SLDLVFIEQP MPELLTSLLQ SDNRGFQVST SPNPIWEHLD FNLDVPLLQD IRVRRAIAYA INRPLLVEQL LGGKSNVLES WILPGQLGYP PLDQITRYPY NPDEARRLLD EAKLVDTDGD GWREYEGLPL LLSLVTSANS PLRQAVAERI ANDLAQVGIQ VEVTSLPVSE LYSVDGPLYR RTFQLALFGW IAGPHPRGWE LWSCAGVPGE ANNWTGNNFA GWCFFEANEA INTATTALDL ETQKAAYLRQ QQLFTQELPV LPLFQRIDVL VAREGLSGWR LDPIAPFTWN ISEWQLR
|
| |