Gene Information |
Locus tag | Cagg_3058 |
Symbol | |
ID | 7269475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3719554 |
End bp | 3721218 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643567878 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002464352 |
Protein GI | 219849919 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
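The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cagg_3058.
length = gene_length_bp(3719554, 3721218)
print(length, protein_length_aa(length))  # prints "1665 554"
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields. The strand does not affect this arithmetic; it only determines which end of the interval holds the start codon.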
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000104524 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCGCC GCATCCGTTG GCAGATTGCC ATTGCCCTAT TCGGTATTAG CGTTATCGTC GGTTTGTTTG GGCGTTTGGC CGCACTGAGC GCCTCGACCG ATAATCCGGC TATCGGTGGT ACCTACCGCG AGGCAGTGAT CGGTTCGCCC AGCGTCCCGA TCCCGCTTCT CAACGATCCA ATTACCGATC CGACCGGTCG TGTGTTGATC AGTCTGCTCT TCGACGGTCT CACTCGCATT GGGATCGATG GATTGATTGA GCCGGCACTC GCGGAAGGGT ATACGGTTGA TGCAACCGGT GAGATTTATA CCTTTAAACT ACGCTCTGGA TTACGCTGGC ACGATGGGAC GCCGATTACC ACCGATGATG TTATCTTTAC CGTCCGCACT TTGCAAGAGT TAGAGACGCC CGGCGAGCCG GCACTTGCCA ATTTTTGGCG CACGACCCTT GTCGAACGGG TAGATAACCG AACGGTACGG TTTATTCTAT CCGGCCCTTT TGCACCTTTT CCGGCATTGG CACGTATGCC GATCTTGCCG GCGCATCTCT TGCGTGGTAT TGCGCCTTCC GAATGGCCAA CCGGTGACTT TGCGCGTCGG CTGATCGGGA GTGGCCCCTT CCGTTTGGCC GAATTGCGGG CTGATGCCGC CATCCTCACT GCCAATCCAA CCTATTACAA TGGGCGGCCA TTCCTCGATC AGATTGAGTT GCGCTTTATG GCTACGCCCG AAGCAGCGGT TGCTGCACTC TTGCGGGGTG AAGTGATGGG TTTTGCCGAA CGATGGGGAA CTAATCTCAA AACTGTTGAT GTACCCGGTG AAGAACGCCG CATCGTTTTA CCGATAGATG AATATACGAC GCTAACGTTT AATCTGCGCC TACCTTTATT TCAAGAGATT CCCCTGCGCC GAGCACTGGC CCTCGGCCTC AACCGTGATG CCCTGATTGA GACGGTACTC AACGGACTAG CCCAACCGAT CGATACACCG CTCTTGCCCG GTACATGGGC CGATGATCCG ACAATACGTC GGGTACCTGC CGATCCGACG GTTGCAGCCG AACGACTGGC CGAAATTGAC TTCGAGCCGG GTAGTGATGG AATCCGCCAG CGTGGTGCTC AGCGCCTGAG CTTTAGTTTG CTGGTTGATC AAAATGAACG ACGGTTGGCA GTTGCTACGG CCATTGCTGA ACAATGGCGT GCGATTGGGG TAGAGGTAAC GGTTGAATCG GTTGAGAGTA CTACCCTCGT CGAACGATTG CGCAAGGGCG ATTTCATGGC AGCAATCCAT ACGTGGACGC GAATCGGCCC TGATCCCGAC GTGTACAGCC TCTGGCATTC TAGCCAGGCC AAGGGTGGTC TGAATTATGC CGGTCTGAAC GATGGACGGA TCGACACCTT GCTCGAACAA ACCCGATCGG AACCAGAATT GGCTGCACGG GCAGAACTCT ACCACGAGTT TCAACGCCGA TGGCTGGAGT TGTCGCCGGC AATAACCCTC TACCAACCGC AGTACCTCAT CGTCAATAGT GCGAATGTGC AAGGACGAGC CTTTGCCAGT CCTGATTTTG CCAACCAAAC CCTCTTCGGC GCCGAAGATC GGTTCCGCGA TGTGCAGCGT TGGTTCGTCA ATAGCTTTCG CCGCCTCGAA GGTGATCTGC CGTAG
|
Protein sequence | MARRIRWQIA IALFGISVIV GLFGRLAALS ASTDNPAIGG TYREAVIGSP SVPIPLLNDP ITDPTGRVLI SLLFDGLTRI GIDGLIEPAL AEGYTVDATG EIYTFKLRSG LRWHDGTPIT TDDVIFTVRT LQELETPGEP ALANFWRTTL VERVDNRTVR FILSGPFAPF PALARMPILP AHLLRGIAPS EWPTGDFARR LIGSGPFRLA ELRADAAILT ANPTYYNGRP FLDQIELRFM ATPEAAVAAL LRGEVMGFAE RWGTNLKTVD VPGEERRIVL PIDEYTTLTF NLRLPLFQEI PLRRALALGL NRDALIETVL NGLAQPIDTP LLPGTWADDP TIRRVPADPT VAAERLAEID FEPGSDGIRQ RGAQRLSFSL LVDQNERRLA VATAIAEQWR AIGVEVTVES VESTTLVERL RKGDFMAAIH TWTRIGPDPD VYSLWHSSQA KGGLNYAGLN DGRIDTLLEQ TRSEPELAAR AELYHEFQRR WLELSPAITL YQPQYLIVNS ANVQGRAFAS PDFANQTLFG AEDRFRDVQR WFVNSFRRLE GDLP
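The sequences above are displayed in space-separated groups of ten characters. As a hedged sketch of how such a field might be processed, the helpers below strip the grouping, compute GC content, and check that a nucleotide string has the basic shape of a CDS (length divisible by 3, ending in a stop codon). The helper names are hypothetical. The stop codon set is TAA, TAG, and TGA, as used by translation table 11. The start codon is not checked, because table 11 permits alternative start codons.

```python
def strip_groups(seq: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    dna = strip_groups(seq)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    dna = strip_groups(seq)
    return len(dna) % 3 == 0 and dna[-3:] in stops


# Demonstration on the first two groups of the gene sequence only.
fragment = "ATGGCCCGCC GCATCCGTTG"
print(strip_groups(fragment))  # ATGGCCCGCCGCATCCGTTG
print(gc_percent(fragment))    # 70.0
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field, and `looks_like_cds` should hold for it because the sequence has 1665 bases and ends in TAG.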