Gene Information |
Locus tag | Anae109_1661 |
Symbol | |
ID | 5376056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | + |
Start bp | 1867158 |
End bp | 1868489 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640843170 |
Product | extracellular solute-binding protein |
Protein accession | YP_001378849 |
Protein GI | 153004524 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
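The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1867158, 1868489)
print(length_bp)                  # 1332
print(protein_length(length_bp))  # 443
```

Under these assumptions the computed values agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields.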
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.407934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGG ACGGATCGCG CGGCCTGACG CGCAGGGATC TCTTGAAGGC GGCCGGAGCG GGCGCCATCG CGGCGGCGGC GGGGAGTGCC GGGCGCGCGC GCGCGCAGCC GAAGACGCTC AAGATCGTGC AGTGGAGCCA CTTCATCCCA GCCTACGACA AGTGGTTCGA CGGCGTGTTC TGCAAGCGGT GGGGGGAGAA GAACGGCACG CAGGTGATCG TCGATCACAT CGCGATCGGC GAGATCAACG CGCGGGCGGC GGCCGAGGTG TCGGCGCAGC GCGGTCACGA TCTGTTCATG TTCCTGTCGC CGCCCGCCGC GTACGAGAAG CAGGTCATCG ACCACTCGGA GATCTACCAG GCGGTGGAGA AGAAGTGGGG CAAGGTCATC GACCTCGGCC ACAAGTCCAC CTTCAACCCG AAGACGAAGA AGTACTTCGC CTTCTCCGAC AGCTACGTGC CGGATCCGGG CAACTACCGT CAGGACCTCT GGTCGCAGGT CGGGTTCCCG AAGGGACCCG ACACCTGGGA GGACGTGCGC AAGGGCGGCA AGGCCATCAA GGACAAGTTC GGCAACCCCG TCGGCATCGG GCTCTCGCAG GAGCTCGACA CGAACATGGC CATGCGCGCG CTGATGTGGT CGTTCGGCGC GTCGGAGCAG GACGCCGAGG GGCGCGTGAC GATCAACTCG CCGCAGACCA TCGAGTCGCT CAAGTTCATG CGCGCGCTCT TCAAGGAGGC CGAGACGAGC GAGGTCTTCA CCTGGGACCC TTCGTCCAAC AACCGCGGGA TCCTCGCGGG CAAGCTGTCC TTCGTCTGCA ACGCCATCTC GGTGACGCGC ACCGCCGAGA AGGAGAACCC GGACATGTCG AAGAAGCTCC AGATCGTGCC CGCGCCGAAG GGTCCGGTGC GCCGCATGGC GGCCGAGCAC GTGATGGACT GCTACGCGAT CTGGAAGTTC GCCGAGAACA AGGAGGGCGC GAAGCAGTTC CTGGCCGACT ACATCGACGC GTTCGGCGAG GCGTTCAAGC AGAGCGAGTT CTACAACTTC CCGTGCTTCC CGAAGACCGT CCCCGACCTG AAGCAGCAGA TCGCGAACGA TCCGAAGGGC GTCCCGCCCG ACAAGTACAA CGTGCTCGGC GACGTGCTCG AGTGGGCGAC CAACGTCGGC TATCCCGGCT ACGCGTCCGC CGCGGTGGAC GAGGCGTTCA ACACGTTCGT CATCCCCACC ATGTTCGCGA AGGTGGCGCG TGACGAGCTG TCGCCCGAGG ACTCGGTGCG GGCGGCGGAG AAGGAGCTGA AGCGCATCTG GGACAAGTGG AAGACGGCCT GA
|
Protein sequence | MAKDGSRGLT RRDLLKAAGA GAIAAAAGSA GRARAQPKTL KIVQWSHFIP AYDKWFDGVF CKRWGEKNGT QVIVDHIAIG EINARAAAEV SAQRGHDLFM FLSPPAAYEK QVIDHSEIYQ AVEKKWGKVI DLGHKSTFNP KTKKYFAFSD SYVPDPGNYR QDLWSQVGFP KGPDTWEDVR KGGKAIKDKF GNPVGIGLSQ ELDTNMAMRA LMWSFGASEQ DAEGRVTINS PQTIESLKFM RALFKEAETS EVFTWDPSSN NRGILAGKLS FVCNAISVTR TAEKENPDMS KKLQIVPAPK GPVRRMAAEH VMDCYAIWKF AENKEGAKQF LADYIDAFGE AFKQSEFYNF PCFPKTVPDL KQQIANDPKG VPPDKYNVLG DVLEWATNVG YPGYASAAVD EAFNTFVIPT MFAKVARDEL SPEDSVRAAE KELKRIWDKW KTA
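The sequences above are printed in blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate codons. It assumes the standard NCBI amino-acid layout for translation table 11 (base order TCAG), does not handle alternative start codons, and uses illustrative function names. Only the first 30 bases of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI order (first base T, C, A, G;
# then second base; then third base). '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


prefix = "ATGGCCAAGG ACGGATCGCG CGGCCTGACG"  # first 30 bases of the gene
print(translate(prefix))    # MAKDGSRGLT
print(gc_fraction(prefix))
```

The translated prefix matches the first 10 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.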