Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0872 |
Symbol | |
ID | 4117202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 912890 |
End bp | 914515 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638035655 |
Product | extracellular solute-binding protein |
Protein accession | YP_643651 |
Protein GI | 108803714 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00426332 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGAC GGAACGCGCG AGCCAGGGTA GGGCTAGCCG GGATCTCCCG CGGGGAGTTC CTCAAGCTGA GCGGCGCCGG CCTCGCCGGC GCCGCTCTTC TGGGTGCTGC CGGCTGCGGC GGCGAGCAGG GCGGCGGCCA GCGGGGCGGC GGTGGCGGCG GGCGGACGGA GCTCATCGTG GGTCTGGACC AGGAGCCCGC CATCCTGAAC GGGTACATCG TGGGCGGGGA TCTGGTCGCC ACCTCCAACG TCACCCGGCC CGTCATGGAG AGCGTGCTGC AGATAATGCC GGACCTCTCG TACGCGCCGA AGCTCGCCGA CGGGGAGCCG CGGGTGGCCA GCGAGGACCC GCTCACCATC GAGTTCAGGC TGAAGGACGG GATCACCTGG TCCGACGGGG AGCCGCTCAC CGTCGAGGAC TACGTGTTCA CCTACAACAC CGTCATGAAC GATCGGTGGC AGATCATCAC GCGCGAGATC TGGGACAGCA TAGACCGGAT CGAGACCCCC GACGAGCTGA CCGCCAGGAT CATCTTCAAG AGGCCCGACG CCCGCTGGCG GGACATCCTT GCCGCCGACG TGCTGCCCAA GCACGTCCTG CAGGGGAAGA ACTTCAACAA ATACTTCAAC GACCGGATCG TGGGCAGCGG CCCCTACGTC TTCGAGGAGT GGCGCAAGGG GCAGAGCCTC ACGGTCGTCG CCAACGAGAA CTACTGGGGC GACCCCCCGG CGATAAAGAA GATCACCTTC CGCTTCATCC CCGACACCAA CTCGCTGAAG GCCGCCCTGC GCTCCGGCGA GGTGCAGTTC ATCAACCCGC CGCCGGACAT CGGGCTGATC GAGGAGCTGC GGGGCTACGA CGGGGTGACC GTCCAGACCA AGTTCGGCAC GGTCTGGGAG CACCTGGCCT TCAACGTGGA GAAGGTGGAC AACCTCAACA TCCGGCGGGC CATCGCCTAC GCGGTGAACC GCCGCCAGCT GATCCAGGAG ATCCTGCAGG GCGAGGCCCG GCCGCTGCAG AGCGTGTTGG TGCCCGAGCA GGAGCCCTTC TACACCCCGG CCTGGGAGCG CTACTCCTTC GACCCGGACC GGGCGCGCCG GCTCGTGGAG CAGGCGCGGG GGGAGGGGGC CTCCACGGAG ATAGAGTACT CCACCACCTC CGGGAACGCG CTGCGGGAGA CGGCCCAGCA GGTCATCCAG CAGCAGATGG AGCAGGTGGG GATAACGCTC CGGATAAACA ACTCCTCCGC CGAGACCTAC TTCGGGGAGC GCACGCCCGA GGGCGACTTC GAGATGGGCG AGTGGGCCTG GAGCGCGACC CCCGACCCCT CCATCACCAC GCTCTTCGGG GCGAACCAGG TGCCGCCGAA CGGGCAGAAC TACTACCGCT ACCGCAACGA GGAGGTCACC CGGCTGCTCG AGCAGGCCGA CATCACCGTG GACCAGCAGG AGCGGGCCCG GCTCACCCGC AGGGCCCAGG AGCTCATGGC AGAGGACGTG CCGCTGGTCC CGCTCTACCA GCGGCCCGAG ATCTACGCCT ACGCCGACAA CCTCGAGGGG CCGAGGGTCA ACCCGACGCT GGCCACCGCC TTCTGGAACG TCGGGGAGTG GCGCTTCACC GGGTAG
|
Protein sequence | MGRRNARARV GLAGISRGEF LKLSGAGLAG AALLGAAGCG GEQGGGQRGG GGGGRTELIV GLDQEPAILN GYIVGGDLVA TSNVTRPVME SVLQIMPDLS YAPKLADGEP RVASEDPLTI EFRLKDGITW SDGEPLTVED YVFTYNTVMN DRWQIITREI WDSIDRIETP DELTARIIFK RPDARWRDIL AADVLPKHVL QGKNFNKYFN DRIVGSGPYV FEEWRKGQSL TVVANENYWG DPPAIKKITF RFIPDTNSLK AALRSGEVQF INPPPDIGLI EELRGYDGVT VQTKFGTVWE HLAFNVEKVD NLNIRRAIAY AVNRRQLIQE ILQGEARPLQ SVLVPEQEPF YTPAWERYSF DPDRARRLVE QARGEGASTE IEYSTTSGNA LRETAQQVIQ QQMEQVGITL RINNSSAETY FGERTPEGDF EMGEWAWSAT PDPSITTLFG ANQVPPNGQN YYRYRNEEVT RLLEQADITV DQQERARLTR RAQELMAEDV PLVPLYQRPE IYAYADNLEG PRVNPTLATA FWNVGEWRFT G
|
| |