Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0621 |
Symbol | |
ID | 4116916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 651816 |
End bp | 653099 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638035407 |
Product | extracellular solute-binding protein |
Protein accession | YP_643404 |
Protein GI | 108803467 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.178997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCTT GCGCGCGGGT TTTGTGTGGG GTGCTGGCGG CCGCCGGGCT TGTGCTCGGG GTCTTGGGCT GCGGCGGGGG CGCCTCTGGC GGGGAGGGCG CAATCACCGT GTGGAGCTGG CGCACGGAGG ACGTGGCCGC CTACCAGAAG ATCTTCGCCG AGTTCGAGGA GGAGACCGGC ATAAAGGCCG AGTTCAAGCC GTACAAGAAC ACCGAGTACG ACACCATCCT GGAGACCGCC CTCAAGGGGG GCAAGGGGCC CGACGTGATG CAGCTGCGGG CGTACGGCGG GCTGCAGCCG CTCGCCGACG CCGGCTATCT CGTGCCGCTC GACGGGAAGG TGGAGCGGCT GGACGAGTTC TCCGAGCAGG CGCTGGACGG GGCGCGGAGG CTGGAGGACG GCAGGATCTA CGGGGTGCCC TTCGCCATCC AGACCCTGCA GGTCTTCTAC AACAAGAGGA TCTTCGAGGA GCACGGCCTC GAGGAGCCGC GCACCTACCG GGAGTTCGTC GCGGCCGCCG AGAGGCTGGA GGAGGCCGGG GTCACGCCCA TCGCCGCCGG GGGGCGGGAC ATCTGGACGC TGCCCATCCT GCACTCCGTG GTCGGGGCCG AGGTCTACGG CGGGGACCGC TTCGTGGACC AGGTGCTCGA GGACACCTCG GCCTTCACCG GCCCGCGGTT CGTGGAGTCC GTCGCGGCCG TCCGGGAGAT CCTGCCCTAC CTCCCCGAGG ACCCGGCGGG CGTCTCCTAC ACCGACACGC AGGTGCTCTT CACCCAGGAG CGGGCGGCCA TGTTCATCGG GGGCAGCTGG GAGGCCGGGT ACTTCCGGAG CACGAACCCC GACCTCCGGT TCGGGACCTT CCCCATGCCG CCGCGCCAGG GGAGCGGCCC GGGGCTCGTC TCCGCCTTCG TGGACGGCTC CTACGGCGTC AACGCCGCCT CGGACGACAA AAAGGCGGCG CTCCGGCTCG TCGAGTTCAT GGCCAGCGAG AGGTTCGGCC AGATGTTCGC CGACGAGCTC AAGCAGATCT CGCCGGTCCC CGGCGTGGAG TTCAGGGACC CCGTGCTCCG CGGGATGGTC TCCGACTACG AGGCCAACCA CACCCCCTAC CTGCTGCTGG TCTACTTCCG CTACGGCGAC CCCTCCGGCA CCGACCTGCT CGGCCAGGGC CTCCAGAACA TGATGCTCGG GAAGGCCACC CCGCGGCAGG TCGCCGAAAG CCTGCGGAGG GGCGTCTCCC AGTGGTACGA GCCCGGGATG CTGCGGGAGG TGAGCGTTGG CTGA
|
Protein sequence | MRACARVLCG VLAAAGLVLG VLGCGGGASG GEGAITVWSW RTEDVAAYQK IFAEFEEETG IKAEFKPYKN TEYDTILETA LKGGKGPDVM QLRAYGGLQP LADAGYLVPL DGKVERLDEF SEQALDGARR LEDGRIYGVP FAIQTLQVFY NKRIFEEHGL EEPRTYREFV AAAERLEEAG VTPIAAGGRD IWTLPILHSV VGAEVYGGDR FVDQVLEDTS AFTGPRFVES VAAVREILPY LPEDPAGVSY TDTQVLFTQE RAAMFIGGSW EAGYFRSTNP DLRFGTFPMP PRQGSGPGLV SAFVDGSYGV NAASDDKKAA LRLVEFMASE RFGQMFADEL KQISPVPGVE FRDPVLRGMV SDYEANHTPY LLLVYFRYGD PSGTDLLGQG LQNMMLGKAT PRQVAESLRR GVSQWYEPGM LREVSVG
|
| |