Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0070 |
Symbol | |
ID | 4117914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 71453 |
End bp | 74266 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638034864 |
Product | glycoside hydrolase family protein |
Protein accession | YP_642863 |
Protein GI | 108802926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CGAGGGAGAT CGCCGAGCGC AGCGAGGCGG GCCGGGAGCC ACTAAACATA GTAGTCATAC CGCACTCCCA CTGGGACAGG GAGTGGTATG CGACCTTCGA GGAGTTCCGG TTCTACCTGG TCCGCTTCAT GGACGAGCTG CTCGAGGTTC TTGAGAAGGA CGAGGCCTTC CGGTCCTTTC TGCTCGACGG GCAGGTCTCG CTGCTCGAGG ACTACCTGCA GGTCAGGCCG GAGAAGCTGG ACGAGCTGCG GCGGCTCGTG CAGGAGGGCC GGCTGGACAT CGGGCCGTGG TACGTCCAGC CCGACGAGTT CCTCGTCTCC GGCGAGGCCC TGGTGCGGAA CCTCCTGATC GGCGACCGGA TCGGCCGGCA GTTCGGCCCC GTGATGAAGC AGGGCTACGT GCCGGACACC TTCGGGCACG TAAGCCAGCT GCCCCAGATC CTGCGGGGCT TCGGGATCGG CACCTTCTAC TTCATGCGCG GGCTCGCCGA GAGCGTGGAC GAGCTGCGCA GCGAGTTCTG GTGGGAGGCG CCCGACGGCT CAAGGGTGCT CGCCCACTTC CTCTCCGAGA CCTACTCCAA CGCCGCGGTG CTCGGGCCGG ACCCCGAGAA GATCTCCTTC CGCCACGACC GCCTCGCCGA GCCCAGCAGC TACTTCGTAA GCTACGACAG CCTCTACGAG CTCAGGGACC GGCTCGCCGC GCGGGCGGCC GACAGGACCA TCCTGTTCAT GAACGGCAGC GACCACGTCA CCGTACAGCC GGACTTCTCC CGCTACGTCT TCGCCCTCAA CCAGGCCATG GAGGACCGGC TCTTCAACGG GCGCCTGGCG GACTTCGAGC GGCTGGTGCT GGAGGAGAAC CCGCCCCTCA AGACCTACCG GGGAGAGCTG CGCTGGGCGC GCTACCAGCC CATCCTCAAG GGGGTCTACT CCTCCCGGAT GTACCTCAAG CAGGAGAACG AGCGTACCCA GCAGCTGCTC GAGGGCCTGG CCGAGCGGGC GGCGGCCGTG GTCTACGCCC TCGGGGGGCC GGACTACTCT CCCTTTCTGC GCTACGCCTG GAAGGAGCTG CTCAAGAACC ACGCCCACGA CTCCATCGGC GGCTGCTCCG CCGACGCCGT CCACCGGCAG ATGCTCGGGC GCTACGACAC CGTCCGGCGC GTCGGGCAGA AGGTGGTGGA CGAGGCCCTG GACTACATAG CCTCCCGGGT GGCCCCCGAG CCCGACGCCG GCTCCATCCC CATCGTCGTC TTCAACCCCA GCCCGTGGGA GCGGGGGGGC ACCGTGCAGG TCGAGGTCTC GCTCAACCTG GACGCGCCCC TGAAGCGCCG GATCTTCGAC TGGATCGGCC AGAAGGAGTT CGACGTGGAG CGGGCCGCGC TCCTGGACCC GGACGGCAGG GAGGTGCCCT TCTCCATACG GGGGCGGCGG CTGCACATAG AGGACGCCCT CTACCGCCGC AAGGCCGTCC GCCGGGCCAC GGTGGAGTTT CTGGCCGAGA AGATCCCGCC GCTCGGCTAC AAGGTCTACC GGCTCGTCGA GACGCCCGAG CGGGAGCCGA CCTCCGAGGA GCGGGAGGTG CCCTGGGAGG TGGCCCTGGA GAACGAGCGC CTCAAGGTGA CCGTCGAGGG CGACATGACG CTCACCATCC TGGACAAGAC CACCGGGGAG CGCTACGCGG GGCTCAACCT CTTCGTGGAC GAGGCCGACG CCGGGGACGA GTACACCTTC TGCCCGCCGC GCGAGCAGCG GCGGGTGCTC TCCTCGGAGG AGGACTGGCG GGTGGAGGAC ACCGGCGACC CGCACACCCT GCTGCTCCGG GGGAGCATGC TGCTGCCCAA GGGCCTCGCC CCGGGCCGGA GCGCCCGCTC CCTGCAGACG GTCCGCTGCC CGCTCAACGT CCGGCTCCGG CTGCTGCCGG GGGCGCGGCG CCTGGAGATC TCCACCGAGT TCTTCAACCG GGCCAAGGAC CACCGGCTGC GGGTGGTCTT CCCGGCGGGG TTCCCGGCCC GGGAGTCGGT CGCCGAGACC GCCTTCGGCA CCGTCCGGCG CCCCACGCGC CCGACGGACA GCGCCGGCTG GCGCGAGAAG GACACCGCCA CCTACGCCCA GCGGCGCTTC GTTTGCGTCG AGGACCCCGA GACCGGAAGG GGGCTCGCGG TCCTCAACAA GGGGCTGCCG GAGTACGAGG TGACCCCGGA GGGGGAGGTG TGCCTGACGC TCCTGCGCGG GGTGGGCTGG CTCTCACGCA ACGACCTCTC CACCCGCACC GGCCACGCCG GGCCGGGGCT CGCCACCCCG GAGGCCCAGT GCTCCGGGCG GCACGTCTTC GAGTACGCCG TCGTCCCGTA CACCGGCGGC CACCGGGAGG CGGGCATCTT CCGGGAGGCC GAGGAGTACT GGCTGCCGCT GGAGGCCTGG GACGTCCACC GGGGCGAGCG GCCCCGGGAG GGCTCGCCGG CCGGGGCCCC CGGCTCCTTC CTGCGGGTGC ACGGGAAAGA CGCCGTGCTG AGCACGCTCA AGAAGGCCGC CGACCGGGAC GGCCTCGTCC TGCGCCTCTT CAACGCCTCG GAGGAGGAGA GCCGGGCGGT GCTCAACTTC GGCATCCCCA TCGCCGCCGC CTACAGGACC AACCTGAACG AGGAGATCCT GGAGGAGCTC GCCCCCAAGG GCCACCGGCT GAGGGTGAAC CTCAGGCCCT GCGGCATCGA GACCGTGCTC GTCAAGCTGC ACAGGCCCGA AAGGTGGGCC GAGAGGATCA GGCGGACCGC GCCGGAGCGG ACGAGAGGAA GGAGGACCGG ATGA
|
Protein sequence | MSTAREIAER SEAGREPLNI VVIPHSHWDR EWYATFEEFR FYLVRFMDEL LEVLEKDEAF RSFLLDGQVS LLEDYLQVRP EKLDELRRLV QEGRLDIGPW YVQPDEFLVS GEALVRNLLI GDRIGRQFGP VMKQGYVPDT FGHVSQLPQI LRGFGIGTFY FMRGLAESVD ELRSEFWWEA PDGSRVLAHF LSETYSNAAV LGPDPEKISF RHDRLAEPSS YFVSYDSLYE LRDRLAARAA DRTILFMNGS DHVTVQPDFS RYVFALNQAM EDRLFNGRLA DFERLVLEEN PPLKTYRGEL RWARYQPILK GVYSSRMYLK QENERTQQLL EGLAERAAAV VYALGGPDYS PFLRYAWKEL LKNHAHDSIG GCSADAVHRQ MLGRYDTVRR VGQKVVDEAL DYIASRVAPE PDAGSIPIVV FNPSPWERGG TVQVEVSLNL DAPLKRRIFD WIGQKEFDVE RAALLDPDGR EVPFSIRGRR LHIEDALYRR KAVRRATVEF LAEKIPPLGY KVYRLVETPE REPTSEEREV PWEVALENER LKVTVEGDMT LTILDKTTGE RYAGLNLFVD EADAGDEYTF CPPREQRRVL SSEEDWRVED TGDPHTLLLR GSMLLPKGLA PGRSARSLQT VRCPLNVRLR LLPGARRLEI STEFFNRAKD HRLRVVFPAG FPARESVAET AFGTVRRPTR PTDSAGWREK DTATYAQRRF VCVEDPETGR GLAVLNKGLP EYEVTPEGEV CLTLLRGVGW LSRNDLSTRT GHAGPGLATP EAQCSGRHVF EYAVVPYTGG HREAGIFREA EEYWLPLEAW DVHRGERPRE GSPAGAPGSF LRVHGKDAVL STLKKAADRD GLVLRLFNAS EEESRAVLNF GIPIAAAYRT NLNEEILEEL APKGHRLRVN LRPCGIETVL VKLHRPERWA ERIRRTAPER TRGRRTG
|
| |