Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_1612 |
Symbol | |
ID | 4116359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 1634889 |
End bp | 1637792 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638036412 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_644386 |
Protein GI | 108804449 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTTAC GCGGGGAGCC GGACAGGTCC TTGATCGAAC GGGTGGCGAC GAGGCTCGGG ATCATCCCGG ACGTGGACCA GCCTGAGGCG GGCGGCGTGG AGCATCTAGG CTCGGGTTCC GCTCTTGCGA ACTTTCCGCC GCCCGAGAAG TGGGACGAGT GGACCGAATA CGAGGCCAGA GGCTGGACCC GGCGGGAGAA GAGAACATAC CAGATCGTGC CGACCATCTG CTTCAACTGC GAAGCGGCCT GCGGCCTTCT GGCCTACGTG GACCGGAACA CGAGGGAGAT CCGTAAATTC GAGGGGCACC CGCTGCACCC GGGGAGCCGG GGCCGCAACT GCGCGAAGGG CCCGGCGACC ATAAACCAGA TCAAGGACCC CAACCGCATT CTCTACCCCG TAAGGCGCAA GGGACCTCGC GGCAGCGGCG AGTGGGAGCG CGTCTCCTGG GAGGAAGCGC TCGAGGACAT CGGCGCGCGC ATCCGGAAGG CTATAGTGGA AGACAGGCGC ACGGAGATCA TGTACCACGT CGGACGCCCG GGCTTCGAGC ACCCGCACAT GGAGCGCGTC TTCCGGGCCT GGGGCGTGGA CGCCCACAAC TCCCACACCA ACGTCTGCTC GAGCAGCGGT CGGTTCGGCT ACCAGATAGT CGGCGGCTTC GATAGGCCCT CGCCGGACTA CGCCAATGCC GGGGCCATCG TGCTGGTGAG CGCGCATCTG GAGAGCGGGC ACTACTTCAA CCCGCACGCC CAGAGGATCA TCGAGAGCAA GATGAAGGGC GGAAAGCTCA TCGTGCTCGA TCCCCGGCTC TCCAACTCGG CGGCGATGGC GGACTACTGG CTTCCTACCC GGCCCGGGTC AGAGGCCGCG GTCCTTCTGT GCGTGGTCAA CGTCATCCTG CAGGAGGGCC TCTGCGACGA GCGGTTCCTC TACGACTGGA CCAACTGGCG CGAGTACCTG AGGAACCGGC GGCCGGGGGA GGATCCCGAC TCCTTTGAGA GCTTTCTGGA GGCGCTCAGG GAGGAGTACG CAAGGTACAC GCCGGAGTAC GCCGAGGCCG AGAGCGGGGT TCCGGCGGAG AGGATCCTTG AGGCCGCGCG CGTGGTCGGC AATGCGGCCC CCGCCGTGGC CGCCCACACC TGGCGCGCGG CCTGCGCGGG GAACCTGGGC GGTTGGGCCG TCTCCCGGGC GCTGGCTTTC CTGGGCGTCG TCACCGGAAG CTGGGGGAGG CCGGGCGGCA CCAACCCCAA CGGCTGGAAC AAGTGGCTGC CGCATTTCTG GGAGGAAGCC CCGCCGCAGA AGGCCTGGAA CGAGCTCACC TACCCCAGGG AGTACCCTCT TGCGGCCAAC GAGATGAGCT TTCTGCTGCC GCACTTCCTC ATGGAGGGTC GTGGGAGGAT CGAGGTGTAC TTCAGCAGGG TCGTAAACCC CGTGTGGACC TATCCGGACG GCTTCTCCTG GATCGAAGCG CTCGCCAACG AGGACTACAT AGGGTGTCAC GTTGCGCTGA CGCCGACGTG GAACGAGACG GCGTACTTTG CGGACTACGT GCTGCCTTTG GGCCACGGCC CCGAGCGCCA CGACGTGATG AGCCAGGAGA CCCACTCGGG GGTGTGGCTC TCCTTCAGGC AGCCTGTCCT GCGGGAGGCG TCGCGCAGGC GGGGTGAGAG GGTCGCCTAC ACCTACGAGG CGAACCCCGG AGAGGTTTGG GAGGACGACG AGTTCTGGAT CGAGCTCTCC TGGAAGATAG ACCCCGATGG CTCGCTGGGC ATAAGGCGGC ACTTCGAGAG CCCCTACGAG CCGGGCAGGA AGCTCACCAC CGAGGAATAC TACCGCTGGA TCTTCGAGAA CGCCGTGCCA GGGTTGCCGG AGAAGGCGGC CGAGGAGGGG CTCTCTCCCC TGGAGTACAT GCGCAGGTAC GGGGCGTTCG AGGTAAAGAG CGCCGTCTAC GAGCGCAACA TGCGGGAGCT CTCCGAGAAG GAGCTCGAAG GCTCCCGCGT CGAGCCCGAC GGCACCATCA CCACGGAGAA GAGCCGCACC CGGGGTTCTC TGCGAGCCAA CCGCCTGATG CCCCACGTCG GGGTGGTGGT GGATGGGAAG GCGCGCGAGG GCTTCCCCAC CCCCTCCGGG AAGCTGGAGA TATGGTCTTC GACCATGGCG GCGTGGGGCT GGGAGGAGCA CGCCACGCCT GGCTACATAA AGAGCCACGT CCACCCCGAG AACCTCGAAG AGGGGCAGCT CGTGCTCAAC GCCACCTTCC GGCTGCCGAC GCTCATCCAC ACCAGGAGCG GCAACTCCAA GTGGCTCAAC GAGATCTCCA ACAGGAACCC CCTCTGGATC CACCCCAAAG ACGCCGAGGA GCGCGGTGTC GAGACCGGGG ACCTGGTGCG CGTGGTCACG GAGATCGGCT ATTTCGTGAA CCACGCCTGG GTGACCGAGG GGATAGCCCC CGGCGTGGTG GCCTGCTCGC ACCACCTGGG CCGCTGGCGG CGCAGGAAGG ACAGGGCGGA CAGGTGGAGC AACGCCCTCG TAGACATAAC CCGGGACGGA GACGGAGGCT GGAAGCTCAG GCAGGTGGAG GGGCCCGGCC CTTACGACAG CCCCGACCCG GATACCAGGC GCATCTTCTG GAGCGACGGC GGGGTGCACC AGAACTTGGC CTTTCCGGTG CACCCAGACC CCGTAAGCGG GATGCACTGC TGGCACCAGG CGGTGTACGT GGAGAGAGCC CACCCCGAGG ATCGCTACGG CGACGTCTAT GTGGATACCC GAAAAAGCCG GGAGGTCTAC CGCAGGTGGC TCTCGATGAC CAGGCCGGGG CCCCTGGAGA ACGGCCTCAG GCGCCCGCCG GTCTTCGACA GGCCCTACCG CCCGGACGAG AGCTGCTTCT ATGTAAGGGA GTAA
|
Protein sequence | MVLRGEPDRS LIERVATRLG IIPDVDQPEA GGVEHLGSGS ALANFPPPEK WDEWTEYEAR GWTRREKRTY QIVPTICFNC EAACGLLAYV DRNTREIRKF EGHPLHPGSR GRNCAKGPAT INQIKDPNRI LYPVRRKGPR GSGEWERVSW EEALEDIGAR IRKAIVEDRR TEIMYHVGRP GFEHPHMERV FRAWGVDAHN SHTNVCSSSG RFGYQIVGGF DRPSPDYANA GAIVLVSAHL ESGHYFNPHA QRIIESKMKG GKLIVLDPRL SNSAAMADYW LPTRPGSEAA VLLCVVNVIL QEGLCDERFL YDWTNWREYL RNRRPGEDPD SFESFLEALR EEYARYTPEY AEAESGVPAE RILEAARVVG NAAPAVAAHT WRAACAGNLG GWAVSRALAF LGVVTGSWGR PGGTNPNGWN KWLPHFWEEA PPQKAWNELT YPREYPLAAN EMSFLLPHFL MEGRGRIEVY FSRVVNPVWT YPDGFSWIEA LANEDYIGCH VALTPTWNET AYFADYVLPL GHGPERHDVM SQETHSGVWL SFRQPVLREA SRRRGERVAY TYEANPGEVW EDDEFWIELS WKIDPDGSLG IRRHFESPYE PGRKLTTEEY YRWIFENAVP GLPEKAAEEG LSPLEYMRRY GAFEVKSAVY ERNMRELSEK ELEGSRVEPD GTITTEKSRT RGSLRANRLM PHVGVVVDGK AREGFPTPSG KLEIWSSTMA AWGWEEHATP GYIKSHVHPE NLEEGQLVLN ATFRLPTLIH TRSGNSKWLN EISNRNPLWI HPKDAEERGV ETGDLVRVVT EIGYFVNHAW VTEGIAPGVV ACSHHLGRWR RRKDRADRWS NALVDITRDG DGGWKLRQVE GPGPYDSPDP DTRRIFWSDG GVHQNLAFPV HPDPVSGMHC WHQAVYVERA HPEDRYGDVY VDTRKSREVY RRWLSMTRPG PLENGLRRPP VFDRPYRPDE SCFYVRE
|
| |