Gene Information |
Locus tag | Rxyl_0815 |
Symbol | |
ID | 4116191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 849866 |
End bp | 851026 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638035599 |
Product | inosine 5'-monophosphate dehydrogenase |
Protein accession | YP_643595 |
Protein GI | 108803658 |
COG category | [C] Energy production and conversion |
COG ID | [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
|
|
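The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 849866
end_bp = 851026
gene_length_bp = 1161
protein_length_aa = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 1161 386
```

Both computed values agree with the Gene Length and Protein Length fields of the record.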
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0124541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCG AGATCGGGAA GGGAAAGACA GCCCGCAGGG CCTACGGGCT GGACGAGATA GCCATAGTAC CCAGCCGCCG CACGCGCGAC CCCGAGGACG TGGACATCTC CTGGAACCTG GGGGATCTGC ACCTGGACCT GCCGTGCCTG GCCAGCGCGC TCGATGCGGC GGTGGACCCC ACCACGGCCG GGATCATCGG GGAGCTTGGG GGGCTGGCGG TGCTCAACCT GGAGGGCATC CAGACCCGCT ACGAGGACCC TGAGCCGGTC TTCGAGGAGA TCGCGAGCCT GCCGCAGGAG AAGGCCACGC GGGTGATGCA GGAGATCTAC TCCGAGCCCG TGAAGGAGGA GCTGATCTTC CGGCGGGTGC AGGAGATAAA GGACAAGGGG GTCATCGCCG CCGCCTCCCT GACCCCGCAG CGGGTCGAGC GGTACCACCG GGCGGCCATA GAGGCCGGGC TGGACGTGCT CGTCATCCAG GGCACGGTGG TCTCCGCCGA GCACGTCTCC CGGCAAGCCA AGCCGCTGAA CCTCATGGAG TTCATCCCCT CGCTCAACGT CCCGGTGGTG GTGGGCGGTT GCGCCAGCTA CTCGACGGCG CTGCACCTCA TGCGCACCGG GGCCGTGGGG GTGCTGGTGG GGGTGGGGCC GGGCCGCATC TGCACCACCC GGGGCGTGCT CGGGGTGGGG GTCCCGCAGG CCACCGCCAT CGCCGACGCC GCGGCGGCCC GCACCCGGCA CTACATGGAG ACCGGCCAGT ACGTCAACGT GATCGCCGAC GGCGGGATGC GCACCGGCGG CGACATAGCC AAGGCCATCG CCTGCGGGGC CGACGCGGTG ATGCTCGGGA GCGCCTTCGC CCGGGCCGAG GAGGCGCCGG GGAAGGGGTA CTCCTGGGGG ATGGCAACCT TCCACCCCAC GCTCCCGCGG GGGACCAGGA TAAAGACCGG CACCGTGGGG ACCATAGAGG AGATCCTGCT CGGCCCCGCG CACGAGAACG ACGGCACCCT GAACCTGATG GGGGCGCTGC GCACCAGCAT GGCCACCACG GGCTACCAGA ACATAAAGGA GTTCCAGAAG GCCGAGGTCA TGGTGGCCCC CGCGCTGGCC ACGGAGGGCA AGCTGGAGCA GTTCTCCCAG GGCGTGGGGA TGGGCCGTTA G
|
Protein sequence | MDIEIGKGKT ARRAYGLDEI AIVPSRRTRD PEDVDISWNL GDLHLDLPCL ASALDAAVDP TTAGIIGELG GLAVLNLEGI QTRYEDPEPV FEEIASLPQE KATRVMQEIY SEPVKEELIF RRVQEIKDKG VIAAASLTPQ RVERYHRAAI EAGLDVLVIQ GTVVSAEHVS RQAKPLNLME FIPSLNVPVV VGGCASYSTA LHLMRTGAVG VLVGVGPGRI CTTRGVLGVG VPQATAIADA AAARTRHYME TGQYVNVIAD GGMRTGGDIA KAIACGADAV MLGSAFARAE EAPGKGYSWG MATFHPTLPR GTRIKTGTVG TIEEILLGPA HENDGTLNLM GALRTSMATT GYQNIKEFQK AEVMVAPALA TEGKLEQFSQ GVGMGR
|
| |
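The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical; only the first and last codons of the record are embedded to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 21 bases of the gene sequence listed above.
first_codons = "ATGGACATCG AGATCGGGAA GGGAAAGACA"
last_codons = "GGCGTGGGGA TGGGCCGTTA G"

print(translate(first_codons))  # MDIEIGKGKT
print(translate(last_codons))   # GVGMGR*
```

The two outputs match the start and end of the listed protein sequence, with `*` marking the terminal TAG stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.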