Gene Information |
Locus tag | Rmar_0472 |
Symbol | |
ID | 8567106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 518649 |
End bp | 520697 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | transketolase |
Protein accession | YP_003289762 |
Protein GI | 268316043 |
COG category | |
COG ID | |
TIGRFAM ID | |
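The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the usual GenBank-style convention) and an unspliced CDS whose final codon is a stop. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(518649, 520697)
print(length, protein_length_aa(length))  # prints: 2049 682
```

Both values agree with the Gene Length and Protein Length fields of this record.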
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAC AGAAGGCGAC GGAGGCGATT CGGGAAACCG ACTTTTCGCC GGGTGAACTG GACAATCTGT GCATCAATAC GATTCGATTT CTGGCCGTCG ATGCCGTCGA GCAGGCGAAG TCCGGCCATC CAGGCATGCC GATGGGCGCG GCGCCAATGG CCTACGTGCT CTGGACCCGG CATCTGCGGC ACAATCCGCG CGATCCGAAG TGGCCGAACC GGGATCGGTT CGTGCTTTCG GCCGGTCATG GCTCGATGTT GCTCTACGCG CTGCTGCACC TGACCGGCTA CGACCTGCCC ATGGAGGAGT TGCAGCGCTT TCGTCAGTGG GGATCGCGCA CGCCGGGCCA TCCGGAATAC GGGTTAACGC CCGGGGTCGA GACCACGACG GGACCGCTGG GCCAGGGCTT CGGAAATGCC GTCGGGATGG CGATCGCCGA GCAGTATCTG GCGGCGCATT TCAACCGGGA CGGATTCCCG CTGTTCGACC ATTTCACGTA CGTGATCGCC TCCGACGGCG ATCTGATGGA GGGCATCTCG CACGAGGCGG CTTCGCTGGC CGGACATCTC GGGCTGGGGA AGTTGATCGT GCTCTACGAT GACAACGACA TTTCCATCGA CGGCTCGACG GACATCACCT TCACCGAGGA CGTCGGCGCG CGCTTTGAGG CCTACGGCTG GCATGTGCAG CGCGTGGACG ACGGGAACGA CCTGGTAGCC ATCGATGCGG CGCTTCGGCA GGCCAAAGCC GAGACCGAAC GCCCTTCGCT CATCATCGTG CGCACGCACA TCGGCTACGG AAGCCCGAAC AAGCAGGACA CGCCCGCCGC GCACGGCGCG CCGCTCGGAC CCGAAGAGGT GCGCCTGACC AAGCGCAACC TGGGCTGGCC CGAGGACAGG ACCTTCTACG TGCCGGACGA AGTCTACCGG CACATGCGGC AGGCCGTCAC GCGGGGGCAG CAGTGGCAGG CCGAATGGGA GGCGCTGCGC GCCCGCTATC GGGAGGCCTA TCCCGCCGAA GCCGCTGAAC TGGACCGCTG GCTGAGCCGG CGGTTGCCCG AAGGGTGGAG CGAGGGACTC CCGACCTTCG AGGCGGGCAA GGCCGTGGCC ACGCGTAATG CCGGCGGCGC CGTGCTCGAC GTACTGGCCG CCCGTATTCC CGAGCTGATC GGCGGCTCGG CCGACCTGGC CGAGTCGAAC AAGACGCATC CGAAAGGGCG CGAGGCCTTC AGCCGCGACA ACCGCAAGGG CGGCTACATC CATTTCGGGG TGCGCGAGCA TGCCATGGCG GCCATCTGCA ACGGGCTGTC GCTGCACGGA CTGCGGGCCT ACGCGAGCAC CTTTCTGGTC TTCAGCGATT ATCTGCGGCC GTCGCTGCGG CTGAGCGCCC TCATGGAGCA GCCGGTCATC TACGTGTTCA CGCACGACTC GATCGGGCTG GGCGAGGACG GGCCCACGCA TCAGCCGATC GAGCATCTGG CCAGCCTGCG CGCCATCCCG CACGTGGTGG TGCTGCGGCC GGCCGACGCG ACCGAGACGG TGGAAGCCTG GAAGGTGGCG CTCGAGCGCG AGGACGGTCC CACGGTGCTC GTACTGACAC GTCAGAACGT GCCGGTGCTG GACCGGAGCC GCCTGGCACC GGCCGATGGG GTGCGCCGTG GCGCTTACGT GCTCAAAGAA GCACAGGGCG CGCTGCAGGC GATCCTGCTG GCTTCGGGCA GTGAAGTGCA TGTGGCGCTG GCGGCCGCCG AACAGCTCGA AGCCGAAGGC ATCGGCACGC GCGTCGTCAG CGTGCCTTCC 
TGGGAGCTGT TCAAAAAGCA GGAGGCGGCC TATCGGGAAT CGGTACTCCC GCCGGAGGTG ACCGTGCGGG TGGCCGTCGA AGCCGGGGTC GGACAGGGAT GGGAGCAGTT CGTGGGGTGC CGGGGCCGCA TCGTCAGCAT CGAGCGCTTC GGCGCTTCGG CCCCCGGCAA GGTCCTGTTC GAAAAATTCG GCTTCACGCC CGAGCGGGTG GCCAGTGAAG TGCGTGCGCT GCTGGCGCAG AACCATTGA
|
Protein sequence | MQEQKATEAI RETDFSPGEL DNLCINTIRF LAVDAVEQAK SGHPGMPMGA APMAYVLWTR HLRHNPRDPK WPNRDRFVLS AGHGSMLLYA LLHLTGYDLP MEELQRFRQW GSRTPGHPEY GLTPGVETTT GPLGQGFGNA VGMAIAEQYL AAHFNRDGFP LFDHFTYVIA SDGDLMEGIS HEAASLAGHL GLGKLIVLYD DNDISIDGST DITFTEDVGA RFEAYGWHVQ RVDDGNDLVA IDAALRQAKA ETERPSLIIV RTHIGYGSPN KQDTPAAHGA PLGPEEVRLT KRNLGWPEDR TFYVPDEVYR HMRQAVTRGQ QWQAEWEALR ARYREAYPAE AAELDRWLSR RLPEGWSEGL PTFEAGKAVA TRNAGGAVLD VLAARIPELI GGSADLAESN KTHPKGREAF SRDNRKGGYI HFGVREHAMA AICNGLSLHG LRAYASTFLV FSDYLRPSLR LSALMEQPVI YVFTHDSIGL GEDGPTHQPI EHLASLRAIP HVVVLRPADA TETVEAWKVA LEREDGPTVL VLTRQNVPVL DRSRLAPADG VRRGAYVLKE AQGALQAILL ASGSEVHVAL AAAEQLEAEG IGTRVVSVPS WELFKKQEAA YRESVLPPEV TVRVAVEAGV GQGWEQFVGC RGRIVSIERF GASAPGKVLF EKFGFTPERV ASEVRALLAQ NH
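The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of cleaning a sequence, computing its GC fraction, and translating it. It assumes unambiguous bases (A, C, G, T only) and an ATG start codon, as in this record; alternative start codons permitted by translation table 11 are not handled. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest: T, C, A, G).
# Translation tables 1 and 11 share these assignments; they differ only in
# which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGCAGGAAC AGAAGGCGAC G"))  # prints: MQEQKAT
print(translate("AACCATTGA"))                # prints: NH
```

The two fragments reproduce the first seven and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.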