Gene Information |
Locus tag | RoseRS_1073 |
Symbol | |
ID | 5208019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1331846 |
End bp | 1334389 |
Gene Length | 2544 bp |
Protein Length | 847 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640594687 |
Product | thiamine pyrophosphate enzyme, central region |
Protein accession | YP_001275432 |
Protein GI | 148655227 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [COG4221] Short-chain alcohol dehydrogenase of unknown specificity |
TIGRFAM ID | |
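The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values reported in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1331846, 1334389)
print(length)                  # 2544
print(protein_length(length))  # 847
```

Both results match the Gene Length (2544 bp) and Protein Length (847 aa) fields of the record.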
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGAA CAACCTATCC GCAGCGCACC TGGCTCATCA CCGGTTGCGC GACCGGCTTT GGCGCCCGCC TGGCACAGCA AGTGATCGAT CGCGGGGAGC GCGTCGTCGC TACCGACCGC GCTGTTGACG CGCTGGCGCA TCTGCACACC GACGATCCTG CGCGCTTGCT GCGTCTGGCG ATGGATGTCA CCGACCCGGA TGCCGTCCGT CGCGCTGTTG AGATGGCGGT GGCGCGCTTC GAGCGTATCG ATGTGCTGGT CAACAACGCT GGTCTCGGAC ACGGCGGACC GCTGGAGGAA GCACGAGTGG ACGATATTCG CCGCCTCTTC GATGTGAATA TCATTGGTAT GATGATCGTG ACGCAGGCGG TTCTTCCTCA TATGCGCGCA TCTGGTGGCG GGTATATTAT CAATATCTCG TCGGACAGCG GCGTCGTGGG ATTTCCCTTC CAGGGCGTCT ACACCGCCAC CAAGCATGCC GTCGAAGGTT TCTCCGACTG TCTCTATCAG GAAGTAACCC CCTTCGGCAT CCGTGTATCG GTCATCCAGC CATGCGGCAT GTTCAAAACC GACATGCCCG CCAGTACGAT CACCGCTGCC AGAGCCGCGA TCCGTCCTGA CAGCCCGTAC TATGCCCGTG CCACCCGCAT GGCGGACGCG CTCGCCGCTG CCTGGGAGCA GAGCAGTGAT CCGCAGGACG TGGTCGATGC GATCATCGAG GTTGCCGATG CCGATCCGCC GCCGCTGCGC CGCCGCGTCG GTCCGCCCGA CCGCACCGCG CTGCCCGGTT TGCGTCAGCG GATGTCGCAC GAAGAGTTCG TCACATTCAT CTATCGTATG ACCAGCGAAG ATGTCGCGCG ACCTGCCATG CCGCGCGGGC GCGTCGATGG CGGACGACTG GTGGTACGCA CGCTGCGCGA AGCGGGGGTG ACGCATGCCT TCACCATCGT TGGCGGACAT AACTACCAGC TGGTCAATGC CTGCCGCGAA GAGGGCATCC GTGTTATCGA CGTGCGCAAC GAGATGCACG CCGCGCATAT GGCGGATGCC TTCGCTCGCT TTACCCGCCG ACCGGCGCTG CTCACCGTCG ATGCCGCACC AGGTCTGGTC AACGCTATCG CCGGAATTGA AGTCGCATAT GAGGCGCAGG TTCCGCTGAT TATTGCCTGT GCCCAGGGGT CGCTGGCAGG GCGCGACATC GGTGTCATGC AGGCGATCGA TCAACTTCGA CTGTTGCGCC CGGTCACCAA ATGGCAGCGC ACATGTTTCG ATGTGAAACG GTTGCCGGAA TACACCGCCG CTGCGTTGCG TCACGCAACA ACGGGCCGCC CCGGTCCGAC GTTTCTCGAC TTTCCGCTGG AAGTGATGCA GGCGATGGTG GACGAAGATG CCGTGACCTT TCCACGCAAC TATCGTGTGA CGACGGGACC AGCAGGAGAT CCGGCACTGG TGCGCCGGGC GCTTGACCTG ATTCGACGCG CACGACGGCC GCTCCTGATC GTCGGCAGCG GCGTCTGGTG GGCACACGGC GACGACGAAC TGCGTCGCTT CGTCGAGACA ACCGGCATCC CTGTGCTCAG TCGCAACCTG GCGCGCGGCA TCATCCCCGA TGACCACCCG CTTTCAGCCG GGTTCTATCC CACGCCAGCT GCGATGGCAG ACGCTTTCCT GGTGATCGGG ACACGTCTCG ACTGGACGAT TGGATACGGG CGTTTTCCCC TCTTCAACCT GGACGCTCCC GTCGTTCAGG TAGATATTCA CGCTGAAAGC ATCGGCAAAA CGCGACCGAT CGATGTTGGG 
ATTATTGGTG ATGCGGCGCA GGTGCTGCGA CAACTCAACG ACCTCGTTGC GGCGGGCGGG GAGTGGGCAA TGGAAGCGGC GTGGCCCCCC ATGGCACACG GGAGCATTGC CATGATGCGT CAGGAAACTG CGGCTGCCGC AAACCTTCCA GCCCGCCCGT CCGACCGCCC CATGCATTCG ATCCAGTTGA TGCAGGCGCT GGCAACATGC CTGCCGCGTG AGGCGATCAA AGTGGTGGAC GGCGGTTACA GCGCGGCGTT TGCGATTCAG TATCTCGATG CCACCGTTCC TGGTGGGGTG ACGTGGGTGG GCAGCACCGG ACATATCGGG GTGGGATTGG GTTTTGCCAT CGGCGCCAGA CTGGCACATC CCGACAGCCC GGTGGTGGCG ATCATGGGTG ATGGCGCTTT TGGGCTATGT GGACTGGAGT TCGATACTGC AGTGCGCCAC CAGCTGCCGA TTATCGTGGT GATCGCCAAC GACGAAGGGT GGGGTGAGAC GCGCGATGGA CAACGGCGAC GGTGGGGCGA TGCGGCAGTG ATCGGTACGC ATCTGGGACC ACGCCGTTAC GACGAACTGG CGCGGGCGCT CGGCGGCTAT GGCGAACGTG TCGAACGACC GGAGGAGATT GCGCCCGCCA TTCGACGCGC CTTCGAGTCG GGAGTGCCGG CGATCATCGA TGTGCATACC GATCCGGAAC AGCGCAGCAC GGCAGTCGCC GGATTGCCCT GGATCGTTGA GTGA
|
Protein sequence | MNRTTYPQRT WLITGCATGF GARLAQQVID RGERVVATDR AVDALAHLHT DDPARLLRLA MDVTDPDAVR RAVEMAVARF ERIDVLVNNA GLGHGGPLEE ARVDDIRRLF DVNIIGMMIV TQAVLPHMRA SGGGYIINIS SDSGVVGFPF QGVYTATKHA VEGFSDCLYQ EVTPFGIRVS VIQPCGMFKT DMPASTITAA RAAIRPDSPY YARATRMADA LAAAWEQSSD PQDVVDAIIE VADADPPPLR RRVGPPDRTA LPGLRQRMSH EEFVTFIYRM TSEDVARPAM PRGRVDGGRL VVRTLREAGV THAFTIVGGH NYQLVNACRE EGIRVIDVRN EMHAAHMADA FARFTRRPAL LTVDAAPGLV NAIAGIEVAY EAQVPLIIAC AQGSLAGRDI GVMQAIDQLR LLRPVTKWQR TCFDVKRLPE YTAAALRHAT TGRPGPTFLD FPLEVMQAMV DEDAVTFPRN YRVTTGPAGD PALVRRALDL IRRARRPLLI VGSGVWWAHG DDELRRFVET TGIPVLSRNL ARGIIPDDHP LSAGFYPTPA AMADAFLVIG TRLDWTIGYG RFPLFNLDAP VVQVDIHAES IGKTRPIDVG IIGDAAQVLR QLNDLVAAGG EWAMEAAWPP MAHGSIAMMR QETAAAANLP ARPSDRPMHS IQLMQALATC LPREAIKVVD GGYSAAFAIQ YLDATVPGGV TWVGSTGHIG VGLGFAIGAR LAHPDSPVVA IMGDGAFGLC GLEFDTAVRH QLPIIVVIAN DEGWGETRDG QRRRWGDAAV IGTHLGPRRY DELARALGGY GERVERPEEI APAIRRAFES GVPAIIDVHT DPEQRSTAVA GLPWIVE
|
| |
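The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block-formatted string might be normalized and its GC content computed. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full 2544 bp.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAATCGAA CAACCTATCC GCAGCGCACC"
print(len(normalize(first_blocks)))  # 30
print(round(gc_percent(first_blocks), 1))
```

Running `gc_percent` on the complete gene sequence, with its two lines joined, gives a value that can be compared against the GC content field reported above.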