Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2188 |
Symbol | |
ID | 5539669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2810582 |
End bp | 2813125 |
Gene Length | 2544 bp |
Protein Length | 847 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894321 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_001432289 |
Protein GI | 156742160 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.941679 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGTA TGACCCATCC ACAGCGCACA TGGCTGATCA CCGGCTGCGC CAGCGGTTTC GGCGCGCGCC TGGCGCAGCG ATTGATCGAG CGCGGTGAGC GCGTTGCCGC CACCGACCGC AGTGTTGACC TTCTTACCCG TCTCCATTCA GACGATCCGG CGCGCCTGCT GTGCCTGGCA ATGGATGTGA CCGACCCCGA TGCGATACGT CGCGCTGTCA GGACGGCAGT GGCGCACTTC GGACGGATCG ATGTATTGGT CAACAACGCC GGGTTGGGGC ACGGCGGTCC GCTGGAAGAA GCGAAACTGG AAGATATTCG CCGCCTCTTC GACGTGAATA TCATCGGCAT GATCCTTGTG ACCCAGGCAG TTCTCCCTCA TATGCGCAAC GCAGGTCGCG GACACATTAT CAATCTGTCG TCAGACAGCG GCGTGGTCGG GTTTCCGTTT CAGGGGATTT ATACTGCGAC CAAACACGCT GTCGAGGGCT TTTCCGACTG TCTCTACCAG GAAGTGACGC CGTTCGGCAT CCATGTGTCG GTCATTCAGC CGTGCGGTAT GTTTAAGACC GATATGCCTG CAAGCACGAT CACCGCCGCC AGAGCAGCCA TGCGACCAGA CAGCCCGTAC TATGCGCGTG CTGTGCGCAT GGCGGAAGCG CTTGCCGCCG CCTGGGAACA GAGCAGCGAC CCACAGGAGG TGGTTGACGC CATTATTGAA ACTGCTGACG CCAACTCGCC GCCATTGCGT CGCCGCGTTG GTCCTCCTGA ACGCACCGGT CTTGTCGGGT TACGCCACCG TATGCCGCAC GAGGAATTCG TCCGATTCAT CGCCAGTGTC ACCGGCGATG GCGCCACACG TCCCGCTATG CCGAAAGGTC GTGTCGATGG CGGGCGACTG GTGGTGCGCA CTCTGCGCGA AGCGGGGGTG ACCCATGCCT TCACGATTGT CGGCGGGCAC AACTACCACC TGATTAATGC CTGCCGCGAA GAGGGCATCC GGGTGATCGA CGTGCGCAAC GAAATGCATG CCGCACACAT GGCGGATGCC TATGCGCGCT TCACACGCCG ACCGGCGCTG CTCACGGTGG ACGCTGCGCC AGGACTGGTC AATGCAGTGG CCGGCATCGA AGTCGCCTAT GAAGCACAGG TTCCTATGAT CATTGCATGC GCACAGGGAT CGCTGGAAGG ACGCGACATC GGCGTTATGC AGGCAATCGA TCAGGTGCGC CTGATGCGCC CGATCACCAA ATGGCAACGC ACATGCTTCG ATGTCAGACG GCTGCCCGAA TATACCGCCG CTGCCGTGCG CCATGCCACC ACCGGACGTC CCGGTCCGGC ATTTCTCGAT TTTCCGCTGG AGGTGATGCA CGCGGTCATT GAGGAGGATA CCGTCACCTT TCCACGCCAT TATCGGGTGA CGGTCGGACC GCCGGGCGAT CCGGCGCTGG TGCGTCAGGC GCTCGATGTG ATTCGCAAGG CGAGACGTCC GCTCCTGATC GTTGGCAGCG GCGTCTGGTG GGCGCGCGGC GAGGAGGAAC TACGCCGTTT TGTCGAAACG ACCGGCATAC CGGTGCTGAG CCGCAACCTG GCGCGCGGAA TCATTCCCGA TGACCATCCC CTCTCTGCCG GGTTTTACCC CACCCCGGCA GCCATGGCGG ATGCCTTCCT GGTGATCGGC ACACGACTCG ACTGGACGAT TGGCTACGGA CGCTTTCCGC TCTTCAGCAT GGACGCGCCG GTCGTGCAGG TGGATATACA CCCCGAAAGC ATTGGCAAAA CGCGACCAAT CGACCTCGGC ATCGTCGGCG ATGCAGCGCA GGTATTGCGT CAGCTCAATG AACTTGTTGC AGAAGGTAGC GATTGGGCGA TGGAGAAGGA ATGGAGCCAC ATAGCACATG GCAGCATTGG CGCTATGCGC GCAGAAACCG CCGCTGCCGC CGATCTGGCA GCGCGCGATC CCGCGCGACC GATGCACTCA ATTCAGTTGA TGCAGGCGCT GGCGGATTGC CTGCCACGCG ACACGATTAA GATTGTGGAT GGCGGGTACA GCGCGGCATT TGCCATTCAG TATCTTGATG CCTGCGTTCC TGGCGGCGTG ACGTGGGTGG GCAGCACCGG GCACATTGGC GTGGGACTGG GATTTGCCAT TGGCGCAAAA CTGGCGCACC CGGACCACCC CGTTGTGGCG ATCATGGGAG ACGGGGCATT TGGGCTGTGT GGCATGGAGT TCGATACTGC CGTGCGACAC CATCTGCCGA TGATTGTGGT GATTGCCAAC GATGCCGGGT GGGGTGAGAC GCGCGACGGT CAGCGACGGC GCTGGGGAGA CGCTGCTGTA ATCGGCACAA ACCTGGGACC GACGCGCTAC GATGAACTGG CGCGCGCGCT CGGCGGCTAT GGCGAGCGTG TCGAACGACC GGAAGCGATT GCGCCGGCGA TCCGGCGTGC GTTTGACTCA GGATTGCCTG CGATTGTCGA TGTGCGCACC GACCCGGAGC AGCGCAGTGC CGCAGTAACC GGATTACCGT GGATTGTCGA GTGA
|
Protein sequence | MHRMTHPQRT WLITGCASGF GARLAQRLIE RGERVAATDR SVDLLTRLHS DDPARLLCLA MDVTDPDAIR RAVRTAVAHF GRIDVLVNNA GLGHGGPLEE AKLEDIRRLF DVNIIGMILV TQAVLPHMRN AGRGHIINLS SDSGVVGFPF QGIYTATKHA VEGFSDCLYQ EVTPFGIHVS VIQPCGMFKT DMPASTITAA RAAMRPDSPY YARAVRMAEA LAAAWEQSSD PQEVVDAIIE TADANSPPLR RRVGPPERTG LVGLRHRMPH EEFVRFIASV TGDGATRPAM PKGRVDGGRL VVRTLREAGV THAFTIVGGH NYHLINACRE EGIRVIDVRN EMHAAHMADA YARFTRRPAL LTVDAAPGLV NAVAGIEVAY EAQVPMIIAC AQGSLEGRDI GVMQAIDQVR LMRPITKWQR TCFDVRRLPE YTAAAVRHAT TGRPGPAFLD FPLEVMHAVI EEDTVTFPRH YRVTVGPPGD PALVRQALDV IRKARRPLLI VGSGVWWARG EEELRRFVET TGIPVLSRNL ARGIIPDDHP LSAGFYPTPA AMADAFLVIG TRLDWTIGYG RFPLFSMDAP VVQVDIHPES IGKTRPIDLG IVGDAAQVLR QLNELVAEGS DWAMEKEWSH IAHGSIGAMR AETAAAADLA ARDPARPMHS IQLMQALADC LPRDTIKIVD GGYSAAFAIQ YLDACVPGGV TWVGSTGHIG VGLGFAIGAK LAHPDHPVVA IMGDGAFGLC GMEFDTAVRH HLPMIVVIAN DAGWGETRDG QRRRWGDAAV IGTNLGPTRY DELARALGGY GERVERPEAI APAIRRAFDS GLPAIVDVRT DPEQRSAAVT GLPWIVE
|
| |