Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2149 |
Symbol | |
ID | 5539629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2759985 |
End bp | 2761223 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894283 |
Product | 2-alkenal reductase |
Protein accession | YP_001432252 |
Protein GI | 156742123 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0134489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00552688 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCGTG GAACGAAATA TGCGCTGATC TTCGGCATGG TGGTTACCCT GGTAATCGGT GCGCTTATCG GCGCTCTGGC CGGCGGCGGC GTTGCCTGGT ATGTAACGCA GCAGCAGATT GAGCGGATTG CAGCACAACC GTCGACGCCC GCGCCAATTC CGGCGTCAAT GCCGGCGACG ACGGTCGTTC CGCAAGCAAC TGACGTGCCG CTGCCAACTC CTGCCCAGGT CCCGACGCCG GCGCCAGCGG CGCCGGCAAC GACTTCACCG GTTGTTGAGG CGGTTCAGAA GGTGTCGCCG GCGGTTGTGA CGGTCGTGAA CACGCTGGCA TCAGGTGCGC AGGGATCGCC GCTGCTTGGC GATCTACCGT TTCCGCTGCC GGATCAACCC GGCGGTTCAG TGCGCCGCGG CAGCGGTTCT GGGGTCATTA TCAGCCCGGA TGGGTATATT CTTACCAACA ATCACGTGAT TGAAGGGTAT CGCTCGCTCT CGGTCATTTT CTACGACGGT TCGCGCCGTG ATGCAACATT GGTCGGCGCC GATCCACTGA TGGATCTTGC CGTGGTCAAG GTCGATGGTC CGGTTCCCGG CGTGGCGACG CTGGGCGACT CCGACGCGCT CCAACCCGGT GAAACGGTCA TTGCGATTGG CAGCCCGCTT GGCGACTTCC GCAACACGGT GACGGTTGGC GTGGTGAGCG CTCTCAACCG TTCGCTTGGC GCCGACGCAC CCGAAGGATT GATCCAGACT GATGCGGCGA TCAACAGCGG CAACAGCGGC GGTCCACTGA TCAATCTGCG CGGTGAAGTC GTCGGGATCA ATACGCTCGT CGTGCGGGGG AGCGGTTTGG GAACGGCGCC CATCGAAGGG CTTGGGTTTG CAGTGCCAAG CTCGATTGCC AGGCGGGTGA GCGAGCAGTT GATCGCCAAT GGCAAAATCG TTTACCCGTT CCTCGGTGTG CGTTTTGGCA CAATCGATGC TATGCTGGCG CTCGATAACG ATCTGCCGGT CAATGCTGGC GCACTGATCT CCGCTGTCGA GCCGGGTGGA CCGGCTGCCC GCGCCGGGTT GCGCAGCGGT GACATTGTGA CCAAAGTTGA TGGAAAGACG ATTGGACCGG GGCAGTCGTT GCGTGCTCTG TTGCTGGAGT ACAAACCGGG CGACACGGTT ACGCTCGAGG TGTTGCGTAA TGGTGAACGG CTGTCGTTGG ACGTGACTCT GGGGACGCGC CCGGATTGA
|
Protein sequence | MERGTKYALI FGMVVTLVIG ALIGALAGGG VAWYVTQQQI ERIAAQPSTP APIPASMPAT TVVPQATDVP LPTPAQVPTP APAAPATTSP VVEAVQKVSP AVVTVVNTLA SGAQGSPLLG DLPFPLPDQP GGSVRRGSGS GVIISPDGYI LTNNHVIEGY RSLSVIFYDG SRRDATLVGA DPLMDLAVVK VDGPVPGVAT LGDSDALQPG ETVIAIGSPL GDFRNTVTVG VVSALNRSLG ADAPEGLIQT DAAINSGNSG GPLINLRGEV VGINTLVVRG SGLGTAPIEG LGFAVPSSIA RRVSEQLIAN GKIVYPFLGV RFGTIDAMLA LDNDLPVNAG ALISAVEPGG PAARAGLRSG DIVTKVDGKT IGPGQSLRAL LLEYKPGDTV TLEVLRNGER LSLDVTLGTR PD
|
| |