Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3914 |
Symbol | |
ID | 5541420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5115252 |
End bp | 5116085 |
Gene Length | 834 bp |
Protein Length | 277 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896024 |
Product | D-mannonate oxidoreductase |
Protein accession | YP_001433967 |
Protein GI | 156743838 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.374414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.041983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAC CATTTCTTCG TTCATTGTTC GGTCTGGAAG GGAAGGTTGC CGTGGTCACC GGCGGCAGCG GCGCCCTTGG CGCTGCGATG GCGCAGGGAC TGGCACGCGC TGGCGCGCGG ATCGCCATTC TTGCCCGACG CATCGAACCG GCAAACGCAG TCGCCGCCGC TCTCGCAGAC GCTGGCAGCG ATGCGTTTGC GCTCAGCGCC GATGTGCGCG ACCGGACGCA GATCGAACAC GCCTGTAACA CAATTCTTGA ACGCTGGGAA CGGGTCGATA TTCTGGTGAA TGCCGCTGGC GGGAATATGC CGGGCGCAAC GCTTGCTCCC GATGCTGCGC TGCGCGATCT TGACCCCGAC GCCTTTCGCA CCGTCGTCGA TCTGAATCTG ATCGGCACAC TGCTGCCATC GCTCATCTTT GGCGCAGCCA TGATCGCAGC CGACAGGCAG GGCGTGATTG TCAACATCTC CTCCATGGCA GCGCAACGTC CACTGACTCG CATCGCCGGT TATAGCGCCG CCAAAGCCGC TGTGGACAAC CTCACACGCT GGATGGCGGT CGAACTGACG CGGCGGTATG GTCCGGGATT GCGGGTCAAC GCAATTGCGC CCGGATTCTT CATCGGCGAG CAGAACCGGT CGTTGTTACT CAATCCCGAC GGATCGCCGA CAGCACGTGG TGCGAGCGTG ATTGCGCACA CACCTGCCGG ACGCTTCGGC GTTCCGGATG ATCTCATCGC CACGCTGATC TGGCTCTGCG GACCTGGCGC CGCTTTCGTC AACGGCGTCG TCGTCCCGGT GGACGGCGGA TTCTCTGCGT GGAGCGGCGT GTAA
|
Protein sequence | MSEPFLRSLF GLEGKVAVVT GGSGALGAAM AQGLARAGAR IAILARRIEP ANAVAAALAD AGSDAFALSA DVRDRTQIEH ACNTILERWE RVDILVNAAG GNMPGATLAP DAALRDLDPD AFRTVVDLNL IGTLLPSLIF GAAMIAADRQ GVIVNISSMA AQRPLTRIAG YSAAKAAVDN LTRWMAVELT RRYGPGLRVN AIAPGFFIGE QNRSLLLNPD GSPTARGASV IAHTPAGRFG VPDDLIATLI WLCGPGAAFV NGVVVPVDGG FSAWSGV
|
| |