Gene Rcas_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0203 
Symbol 
ID5537664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp249681 
End bp250685 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content58% 
IMG OID640892366 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001430354 
Protein GI156740225 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.387989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGAA CAAACGACTA TACCCATGCG TATAGCGGCG CGCGCGTGCT CATTACAGGC 
GGAATGGGGT TCATCGGTTC GAATCTGGCG CATCGCCTGG TGGAACTCGA TGCGCAGGTG
ACTCTGGTCG ACTCACTCAT CCCGATCTAC GGCGGCAATC AGCGCAACAT CGCCGGCATC
GAGCATCGGG TGCGCGTCAA CATCGCCGAT GTGCGCGACG AGTATTCGAT GAACTATCTG
GTGCAAGGGC AGGATTACCT CTTCAATCTT GCCGGTCAGA CGTCGCACCT GGACTCGATG
ACCGACCCCT ATACCGATCT TGAGATCAAC TGCCGCGCGC AGTTGTCGAT CCTCGAAGCC
TGTCGCAAGC ACAATCCCAA CCTGAAACTG GTGTACGCTT CGACGCGCCA GATCTATGGC
AAGCCGGATT ATCTGCCGGT CGATGAGCGC CACCTGCTCC ATCCGGTCGA TGTCAATGGC
GTCAACAAAA TGGCCGGCGA GTGGTACCAT ATTCTCTACA ATAACGTCTA TAGCATTCGC
GCATGCGCCC TGCGCCTGAC GAACACCTAT GGTCCGCGCA TGCGCGTCAA AGATGCGCGA
CAAACGTTTC TCGGCATCTG GATCAAGCGC CTGATTGACG AAGAGCCGAT CCAGGTCTTC
GGCGACGGGT CGCAGATCCG CGACTTCAAC TACGTTGATG ATGTGGTCGA AGCGATGCTG
CTGGCAGGCG CATCGCCTGC GGCGGATGGC GGCATCTTCA ATCTGGGCAG CGACGAAACG
ATCAACCTGC GCGACCTGGC GGCATTGCTG GTCGAAATTA ATGGCGGCGG CAGTTTTGAA
ATTGTGCCTT TCCCACCAGA CCGCAAAGTC ATCGACATCG GCGATTATTA CGCCGATTAC
CGCATGATCC AGGGGCGGCT CGGCTGGCGC CCCAAAGTGT CGTTGCGCGA GGGATTGCGC
CGTACTCTCG AGTTCTATCG GCGTGAGCGC GAGTATTACT GGTAG
 
Protein sequence
MPGTNDYTHA YSGARVLITG GMGFIGSNLA HRLVELDAQV TLVDSLIPIY GGNQRNIAGI 
EHRVRVNIAD VRDEYSMNYL VQGQDYLFNL AGQTSHLDSM TDPYTDLEIN CRAQLSILEA
CRKHNPNLKL VYASTRQIYG KPDYLPVDER HLLHPVDVNG VNKMAGEWYH ILYNNVYSIR
ACALRLTNTY GPRMRVKDAR QTFLGIWIKR LIDEEPIQVF GDGSQIRDFN YVDDVVEAML
LAGASPAADG GIFNLGSDET INLRDLAALL VEINGGGSFE IVPFPPDRKV IDIGDYYADY
RMIQGRLGWR PKVSLREGLR RTLEFYRRER EYYW