Gene RoseRS_4100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4100 
Symbol 
ID5211083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5139872 
End bp5140849 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content60% 
IMG OID640597688 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001278394 
Protein GI148658189 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGC GCAGCATTCT GATCACTGGC GGCGCAGGGT TCATCGGTTC GCACCTGGCG 
GATGCCCTGA TCGAGCGCGG CGACCGGGTG GCGATCATCG ATGATCTCTC CACCGGCGCG
GTTGCGAATA TTCGCCACCT CAAAGGACAT CCGAACTTCA GTTATACGCT CGATACCATC
GCCAATGAAG CGGTGCTGGC GGAACTGATC GACGAGAGTG ATGCGGTGGT GCATCTGGCG
GCGGCGGTCG GCGTGCAACT GATTGTGCAA AGCCCCGTGC GCACCATCGA AACCAATGTC
AACGGCACTG AACTGGTGTT GCGCTGGGCG GCGAAGAAGG GAAAGACTGT GTTGATTGCC
AGCACGTCCG AGGTGTACGG CAAAAGTGAG CGCGCCCCCT TCCGCGAAGA CGATGACCTG
GTGCTTGGTC CTTCCACAAT AAACCGCTGG AGTTATGCCT GCTCCAAACT GCTCGATGAG
TTTCTGGCGC TGGCGTACCA CAAAGAGCGC GACCTGCCTG TGATCATTGC GCGCCTGTTC
AACACGGTCG GTCCGCGTCA GACGGGGCGC TACGGGATGG TCGTGCCGCG CTTTGTTCGG
GCTGCACTCC GTAATGTGCC GTTGCGTGTG TATGGCGATG GGCAGCAAAC GCGCTGCTTC
TGCTACGTCG GCGATACAGT GCGCGCATTG ATCGCCCTGC TCGACCATCC AGACGCGGTT
GGGAAGGTTT TCAACGTTGG CAATCCGCAG GAAGTGAGCA TTCTCGAACT GGCGCAGCGT
GTGGTGCGCC TGGCGCAGAG TTCATCACCG ATCGTGCTGG TGCCCTACGA GCATGCCTAC
GAAGCCGGGT TTGAAGATAT GCGCCGGCGC GTGCCGGATA TTTCGCGTCT CACAGCGCTG
ACCGGCTTCC GCCCGACGCT CGATCTCGAT GATATTATCC GCACGGTCAT CGAGTACGAA
CAGGCGCACG GCGCGTGA
 
Protein sequence
MAQRSILITG GAGFIGSHLA DALIERGDRV AIIDDLSTGA VANIRHLKGH PNFSYTLDTI 
ANEAVLAELI DESDAVVHLA AAVGVQLIVQ SPVRTIETNV NGTELVLRWA AKKGKTVLIA
STSEVYGKSE RAPFREDDDL VLGPSTINRW SYACSKLLDE FLALAYHKER DLPVIIARLF
NTVGPRQTGR YGMVVPRFVR AALRNVPLRV YGDGQQTRCF CYVGDTVRAL IALLDHPDAV
GKVFNVGNPQ EVSILELAQR VVRLAQSSSP IVLVPYEHAY EAGFEDMRRR VPDISRLTAL
TGFRPTLDLD DIIRTVIEYE QAHGA