Gene RoseRS_2884 details

Gene Information

Locus tag: RoseRS_2884
Symbol:
ID: 5209853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3598119
End bp: 3599288
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 59%
IMG OID: 640596480
Product: xylose isomerase
Protein accession: YP_001277202
Protein GI: 148656997
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2115] Xylose isomerase
TIGRFAM ID: [TIGR02631] xylose isomerase, Arthrobacter type
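
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following Python sketch shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; neither is stated in the record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3598119, 3599288)
print(length, protein_length_aa(length))  # 1170 389
```

Under those assumptions the computed values agree with the Gene Length (1170 bp) and Protein Length (389 aa) listed in the record.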


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA TGTACGAACC CAAACCCGAA CACAAATTCA CCTTCGGACT CTGGACGGTT 
GGCAACATCG GGCGTGATCC GTTCGGCGAA CCGGTGCGCC AGCCTCTTTC GCCGGTCGAA
ATCGTCCATC TGCTGGCGGA AGTTGGCGCG TATGGCGTCA ATTTCCACGA CAACGACCTG
ATCCCGATCG ATGCAACACC GGCGGATCGC GACCGCATTC TACGCGAGTT CAAGGCGGCG
CTCCAGGAAA CCGGCCTCGT TGTGCCGATG GCGACGACCA ACCTCTTTGC CGATCCGGTC
TTCAAGGATG GCGCCTTTAC CTCGAACGAT CCTAAAGTGC GCGCATACGC GCTACAAAAA
ACAATGCGCG CAATCGATCT GGGCGTTGAA TGTGGCGCGA AGGTGTACGT TTTCTGGGGC
GGTCGCGAAG GCGTTGAGAG TGACGCGGCG AAGGATCCGC AGGAAGGGTT GAAACGCTAC
CGCGAAGCGA TCAATTTCCT CTGCGCCTAT GTGAAAGACC AGGGGTACGA CCTGAAATTC
GCGCTCGAAC CCAAGCCGAA CGAGCCGCGC GGCGACATCT ATCTGGCGAC TGTCGGGCAC
GCGCTGGCGT TCATCAACAC GCTCGACTAC CCGGACATGG TCGGGCTGAA CCCGGAAGTC
GCCCATGAAA CAATGGCAGG ACTGAACTTC CTGCACGGCG TCGCGCAGGC GTGGGATGCT
GGCAAACTGT TCCATATCGA CCTGAACGAT CAGGTAATCG GGCGCTACGA TCAGGATTTC
CGCTTCGGAG CGGTCAATCT GAAAGCCGCC TTCTTCCTGG TGCGCTTCCT GGAAAAGGTC
GGGTACCAGG GGAGTCGTCA CTTCGATGCA CACGCCTACC GCACCGAGGA TTATGAAGGG
GTGAAAGCGT TTGCCCGCGG CTGTATGCGC ACCTACCTGA TCTTGAAAGA AAAGGCGCGC
CGCTTTGACG AAGATGCCGA GATTCAGGCG TTGCTGGCGG AAATCTCTGC CGACGACGGC
TCGATGGCGC CGTTCCAGGG CGGCTACAGC CGCGAAAAGG CTGCCGCGCT CAAGGCGCAT
CCGTTCGACC GGGCGGCGCT TGGGCGACGC GGGCTGGCGT ATGAGCGCCT CGATCAACTG
ACGAACGAAT TGTTGCTCGG CGTTCGTTAA
 
Protein sequence
MSDMYEPKPE HKFTFGLWTV GNIGRDPFGE PVRQPLSPVE IVHLLAEVGA YGVNFHDNDL 
IPIDATPADR DRILREFKAA LQETGLVVPM ATTNLFADPV FKDGAFTSND PKVRAYALQK
TMRAIDLGVE CGAKVYVFWG GREGVESDAA KDPQEGLKRY REAINFLCAY VKDQGYDLKF
ALEPKPNEPR GDIYLATVGH ALAFINTLDY PDMVGLNPEV AHETMAGLNF LHGVAQAWDA
GKLFHIDLND QVIGRYDQDF RFGAVNLKAA FFLVRFLEKV GYQGSRHFDA HAYRTEDYEG
VKAFARGCMR TYLILKEKAR RFDEDAEIQA LLAEISADDG SMAPFQGGYS REKAAALKAH
PFDRAALGRR GLAYERLDQL TNELLLGVR
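
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal Python sketch, using only the standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
dna = clean("ATGAGTGACA TGTACGAACC CAAACCCGAA")
print(translate(dna))  # MSDMYEPKPE
```

Passing the full gene sequence to `clean` and then to `translate` and `gc_fraction` would allow a comparison against the Protein sequence block and the GC content field of the record.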