Gene RoseRS_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2371 
Symbol 
ID5209340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2933714 
End bp2935006 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content62% 
IMG OID640595977 
Productglycosyl transferase, group 1 
Protein accessionYP_001276699 
Protein GI148656494 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.64695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAGA CGAGCCAGAG TGCATCTCCC CGACGACCGC GCACGGTTGC GTATACGATG 
TCGCGCTTCC CCAAGATCAC CGAGACGTTC ATCCTGATCG AGATGCTGGA ACTCGAACGT
CAGGGGGTGC GGGTCGAAAT CTTTCCGCTC ATCCGCGAGC GCGAGCCGGT GCAGCACGCC
GATGCCCAGA GGATGGTCGA GCGCGCGCAT TTCTGTCGGT TGTTCTCGCG CCCGGTGCTG
GATGCGCAGA TCTACTGGCT TCTGCGTCGT CCCGGCGCCT ACCTGCGCGC CTGGTGGCGC
GCAGTGCGCG GCAATCTGGA GTCGCCGAAG TTCCTGTCGC GTGCGCTGGT GGTCGTGCCC
AAAGCCGCAT ATGCCGCACG GCGCATGGTC GAACTCGACG TCGATCACCT GCACGCGCAC
TATGCAACCC ACCCGGCGCT CCTGGCATAT GTGGTTCACC TCCTGACTGG TATTCCGTAC
AGTTTCACGG TGCATGCTCA CGATCTCTAC GTTGAGCGCC CGATGCTGGG AGAAAAGGTC
GCTGCGGCCA GTTTCGTTGT GGCGATCTCC GAGTTCAATC GCCGGATGCT GATCGACCTG
TACGGCGCGA CGGCTGAGGA GCGGGTCATC GTGGTGCATT GCGGCATCGA CCCGGCTCTC
TTTCGTCCAC GCGAGCGGCG TGAGCCGGGT GAACTGTTCA CGATCGCATG CGTGGCCAGT
CTTGCGGGGT ACAAAGGGCA GCGCTACCTG ATCGATGCGT GTGATGTGCT GCATCAACGG
GGCGTGCCCT TCCAATGCCT GCTGGTCGGC GAGGGTGAGG ATCGCCCGCA CCTCGAAGCA
CAGATTCGCC GTCTGGGTCT GACGGATCAT GTCAGGCTGC TTGGCGCTCA ACCGCGTCAT
AACGTGAGCG AACTGTTGCA GCAGGTCGAT GCGCTGGCGC TACCGAGCGT TGTGATGCCC
AACGGCAAGA TGGAAGGCAT TCCGGTGGCG CTGATGGAAG CGCTTGCCGC CGAAATTCCG
GTTGTGGCGA CTGCCATCTC CGGCATTCCC GAACTGGTGC GGGACGGCGA AACCGGTCTG
CTCGTTCCGG AACGTGACGC GGCGGCGCTG GCTGAAGCGC TGCTGCGCCT CTATACCGAC
CGCGACCTGG GACGACGCCT GGCTTCCGCA GGGCGCCAGC TGGTGCTGCG TGAATTCAAT
CTCGAACATA GTGTTGCTCA ATTGCGCACT CTTTTTGAAC GTGACTGGCG GATGGATGGA
CAGCGGCCGC TCCTATGCGA AATCGAGGCG TGA
 
Protein sequence
MIETSQSASP RRPRTVAYTM SRFPKITETF ILIEMLELER QGVRVEIFPL IREREPVQHA 
DAQRMVERAH FCRLFSRPVL DAQIYWLLRR PGAYLRAWWR AVRGNLESPK FLSRALVVVP
KAAYAARRMV ELDVDHLHAH YATHPALLAY VVHLLTGIPY SFTVHAHDLY VERPMLGEKV
AAASFVVAIS EFNRRMLIDL YGATAEERVI VVHCGIDPAL FRPRERREPG ELFTIACVAS
LAGYKGQRYL IDACDVLHQR GVPFQCLLVG EGEDRPHLEA QIRRLGLTDH VRLLGAQPRH
NVSELLQQVD ALALPSVVMP NGKMEGIPVA LMEALAAEIP VVATAISGIP ELVRDGETGL
LVPERDAAAL AEALLRLYTD RDLGRRLASA GRQLVLREFN LEHSVAQLRT LFERDWRMDG
QRPLLCEIEA