Gene RoseRS_2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2653 
Symbol 
ID5209622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3290552 
End bp3292051 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content61% 
IMG OID640596255 
ProductO-antigen polymerase 
Protein accessionYP_001276977 
Protein GI148656772 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGTT TTCTGTTCGA TCTCTATCAG CGCACCGCGC GCGCAACGTT TGTGCTGGCG 
TCCCTTTTCA TGGCGCTATG TCTGGCTGTC TTCTCCACTG GCGGATTGCC GCTCGCGTTT
CGCCTGACAG GTCTGCTCCT GTTCGTAGCG CTGGCGCTGG TGCACCCGGC AGGCGGTCTG
GCATGCGTCA TACTGACGGC GCCTCTCTAC CTGATGCCTG CAACGGTCGA TAGTCCGACG
CGAACGCTGT TGCTTCCGCT GCATGAAGTC GCGCTGCTGA TCACGACTGC TGCCGTATGC
TGGCGCTGGA TCGGCGGACA CATTCGAGAT CGCCGTATGC CCGATGCAGG CGCTGCCCTG
CAACGTGTGC GCGCAATGAG CGTTTCCCAC GCTCCGGAAG CGTTGCTGGC GCTGGCGGGG
ATCATCGGCG TGATGCTGGC AGTTCCGGAG GGTCGAGGCG CTGCGCTGCG CGAGTTTCGC
TGGTTGATCG TTGAGCCGTT ACTGTTCTAT GGACTCGTGC GCACAATCAG AATGTCGCGC
GAAACCCTGA TCGGCGCCCT GGCGCTCAGT GGCGCATTCG TGGCAGTGAT CGGCGCATTC
CAGTTCGTCG GACTGGACCT GACGCCATTG ATCGGCGAGA AACGCGCCTT CAGCGAAAAC
ATTATTGTTG TGGATGGGGT GCGACGGGTA ACGTCGGTCT ATGGTCATCC CAACAATCTG
GGTCTCTATC TGGAACGGGT CTGGCCCCTG GCGGCGGCGA TGGCGGCGTG GATATGGCAT
GCGCGGCGTG GAGACGGGGC AGCCAGCGCT TCTCCTGCTG TGTCGGAAAG GCGATCAACC
GGCGCTGCCT TCTTCTTTGC CGCCTGTGCG CTGCTGTCGC TGACGGGGGT TGTGGTTTCC
TTTTCGCGCG GTGCGTGGCT GGCGAGCGTG ATTGCGGCGG TTGTGTTGGG AGCAGGCTGG
CTGCTGCACC GGTCACAGCA TCGACAGGCG GTGCGCTGGT CGGCGCTGGC GTTCATCGGA
GCGCTGATCG TGGGAATGAC CGGACTGGCG CTGACGCTGC GCGGCGGTCC TGGCGGCGGA
AGCGTCGATG CGCGTCTGCT TCTCTGGCGT GAGGCGCTGG TCTATCTCCG GCAGAATCCG
CTCGGATTGG GGATCGACCA GTTTTACTAT TACCACAATC CGGCATTCGG GCGGAGTGCA
ATCGATCCAT CGCTGGTCGG CACGAGCGAG GAATTTGCTG CGCATCCTCA CAATCTGCTG
CTTGATGCTT GGGTGAATGT CGGACCTCTG GGGGTTCTGG CTTTTGGTCT GCTGCTGGTG
CGCTTCTATC GCAACGCCCT CATCGCTGTG AGGAAACGGC GTGAGGTGGT GATTGCGGGG
GCGCTGGCAG CGATGACTGC CGCACTCTTC CATGGTCTGG TCGATCGGTT CTATTTTGTG
CCGGATCTGG CAATTGCATT TTGGGTGCTG ATGACTGTTG GGGAGAGAAG TGAGAATTGA
 
Protein sequence
MPGFLFDLYQ RTARATFVLA SLFMALCLAV FSTGGLPLAF RLTGLLLFVA LALVHPAGGL 
ACVILTAPLY LMPATVDSPT RTLLLPLHEV ALLITTAAVC WRWIGGHIRD RRMPDAGAAL
QRVRAMSVSH APEALLALAG IIGVMLAVPE GRGAALREFR WLIVEPLLFY GLVRTIRMSR
ETLIGALALS GAFVAVIGAF QFVGLDLTPL IGEKRAFSEN IIVVDGVRRV TSVYGHPNNL
GLYLERVWPL AAAMAAWIWH ARRGDGAASA SPAVSERRST GAAFFFAACA LLSLTGVVVS
FSRGAWLASV IAAVVLGAGW LLHRSQHRQA VRWSALAFIG ALIVGMTGLA LTLRGGPGGG
SVDARLLLWR EALVYLRQNP LGLGIDQFYY YHNPAFGRSA IDPSLVGTSE EFAAHPHNLL
LDAWVNVGPL GVLAFGLLLV RFYRNALIAV RKRREVVIAG ALAAMTAALF HGLVDRFYFV
PDLAIAFWVL MTVGERSEN