Gene RoseRS_3121 details

Gene Information

Locus tag: RoseRS_3121
Symbol:
ID: 5210090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3921081
End bp: 3922232
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 66%
IMG OID: 640596713
Product: glycosyl transferase, group 1
Protein accession: YP_001277434
Protein GI: 148657229
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
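
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3921081
END_BP = 3922232
GENE_LENGTH_BP = 1152
PROTEIN_LENGTH_AA = 383


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1152
print(expected_protein_length(GENE_LENGTH_BP))  # 383
```

Under these assumptions both derived values agree with the reported gene length (1152 bp) and protein length (383 aa).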


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000483528
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000199281
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
GTGCATCTCG CCATCAACGC CATGTTCTGG TCGCAACCGA CCGTCGGCAG CGGGCAGTAT 
CTGCGCATGC TCGTCCATGC CATGCTATCC GTCGCGCCCG ATGTGCGGGT GACTCTGCTC
CTGCCTGCCG GTCGCCCGGT CGCTGAACCG CTGCCACCGA ATGTCCAGGC GGCGCCTGTG
CCCACGCCGT TCGATGGGCG CAACGCGAAC CTGGCGAAGG TCTGGTTCGA GCAGATCGCC
GTTCCACGCG CAGCGCTGCG CCTGAATGCC GACCTGCTGC ATGTACCGTA CTTTGCGCCG
CCGCTGCGCC CGCCGCTGCC GACCGTCGTG ACGATCCTGG ACATTATTCC GCTCTTGCTG
CCGGAGTATC GGGGACGGGC AGCGGTGCGT CTCTATATGC GCCTGGTCGC GCGCGCCGCG
CGGCATGCGA CGCAGATTAT CACGATTTCG CACCATAGCG CCAGCGATAT TATCCGTCAC
CTCGGCTGTC AGGCAGCGCG CGTGGCGGTC GTCCACCTGG CGGCCGGCGC ACAGTTCCGC
CCGCGCGACC GTACCTTGAG CGAAACGGAA GTTGCCGCCC GCTACAGCGT CACGCCCCCG
TTCGTGTACT ATGTCGGCGG GCTGGACGCG CGGAAGAACC TGGCGACGCT GGTGTGGGCA
TTTGCGCGTA TGCGATACGC TGGCGGACCT CCCGCCACGC TGGTGATTGC CGGACGTGCG
GCTGGCAACG ATCCACGGAT GTTTCCCGAC CTGGATGCTA TCATCATGTT CGCCAGAGCC
GGCGCTTTTG TGAAGCGCAT TGATGTCCCC TACGAAGATG CGCCGCTGCT CTATGGCGCA
GCAACGGTAT TCGCCTTTCC GTCGCGCTAC GAAGGGTTTG GCTTACCGCC GCTCGAAGCC
ATGGCGTGCG GTACGCCGGT GATCGTCGCC GATGCCGCCA GCCTGCCCGA GGTTGTCGGC
GATGCGGCGC TGCGTGTTCC AGCGGAGGAC GTGACAGGAT GGAGCGCTGC ACTCTGGCGC
ATGCTGGCGG ACGACGCCCT GCGCGCCGAT CTGTCCCGAC GCGGACTGGA GCGCGCGGCG
CAGTTCAGCC CGGATCGTAT GGCGCGCGAG ACGCTGGCAA TCTACGCAGC GACACGCAAC
AGTTCCGGTT GA
 
Protein sequence
MHLAINAMFW SQPTVGSGQY LRMLVHAMLS VAPDVRVTLL LPAGRPVAEP LPPNVQAAPV 
PTPFDGRNAN LAKVWFEQIA VPRAALRLNA DLLHVPYFAP PLRPPLPTVV TILDIIPLLL
PEYRGRAAVR LYMRLVARAA RHATQIITIS HHSASDIIRH LGCQAARVAV VHLAAGAQFR
PRDRTLSETE VAARYSVTPP FVYYVGGLDA RKNLATLVWA FARMRYAGGP PATLVIAGRA
AGNDPRMFPD LDAIIMFARA GAFVKRIDVP YEDAPLLYGA ATVFAFPSRY EGFGLPPLEA
MACGTPVIVA DAASLPEVVG DAALRVPAED VTGWSAALWR MLADDALRAD LSRRGLERAA
QFSPDRMARE TLAIYAATRN SSG
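
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is read as methionine at the first position. The following is a minimal sketch of that behaviour, not the pipeline that produced this record. The `translate_cds` helper and the `ALT_STARTS` set (a subset of the table 11 initiation codons) are assumptions introduced for illustration.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiation codons are handled.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(seq):
    """Translate a CDS, reading a recognised start codon as M and
    stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("GTGCATCTCG CCATCAACGC CATGTTCTGG"))  # MHLAINAMFW
print(translate_cds("AGTTCCGGTT GA"))                     # SSG
```

The two outputs match the first ten residues and the last three residues of the protein sequence listed above; the final TGA codon is the stop and contributes no residue.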