Gene RoseRS_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3594 
Symbol 
ID5210572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4491591 
End bp4492856 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID640597187 
Productglycosyl transferase, group 1 
Protein accessionYP_001277899 
Protein GI148657694 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0454071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TGCTGCTCTC CCCGTATCCG CCATACCCGC CGCGCGGTGG CGGTGCAATG 
CGCATATACC AGATCATCCG CGGTCTGGCG CAGCGCCATT CCCTGACGTG CCTCACCTTC
GTGCCCGATG CTGCGGCAGA ACAGGCGCTC GCGCCATTGC GCGACCTGTG CCGGTTGATC
ACAGTGCGCG GTCCTGCGCC ACGCTCGTTG CTCCGGCGCG CGTGGACGAC CCTGGCATCG
CCGTTCCCCG ATATGGCGCT ACGGAACGCA TCACCTGCCT TCCGCGCCCT CCTCTGCGAC
CTGGTTGCAC GCGAGCACGT TGACATCGTC CAGGCAGAAA GTATCGAGAT GGCGTCGTAT
CTCATCGAAC TGGCGCGCAA CGCCGGGGCG TCATCGTCGA TCGTCCAGCG ACCGTTGCTG
GCGCTCGATC AGTTCAACGC CGAATATGTG CTTCAGAAGC GCGCAGCAAT CACCGATCTG
CGCGCTGCAT TCACGCTCGC CGACCCGGTG CGTCGCGGCG CAGGCGGCGT CTATTCGTTG
ATCCAGTGGA TCAAACTGGC GCACTACGAA CGCCGCATCC TGCAGATATG CGATGCCGTC
ATTGTCGTTT CAGAAGAGGA TCGTAAAGCG CTGGAACGTC TCGGCGGAAC GTGTCGTGCA
GTCGTGCCGA ACGGCGTGGA CACCACATTC TTCAGCCGGG AGACGCTCAC CGGCGATCAT
CGGACGCCGC TGTCGTATGC AGCACCGGTG ATGGTATTCA GCGGAACGCT CGATTTTCGT
CCCAATATCG ACGCGATCGT CTGGTTCATC GAAGCAGTCC TGCCCCGCAT TCACGCCCGA
CGCCCCGACG TCCAACTGCT CGTCGTCGGG CGGCGCCCCG CGCCGATCCT GCGCCGCCTC
GCCGAACAGG GACGGCTCAT TCTGACCGGT GAAGTCAGCG ATGTGCGCCC ATTCCTGGCT
GGCGCTGCGG TCTACATCGT GCCCATGAGG ATTGGCGGCG GCATACGCCT GAAAGTGCTC
GAAGCATTCG CACTCGAAGC GCCGGTTGTC AGCACAACCC TGGGAGTTGA AGGGATCGCC
GGGTTGCGCG ACGGCGTCCA CTGCCTGCTG GCAGACACGC CGCAGCAGTT CGCCGATGCT
GTCGTGCGTC TGCTGGATGA TCCGGCGTTA AGGCGAATAC TCGGCGCTGC CGGACGGCGA
CTGGCACGTG CGGAGTACGA CTGGAAAGCG ATCATTCCCC GCCTGGAAGC GGTCTATCAG
CGTTGA
 
Protein sequence
MKILLLSPYP PYPPRGGGAM RIYQIIRGLA QRHSLTCLTF VPDAAAEQAL APLRDLCRLI 
TVRGPAPRSL LRRAWTTLAS PFPDMALRNA SPAFRALLCD LVAREHVDIV QAESIEMASY
LIELARNAGA SSSIVQRPLL ALDQFNAEYV LQKRAAITDL RAAFTLADPV RRGAGGVYSL
IQWIKLAHYE RRILQICDAV IVVSEEDRKA LERLGGTCRA VVPNGVDTTF FSRETLTGDH
RTPLSYAAPV MVFSGTLDFR PNIDAIVWFI EAVLPRIHAR RPDVQLLVVG RRPAPILRRL
AEQGRLILTG EVSDVRPFLA GAAVYIVPMR IGGGIRLKVL EAFALEAPVV STTLGVEGIA
GLRDGVHCLL ADTPQQFADA VVRLLDDPAL RRILGAAGRR LARAEYDWKA IIPRLEAVYQ
R