Gene RoseRS_4233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4233 
Symbol 
ID5211218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5303703 
End bp5304920 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content57% 
IMG OID640597822 
Productglycosyl transferase, group 1 
Protein accessionYP_001278526 
Protein GI148658321 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.968175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000923088 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCATCC TTGCACTCAC ATCCTGGTGG CCCGAACCGG CAGACAACGG CTCGCGGCTG 
CGTATTGCCA GTTTACTGCG CGCAATGGCG CAGCGCCACG ATATTCACCT CATTTCGTTC
TTTCAGGAGC CGGTTACCGA AGCGCAGATC CGAAGAATGC GCGAGATATG CACCGCAGTC
GAGGCAATCC CTCAACCGGT ATGGCGACCG CGTCCGGGAG AACAAATCCT GAGCCTGTGG
CACCCGGAAC CAAGTTCTTT TCGTGCCACC TGGAGCGCGG CATTCGATGC GTGTGTGCGA
CGTGCTGCAA CCGATGCGCC TGATATGGTG ATCGCCTTTC AAACCGGCGT CGCGCGGTAT
GCTCTAAGCG TACCGGGCGT TCCGCGGTTG CTCGAAGAAC TCGAAGTTGG AAATTTCTAC
ACCCACGTGC ATCTTCAGAA AATGCCGCAC CATCGGTTGC GCGCATGGTT AACGTGGCGC
AAACAGACGG CATACATCCG CCGTTTGCTT GGTCACTTCG ATGCCTGTAC TGTTGTTTCT
GTGAATGAGC AACGATTGAC CCATGCAATC GCGCCGGGTG CGACGGTTTA CGTTCTGCCG
AACGGAACCG ATGTGAGTGT CGGTGATCAG GATTGGGGCG CGCCCCAACC GGATACGCTG
ATCTATCCCG GTGCACTAAC ATTCGACGCC AATTTTGATG CCGTTGATTA TTTTCTGCGT
GATATTTTTC CACGCGTAAA GGCGCAGCGA CCGGAAGTGC GATTTGTGGT GACCGGCAAT
GCCCCGCCGA CGCTCAGAAC GGCGCTGCCA CAGATAGAGG GCGTCGAGTT TACCGGCTAC
GTTCCTGATG TTCGCCCGGT TATCGCGCGT TCCTGGTGTG AAGTCGTGCC CCTGCGATCA
GGCGGCGGGA CGCGACTCAA GGTGCTCGAA GCGCTCGCAC TCGGCGTTCC GGTCGTTTCA
ACGCCAAAAG GCATCGAGGG TCTGGCGCTT GATGATGATA TTCATGTCCT GGTTGCGCCA
ACTACCGATG AATTTGTAGA CGCAACGCTG CGCATTCTTG ATCAACCGGA ATTGCGCGCG
CGTCTGGCGG AAGCCGGGCG TCGTCGCGTG GCAGAGTTGT ACGACTGGCG AATCATCGGT
CAACAGATGA ATGAGTTAAT CGAGGAAATC ATTCGCCAGC ATTCGGGTAG ACGATCTGTT
TATAGCACGC ATGCCTGA
 
Protein sequence
MRILALTSWW PEPADNGSRL RIASLLRAMA QRHDIHLISF FQEPVTEAQI RRMREICTAV 
EAIPQPVWRP RPGEQILSLW HPEPSSFRAT WSAAFDACVR RAATDAPDMV IAFQTGVARY
ALSVPGVPRL LEELEVGNFY THVHLQKMPH HRLRAWLTWR KQTAYIRRLL GHFDACTVVS
VNEQRLTHAI APGATVYVLP NGTDVSVGDQ DWGAPQPDTL IYPGALTFDA NFDAVDYFLR
DIFPRVKAQR PEVRFVVTGN APPTLRTALP QIEGVEFTGY VPDVRPVIAR SWCEVVPLRS
GGGTRLKVLE ALALGVPVVS TPKGIEGLAL DDDIHVLVAP TTDEFVDATL RILDQPELRA
RLAEAGRRRV AELYDWRIIG QQMNELIEEI IRQHSGRRSV YSTHA