Gene RoseRS_3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3471 
Symbol 
ID5210448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4350926 
End bp4352098 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content63% 
IMG OID640597066 
Productglycosyl transferase, group 1 
Protein accessionYP_001277779 
Protein GI148657574 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.152793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0339025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATTC TCTACGTCGC CAGCGGCATT CCAGTCCCCG GTACGCTCGG CGGCTCGGTC 
CATACCCTCG AAGTTGCGCG CGGGCTGGCA CAGCGCGGGC ACACGGTCGA TGTGGTTGCC
TGCACTCGCC CTGACGTGTT CGATGTTGCC GCGCTGTTGC GCCCGATCTC GTCGCGCTAT
GATCGGTTTC GTTTGCACCA CATCGATGTG CCCAAAACAC TGGCGTTGCT CTCCGCACCC
GTGATCATGC GCCTGGCGCG CGCCCTGAAA CCGGACATCA TTATCGAACG GTACTACAAT
TTCGCCGGCG CCGGTATTCT GGCAGCCCGT CGCCTCGGCG TACCGTCGAT CCTCGAAGTC
AATGCGTTGA TTGTTGATCC GCCGGTTGTG TTGAAACGGC GTCTCGACGA TCTGCTTGGC
GGACCGATGC GGCGCTGGGC GGTTGCACAG TGCCGTATGG CAGACCGGAT CGTCACACCG
CTGCACACCA CTGTGCCGCC TGACATTCCG CGTTCCCGCA TCGTTGAATT GCCCTGGGGC
GCCGATGTGG AACGCTTCTG CATTGATCGT TCGCAGGAAG GCACGACACC TGCCCTGCCA
ACCGTCGTTT TCCTCGGTTC GTTCCGCGCC TGGCATGGCG TGCTCGATGC GGTGCGCGCA
GGAGGTCTCC TGATCGAACA GGGGCGCGTC TGCCATTTCC TCCTGATTGG CGATGGTCCG
CAGCACGCTG CCGCAGTGCG CCTGGCGGCG CGCTGGCAGG GACATTTCAC GTTCACCGGC
GCCGTTCCCT ACGACGATGT GCCATCACTC CTGGCGCGGG CATCGATCGC GGTCGCACCG
TTCGACACCG CAGCCCATCC GGCGCTGCGC GCTGCCGGAT TTTTCTGGTC GCCGTTGAAG
GTCTTCGAGT ATATGGCGGC GGCGCTGCCG GTCGTGACCA TCGACATCCC GCCGCTCAAT
CAGATCGTGC GTCACGGAAG CGAAGGGTTG CTCTACCCCG AAGGCGACGT TGATGCACTG
GCAGGGGCAA TCGCATATCT GATCGACCAT CCCGACGAAG CGCGCGCTAT GGGAGAGCGC
GGGCGGGCGC GCGTCACAGC GCATTTTTCA TGGTCGCGGC ACTGCGAGGC GCTGGAATGG
GTGATGGAGG AGACGTTGAA GGTTGAAGGT TGA
 
Protein sequence
MNILYVASGI PVPGTLGGSV HTLEVARGLA QRGHTVDVVA CTRPDVFDVA ALLRPISSRY 
DRFRLHHIDV PKTLALLSAP VIMRLARALK PDIIIERYYN FAGAGILAAR RLGVPSILEV
NALIVDPPVV LKRRLDDLLG GPMRRWAVAQ CRMADRIVTP LHTTVPPDIP RSRIVELPWG
ADVERFCIDR SQEGTTPALP TVVFLGSFRA WHGVLDAVRA GGLLIEQGRV CHFLLIGDGP
QHAAAVRLAA RWQGHFTFTG AVPYDDVPSL LARASIAVAP FDTAAHPALR AAGFFWSPLK
VFEYMAAALP VVTIDIPPLN QIVRHGSEGL LYPEGDVDAL AGAIAYLIDH PDEARAMGER
GRARVTAHFS WSRHCEALEW VMEETLKVEG