Gene RoseRS_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4094 
Symbol 
ID5211077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5133959 
End bp5135365 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content64% 
IMG OID640597682 
Productglycosyl transferase, group 1 
Protein accessionYP_001278388 
Protein GI148658183 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.537371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.779331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCC TGTTGATGAC CCCGTTTCTC ACTATTGGCG GGGCTGACCG GTTGAATCTC 
GATGTGGTGC GGCAACTCAC CGGGCGCAGG TTCCGGTTCA GCGTTGTTGC GACCCTGCCG
CATGCGCACG AGTGGCGTCC GCTGTTCGAG TCGATCACTC CCGATGTTGT GACGCTCCAT
CCCATGATTG CGCCTGAGCA GCAACCTGCA TTTGTGCGTG ATCTGATCCG TTCGCGCGAC
ATCCAGGCGC TGCTGATCAG CAACAGTCAG TTCGGCTATG TGCTGCTGCC CTACCTGCGC
CGTCACTGTC CTGATGTTGT TGTCCTCGAT CTGCTGCATG CAGTGGAACC GCACTGGCTC
GATGGCGGAT ATCCGCAACT CTCGCTCCAG CAACGCGCCT GGATCGACCT CAGTATCACG
GTATCGGGCG ATCTGCACGA CTGGATGATT GCGCGTGGCG GCGATCCGGA GCGGATCGTT
GTCAGCCCGG CAGCCATTGA TGTGAACGTG TGGAATCCGG CGCACTTTGA CCGGGCGACC
ATTCGCCAGG CGTCTGGCAT GCCTCTCGAT CTGCCCCTGA TCCTCTCCGT CGGGCGTCTC
GCACCAGAAA AGCGACCGCG CCTGGCGATG CGAATCCTGC GCGATGTTGC GCAGCGCGGT
GTTCCGTTCA GTGCGCTGAT CATCGGCGAA GGTCCTGAAC GTCCGGTGCT GGAACGCATG
CTGCACGATC CGGTGCTGCG CAACGTTCGA CTGACCGGGG CGCTTCCCCC GGAGCGGGTG
CGCGAGGCGC TGGCAGCCGC CGATCTCCTG CTCCTGCCCT CCGCGCGTGA AGGAATCGCA
ATAGTGCTCT ACGAAGCCAT GGCAATGGGA GTCGTCCCTG TTGCAGCCGA TGTTGGCGGG
CAACGCGAAC TCGTCACCCC CGACTGCGGC ATCCTTGTCC CGCCGTCGGG CGATGAAACC
ACCGCGTATG CCGCCGCGAT CATCGGTTTG CTGACCGATC CAACGCAGCG TGCCGCGATG
GGGGCGCGTG CACGCCAACG GATCGTCGAT CATTTCCGGC TCGATCTGAT GGGGGACCGG
ATGGAAGCCT TCATGCGGCA TGCAGTCGAG CATTCAGCGC GCGCGGGTCG CACCATCCCT
ACGCCGGAAG AAGCTGAGCG GAGCGCGATC GAAGCGATCC AGCTTGCGCG CCAGGCGCGC
AATGTTGCGC GTTTGTGGGA GACCGGCGGG TATGCCGGTG ACATTGATCT GTCACCAGCG
CGTCGTGCTG CGCTGCGGAT TGTGCGCAGC GCGCGGAAGC ATCTGCGACC GTGGTACCGG
CGACTCGCCG CGCACGACGA CAGTCGGTTG CGGCGCGGCG TCCTGACCGT GCGCGATTGG
GTGGTGCGAT GGGTGTATCG CGCGTAG
 
Protein sequence
MRILLMTPFL TIGGADRLNL DVVRQLTGRR FRFSVVATLP HAHEWRPLFE SITPDVVTLH 
PMIAPEQQPA FVRDLIRSRD IQALLISNSQ FGYVLLPYLR RHCPDVVVLD LLHAVEPHWL
DGGYPQLSLQ QRAWIDLSIT VSGDLHDWMI ARGGDPERIV VSPAAIDVNV WNPAHFDRAT
IRQASGMPLD LPLILSVGRL APEKRPRLAM RILRDVAQRG VPFSALIIGE GPERPVLERM
LHDPVLRNVR LTGALPPERV REALAAADLL LLPSAREGIA IVLYEAMAMG VVPVAADVGG
QRELVTPDCG ILVPPSGDET TAYAAAIIGL LTDPTQRAAM GARARQRIVD HFRLDLMGDR
MEAFMRHAVE HSARAGRTIP TPEEAERSAI EAIQLARQAR NVARLWETGG YAGDIDLSPA
RRAALRIVRS ARKHLRPWYR RLAAHDDSRL RRGVLTVRDW VVRWVYRA