Gene RoseRS_4088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4088 
Symbol 
ID5211071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5125245 
End bp5128343 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content58% 
IMG OID640597676 
Productglycosyl transferase family protein 
Protein accessionYP_001278382 
Protein GI148658177 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.127971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC GCGCTCATCC TGACGATCCA GCCTTTCGTG GAACGATCTA CGAAGCGTGT 
CTTCGTGAGC GTTATGCCTT TTGCTTGCCG TTTGTCCATG GTCGAGATGT GCTTGATGTT
CCTTGCGGGA CAGGATGGGG CAGTTCACTG CTGTCCGGCT ATGTTTCTCT CACCGGTCTT
GATATTGATC ACGACGCTAT TGACTACGCG AGAAAGCATT ATGCCGGTAT TCGCTTTATT
GAAGGTTCGA TGTGCCAGTT GCCTTTTGAG GATGCCAGTT TTGATACGGT TGTGTGCCTT
GAGGGGTTAG AGCATGTCTA TCTGAGCGAT GCACAGCGCT TCTTGCATGA AGCGCACCGG
GTTCTACGCG AGCATGGCAT CCTGGTAGTA ACGGCGCCAT TGCTCAACAA TGGCAGGCAT
TCATCGAACC CTTACCATCT GTACGAATTT GTCGCTGCTG AGCTGAAATC GTTGCTTTCG
CGTTATTTCA TCCCCATCCA CTGGGAGATT ATCCAGGGAG GCGATGGACC GGAAGCGCGT
TTTGTCGGTC AGCGTAAGGC GGTTATTAAT GCTGGAGCAT TCAGGATGGC TTCATCGTTG
ATGTTCGATC GCGTCTACGA CTGGGTGCTA TCGCTCACCG GAGAGCGCGG AGTGCGTTTT
ACCGCCGGTG GCGAAGAAAG TATTATTGCG ACCAGTTGCG CAGTGCTGAT ACTTGAAGGA
ATAGATCGCT TGCAGGATGT GTCGTCCTCC CTGCGCAACC AGTGGATCGC CTATCTCCAG
AGTTGTCAGC GCCCCACCGA TGGTTTATTC GTTGATCCGT TGCTGGAACG TTTCCCGATC
GAGAGCGCAA TCCACGATCA GGCGTATGTG CTCGATCAGA CAACCTATCT TGCTTTGCAG
GCGCTTGATG CGCTGGGTAG CGCCCCGATT CATCCGGTAT CGGTTCAAGA GCGCTGGCCC
AATCCACAGG CATTTATCGC CTGGATGGAA CGTCTGGACT GGTCGAATGC CTGGTTGCAG
AGCAATCGGG TGATGTTTGC GCTGGCGTTC CTGGTGCATG CCGTCGAACA GTGCAATCAA
CGGGAAGCAG CGTCAATCTA TCACGCCGCA CTCGACTGGC TTGATGCAAC ACAGGACCGC
GAAACTGGCG TGTGGGGAAC ACGTCATGGT GCGTCATTGC TCAACGGGAT GGCAGCCGCT
TACCATTTTC TGCCATTTTA TGAGTATGTG TGTCGTCCGA TTCAGTGTAT CAATCAACTG
ATTGATACAA CACTTGCGTT GCAGCAATCG GATGGCTTGT TTGGTTCAGG ACTGGGGGGT
GGAGCATGTG AAGACCTCGA TGCCATCGTA GTGCTCGCGG TTGCGGCACG TTACAGCCGC
TACCGGGCTG AGGAGGTGAA GCGAGCAGCC ATACGCGCTT TCTGGGCGTT GTGGAATGCG
CAAAACGAAG ATGGAGGATT CGGGTACGCA ATTCGTAGCG ATGATCAGGT CTATCGCTTC
AGTAGCTGGG GCGCAGCAGA GAGCCGTGTG TGTTCGAGCG ATGTCTGGTC GTCCTTGGCG
CGGCTCGTTG CCCTGGGTAC GATCCGGCAC TGGTTTCCCG ATGATACGCC TTCTTTGCCT
TTGTGGCGTT TTCGGCGTTG GCCGGCTTTG GGATATCATC GATCAACCGA TAGACTGGAC
GATAATGAGC GGGCACGCTT GAAGATATGG ATGCGTCCGC TCCCTGCACC CGAACGCCAT
ACCGGGACTG AACCAGGGGT AAGTGTTATC ATTCCCTGCT ACAACCTGGG GCGCTATCTC
TACGAGGCGC TGGCTTCGGC TTTGCAACAA ACATTGCAAC CGCTGGAAGT TATCGTTGTC
GATGATGGCT CGACCGACGA CTACACCCGT CTGGTTCTTG ACACGATCGA CCATCCGCAG
GTGCGCGTCA TCCGCCAGGA GAACTGCGGG CTGCCCGCTG CGCGCAATGC GGGTATTCGC
ATGGCTCGCA GTCCATTCAT TTGCTGTCTT GACGCCGATG ATCGCTTGCT TCCCACGTAT
TTTGAGCGAG TATTGCCATT ACTGGAGTCT GATCCGCAGG TGGGATTTGT CACCGGTCAC
TACCGTGAAT TCGATGGACG ATCAGGAGTG GTGGCGCCTT CAACCTGCGC GTTACCCGAT
ATGCTGGTCG TCAATCGAGC GATAGTCACG TCACTGTTCC GTCGGGAAGC ATGGGAGCGT
GCTGGCGGGT ACTGCGAAGA ATTGAGCGGA ATGCACGACT GGGATCTCTG GATTGGCATC
CTCGAAGCAG GATACCGTGC AGAGGTCGTG CCTGAGATAC TGTTCGAGTA TCGGGTACGT
CCCGGCTCTA TGTATGCCAC GACCAGCCAG CCAGAGAACT ATGCCCGCCT TGTCGGGCAA
ATAGTTGAGC GTCATGCGTC CCTCTACCAG CATTGGTGGC GTGATGTGGT CGTTCTGTAC
GCTCGTGAAC ACGCCAGTCT TGCTGCATAT GCTGAAGGAC AGGGACGGTT GTCGTCGCAG
ATCAGGAGCA ACAGTGCGCA GCAGGCGGCG AACTGGAAGC GTGCGGCGGA AGAGCGCGCG
GCGTGGATGG CGGAACTGGA AGCGGCGCGC GATTACCACG CGCAGCAGGC GGCGAACTGG
AAGCGTGCGG CGGAAGAGCG CGCGGCGTGG ATGGCGGAAC TGGAAGCGGC GCGCGATTAC
CACGCGCAGC AGGCGGCGAA CTGGAAGCGC CTGGCGGAAG AGCGCGCGGC GTGGATGGCG
GAACTGGAAG CGGCGCGCGA TTACCACGCG CAGCAGGCGG CGAACTGGAA GCATGTGGCG
GAAGAGCGCG CGGCGTGGAT GGCGGAACTG GAAGCGGCGC GCGATTACCA CGCGCAGCAG
GCGGTGAACT GGAAGCGTGC GGCGGAAGAG CGCGCGGCGT GGATGGCGGA ACTGGAAGCG
GCGCGCGACT ACCACGCGCA GCAGGCGGCG AACTGGAAGC ATGTGGCGGA AGAGCGTGGG
GCGTGGATAG CGGAATTGGA GCGCGGCTAC ATCCGTGTGC CGCGCAGTCG TGCCGTATGG
CGCCGGTTGA TGCAACGCAA ACGAGTAAGA TCCACATGA
 
Protein sequence
MAERAHPDDP AFRGTIYEAC LRERYAFCLP FVHGRDVLDV PCGTGWGSSL LSGYVSLTGL 
DIDHDAIDYA RKHYAGIRFI EGSMCQLPFE DASFDTVVCL EGLEHVYLSD AQRFLHEAHR
VLREHGILVV TAPLLNNGRH SSNPYHLYEF VAAELKSLLS RYFIPIHWEI IQGGDGPEAR
FVGQRKAVIN AGAFRMASSL MFDRVYDWVL SLTGERGVRF TAGGEESIIA TSCAVLILEG
IDRLQDVSSS LRNQWIAYLQ SCQRPTDGLF VDPLLERFPI ESAIHDQAYV LDQTTYLALQ
ALDALGSAPI HPVSVQERWP NPQAFIAWME RLDWSNAWLQ SNRVMFALAF LVHAVEQCNQ
REAASIYHAA LDWLDATQDR ETGVWGTRHG ASLLNGMAAA YHFLPFYEYV CRPIQCINQL
IDTTLALQQS DGLFGSGLGG GACEDLDAIV VLAVAARYSR YRAEEVKRAA IRAFWALWNA
QNEDGGFGYA IRSDDQVYRF SSWGAAESRV CSSDVWSSLA RLVALGTIRH WFPDDTPSLP
LWRFRRWPAL GYHRSTDRLD DNERARLKIW MRPLPAPERH TGTEPGVSVI IPCYNLGRYL
YEALASALQQ TLQPLEVIVV DDGSTDDYTR LVLDTIDHPQ VRVIRQENCG LPAARNAGIR
MARSPFICCL DADDRLLPTY FERVLPLLES DPQVGFVTGH YREFDGRSGV VAPSTCALPD
MLVVNRAIVT SLFRREAWER AGGYCEELSG MHDWDLWIGI LEAGYRAEVV PEILFEYRVR
PGSMYATTSQ PENYARLVGQ IVERHASLYQ HWWRDVVVLY AREHASLAAY AEGQGRLSSQ
IRSNSAQQAA NWKRAAEERA AWMAELEAAR DYHAQQAANW KRAAEERAAW MAELEAARDY
HAQQAANWKR LAEERAAWMA ELEAARDYHA QQAANWKHVA EERAAWMAEL EAARDYHAQQ
AVNWKRAAEE RAAWMAELEA ARDYHAQQAA NWKHVAEERG AWIAELERGY IRVPRSRAVW
RRLMQRKRVR ST