Gene RoseRS_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4075 
Symbol 
ID5211058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5109217 
End bp5111451 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content53% 
IMG OID640597663 
Productglycosyl transferase, group 1 
Protein accessionYP_001278369 
Protein GI148658164 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCG ATTATCAATC ACGCTATAGC TACGCCCCTG CCGATCCGTC CGCTGATCCC 
GATGTGCTGG TGGTTACACT GGTTGTATCG GTCACAGATC TTGCGCGCCA GGAAATGGCG
TTACGCGCTC AGTCACTGCA ATCCTGGCAG TGGGTCATTG TCTGTTTGCC TGCCCTGGCG
GAGGGATTGC TGCATCTCGC AACCGATCCG CGTGTGCGGG TTATTCCAGA GATGCACCCT
GCTTCAGCGT CGCGTCATCT CGTGCAATCG TCATCGTGCC GGTTTGTCTG TACCTTGAGC
GCCGATACTG CTCTTGATCC TACGTTCCTG GAGAAAGCGG CGTGGTTCCT GGCGACGAAC
CCCCACGTGG CGTGTTGTAA CGCATATACC CGCGATGAGA GCGGCGCTCT CTGGTCATAT
GGTTTCGAGC AGGGAGATCG GTTCCTTGAT ACGAACTATG CTGGGGCAAC TGCATTGTTC
TTCAAGGAAG ATCTGCTGGC GACAGGATTG CCGCTCCCAA TCAATAACGA TGACGCGTGG
GAAATATGGT TGCAACTCGC CGATCAGGGA AAGTGGGGCT TTACTATTCC CCAGCCGCTT
ATTCTCTCTC TCTATCCACT TTCGATGCCC CCTTTTCAAT CAAGAGATCC TTCACACCAT
ACGATACAGC GTGACCGATT GCGGGCGCGG TATGGTACGT TGCATGGTCG CTTTCCACAT
GTCGGATTGA CTGATGCCAC GCCATTTGAA GCAATCAGTG ATGCGCCGGT TCTTGCGAAT
CACCTGGCAA AAAGCACGAC AACGAAGCGT ATCTTGATGA TCGTGCCCTG GCTTATAGTT
GGTGGTGCAG AGCGAGTTAA CCTCAACCTG ATATCGTATC TGACGCAGAA AGGGTTCGAA
GTTAGTATTG TAACGACCCT GATCAGACAG CAACATCCCT GGTTGTCTGA GTTTGAGCGT
TTCACACCCG ATATTTTCAT CCTTGATCTC TTTCTTCGTC TCCCTGAATT TCCGCGATTC
ATCGTTTATT TGATCCGCTC GCGCGGCATT GATACCGTCA TTGTTGCTAA CAGTTATCTT
GGATATCAAT TGCTTCCTTT TTTACGTTCG CACTTCCCGG AAACGGTATT TGTCGATCTT
TTGCACAGTT ACGAAGAAAT GTGGAAAAAT GGAGGTTATC CACGCGCCAG CGTCGGTTAT
CAATCCCAAC TCGATCTGAC CATTGTAACC GGTGGTCATG TCAAGCGTTG GATGGTGGCT
CGTGGCGCCG ATCCAGAACG AATTCACATA TGTTATACCA ATGTGGATGT TGATCGTTGG
CGCCCTGATC CGCAACGTCG TGCAGAAATG CGGATAAGTC TTGGTCTTGA GGAAGATGAT
CTGTTGATTG TGCTGATTGG TCGGTTGACA AGCGAAAAAC GCCCTCTCCT TTTTGCGCCG
ATTATTGCGG CATTACGAAA AAAAACAGCT TCTCGTTTTC TGACGCTGGT GTTAGGTGAC
GGTCCGTTGC GAGGCCAGGT TGAACGTCAG ATACGAGAGT TGCGCCTCAG TGATGTTATG
CGCGCGTTGG GACGTGTCAA TGACGATGAA GTGTTTGGTT ATCTTGCAGC TGCTGATGTG
TTCCTGATCC CGTCACAGAC TGAGGGCGTT TCGGTGGCGA CGATGGAAGC TATGGCGATG
GGGGTCGTAC CGGTAAGCGC TGATGTTGGC GGTCAGGGCG AATTGATTAC GGCAGACTGT
GGTGTTCTCA TTCCTCATGG ACCTCATGAG GTTGATGAGT ATGCACGAGT GCTCGCACGT
TTCGCCGACG ATCCGGACAT GCGACGACGT ATGGGGCAGG CAGCTCGCAA ACGAGTCGAG
CGGCACTTTG CGTTACGTGA TTTTGGACCT CGTATGGAGG CGCTATTAGA GCAGGCGTGG
GAGTTGCACC ATACGGCGCC GCGTGTTGTG ATACCTATCG AATTGGCGCG TGAGTGGGCA
GCTCAAATCA TTGAATATAC CCGTCAGGAG ATGTCGCTTG ATGAAATATG GGCGGAGCGA
GAAAGCTGGC GTTCCAATCC GCGTCCCCAT CCATCGCTGG CAATCCCTCT GCGTCAATCT
TTTCGCCGCA GGGTGCTGCG ATTTGTCTGG GCTCAGGTTG CACATCTCTA CCGCTGGGGC
GTGAGTCGCA ATATGAGATG GCTCGTGCCC CTCAAGGAAC GACTGGTATC AGAAGTTTAC
CGCCGTGGCT GGTAG
 
Protein sequence
MNADYQSRYS YAPADPSADP DVLVVTLVVS VTDLARQEMA LRAQSLQSWQ WVIVCLPALA 
EGLLHLATDP RVRVIPEMHP ASASRHLVQS SSCRFVCTLS ADTALDPTFL EKAAWFLATN
PHVACCNAYT RDESGALWSY GFEQGDRFLD TNYAGATALF FKEDLLATGL PLPINNDDAW
EIWLQLADQG KWGFTIPQPL ILSLYPLSMP PFQSRDPSHH TIQRDRLRAR YGTLHGRFPH
VGLTDATPFE AISDAPVLAN HLAKSTTTKR ILMIVPWLIV GGAERVNLNL ISYLTQKGFE
VSIVTTLIRQ QHPWLSEFER FTPDIFILDL FLRLPEFPRF IVYLIRSRGI DTVIVANSYL
GYQLLPFLRS HFPETVFVDL LHSYEEMWKN GGYPRASVGY QSQLDLTIVT GGHVKRWMVA
RGADPERIHI CYTNVDVDRW RPDPQRRAEM RISLGLEEDD LLIVLIGRLT SEKRPLLFAP
IIAALRKKTA SRFLTLVLGD GPLRGQVERQ IRELRLSDVM RALGRVNDDE VFGYLAAADV
FLIPSQTEGV SVATMEAMAM GVVPVSADVG GQGELITADC GVLIPHGPHE VDEYARVLAR
FADDPDMRRR MGQAARKRVE RHFALRDFGP RMEALLEQAW ELHHTAPRVV IPIELAREWA
AQIIEYTRQE MSLDEIWAER ESWRSNPRPH PSLAIPLRQS FRRRVLRFVW AQVAHLYRWG
VSRNMRWLVP LKERLVSEVY RRGW