Gene RoseRS_3768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3768 
Symbol 
ID5210750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4710506 
End bp4712539 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content62% 
IMG OID640597364 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_001278072 
Protein GI148657867 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.092957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000191257 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAATCGTT TCGTCCAGAA ACTGCTGCCT GAAGACCCCG ATCTGCGCGC AGAATTGCGC 
CTGGGGTTGC TGCGCGTGCT CGTTGCCGCC AATCTGCTCC TCGGCTTCCT GTATCTGAGC
TGGCGCTACA CCGCAACGAT CAATTGGGCC GCCTGGCCCA TCGCCATCGG TCTGGTGATT
GCCGAAACAT ACAGTTATAT CGACGCATGG CTGTTTGGGC TGACCCTCTG GAAACTGAAG
CAGCGCGGCG AACCGCCGCC ACCGCCGCCC AACGCAACCG TCGATGTCTT CATCACCTGC
TACAACGAAC CGGTCGAGAT CGTGCGCGAA ACGGCGATCG CTGCACGCGA TATTCGTTAC
CCGCACCGCA CCTACCTGCT CGACGACGGC AATTCGCCAG CGATGCGGGC AATGGCAAAA
GAGATCGGCA TCGGGTATCT GGTGCGCTCC GAGGAATGGA AAGGAAAACA GCGTCACGCC
AAGGCAGGCA ACCTGAATAA CGCCCTCTGC CACACCAATG GCGAATTCGT GCTGGTGCTG
GACGCCGATC AGATTCCATC ACCCCATATC CTTGATCGCA CGCTTGGATA CTTTGCTGAT
GAGCGTGTGG CGCTGGTGCA AACACCCCAA TGGTTCTACA ACGTTCCCCC GGACGACCCG
CTCGGCAGTC AGGCGCCGCT CTTCTATGGT CCGATCATGC AGGGGAAAGA TGGCTGGAAC
GCTGCGTTCT TCTGCGGATC GAACGCCGTC CTGCGCCGCG AGGCGCTCAT GCAGATCGGC
ATCGCCAATT ATGTGCGTGA CCTGGAACTG CGGGTGCAGC GCGCCCTGCG CACCGCCGAT
ACCCTGCTGC ACAAGGCTGC GCAGCAGGTG AAAACCAACG GGAACCAGGC GCTTCAGGCG
GCCATTGACG AATTGCATGC CGCGGTGCGC ACGAGTCGTC GCATGTTGCG TGAGGGACAC
CCCATCCAGG AAGTCACCTG GTACTTCCAG CAACGCGCCG CCGCCGCCGC GCGCCCGATC
GTCGCCGACG ACCTGGCGCG CCTGCGCGAC GAACTGACGA CCATTCCAGG ACTCGAAGGG
GATGATGATC TCACGACCAC GCTGAGCCGA TCGCTCGACG ACGAAGCGAT CCTGCACACG
CTGACGACGC GCGAACGCTC GCCGCTGGCA GCCATCGCCA CCGTGCGCGA GTTGCTGCTG
GCAGTCGATA TCGACCGCGC CGATGAAGCG CAACCGGTGA TGCCGCTGGC AACGATCTCG
GTGACCGAAG ACATGGCGAC GGCGATGCGC CTGCACGCCG CAGGGTGGCG TTCCGTCTAC
CATCACGAAA TCCTGGCGCG CGGTCTGGCG CCGGAGGATC TCCGCTCGGC GCTTCAGCAA
CGTCTCCGCT GGGCGCAGGG AACGATCCAG GTGATGCTGC GCGAAAATCC GCTTTTCATA
CCGGGATTGC GCTGGGGACA GCGCCTGATG TACTTTGCCA CCATGTGGAG TTATCTTTCC
GGTTTTTTCA GCGTCATCTA CCTGTCCGCG CCGATCTTCT ACCTGATCTT TGGCATGCTG
CCGGTGCGCA CGCAGGCGGA CGAGTTTTTC TGGCGCCTCG TGCCCTACCT GATCACAAAC
GAACTGGTCT TCGCAGCGGC TGGCTGGAAA CTGCCCACCT GGCGCGGAAG GCAGTACAGT
CTGGCGCTCT TCCCACTCTG GATCCAGGCG GTCATCAGCG CCATCGGCAA TGTGTACGCC
GGACGACCGC TTGGTTTTGT GGTGACGCCG AAAGTGCGAC AGGGAGGCGC ACCGCTCTGG
GGCGTATTGC GCCTGGTGCG GATCCAGCTC ATCACCATGG CGCTCCTGGC GCTGGCTGCC
GTCTGGGGAT TGACGCGACT GGCGTTGGGC TGGCAGATTG AAGGCGTTCC AACACTGGTC
AATGTCTTCT GGATCGGGTA TGATTTGCTT ATGTTGAGCG TGGTGATCGA CGCGGCGCTC
TATCAACCGG ACGAACAGGA ACAATCGATG CCGGTGGCTG CGTCGGCAGC ATAG
 
Protein sequence
MNRFVQKLLP EDPDLRAELR LGLLRVLVAA NLLLGFLYLS WRYTATINWA AWPIAIGLVI 
AETYSYIDAW LFGLTLWKLK QRGEPPPPPP NATVDVFITC YNEPVEIVRE TAIAARDIRY
PHRTYLLDDG NSPAMRAMAK EIGIGYLVRS EEWKGKQRHA KAGNLNNALC HTNGEFVLVL
DADQIPSPHI LDRTLGYFAD ERVALVQTPQ WFYNVPPDDP LGSQAPLFYG PIMQGKDGWN
AAFFCGSNAV LRREALMQIG IANYVRDLEL RVQRALRTAD TLLHKAAQQV KTNGNQALQA
AIDELHAAVR TSRRMLREGH PIQEVTWYFQ QRAAAAARPI VADDLARLRD ELTTIPGLEG
DDDLTTTLSR SLDDEAILHT LTTRERSPLA AIATVRELLL AVDIDRADEA QPVMPLATIS
VTEDMATAMR LHAAGWRSVY HHEILARGLA PEDLRSALQQ RLRWAQGTIQ VMLRENPLFI
PGLRWGQRLM YFATMWSYLS GFFSVIYLSA PIFYLIFGML PVRTQADEFF WRLVPYLITN
ELVFAAAGWK LPTWRGRQYS LALFPLWIQA VISAIGNVYA GRPLGFVVTP KVRQGGAPLW
GVLRLVRIQL ITMALLALAA VWGLTRLALG WQIEGVPTLV NVFWIGYDLL MLSVVIDAAL
YQPDEQEQSM PVAASAA