Gene RoseRS_2637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2637 
Symbol 
ID5209606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3271706 
End bp3272971 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content66% 
IMG OID640596239 
Productglycosyl transferase, group 1 
Protein accessionYP_001276961 
Protein GI148656756 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.731543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CAATGTTGAG CGTTCATAGC AGCCCCCTCG CGCGTCTCGG CGGCAAAGAG 
GCGGGCGGCA TGAACGTCTA TGTCCGCGAA TTGAGCCGCG AGTTCGGACG TCGCGGCATA
GCCGTCGATA TATTCACCCG CGCCCAGGCG CACGACGCAC CGACGGTCGT TCAGATCGAT
CGGGGCGTGC GCCTGATCCA TGTGCGCGCC GGTCCACCGG CGCCCTGCGA TAAAAACCGC
CTGCTGGACT ATCTGCCGGA GTTCATCGGG CGGGTGCGCT GCTTCGCCGA CGGTGAAGAC
CTGCACTACG ACGTCATTCA CAGCCACTAC TGGGTTTCTG GCGAGGCGGC GCTGGCGCTG
CGCCGTAGTT GGGGTGCGCC GGTCGTTCAT ATGTTCCATA CGCTCGGCGC GATGAAAAAT
CTGGTGGCGC GCGGCGACCA GGAGCGTGAA ACCCGCGAGC GGGTCGCGGT TGAGGAGCGT
ATCCTGCGCG AAGCCGACGC GATTGTGGCA GCCACTCCGC TCGACCGGGC GCAGATGGTC
TGGCACTACG CCGCCGATGT GGGCAGGATT CGTGTTGTTC CAGCAGGGGT TGATCTGCGC
CGCTTTCAGC CGCGCGATGC AGCAATGGCG CGCACAATGC TCGATCTCCC GCCAGCGCCG
CACCGCATCA TCCTGCTGGT GGCGCGTATT GAGCCGCTCA AAGGCATCGA TGCGCTGATC
GAAGCCAGCG CCCTGCTGGT GCAGCGCCAC CCTGAGTGGC GCGACACGCT GACGGCATTG
ATCGTCGGTG GGGGCAGCGA GGAGGAACGG GCGCACTGGA ACGCCGAGCA GCGGCGCCTG
GACGCCATCC GGCAGCGGCT TGGGATCGCC AATGTTGTGC GATTCGCCGG CGCGCAGCCG
CAGGAACGCC TGCCGCTCTA CTACGCGGCT GCCGATGTCG TTACCATGCC GTCTCATTAC
GAGTCATTCG GGATGGCGGC GCTCGAAGCG CTGGCATGCG GCAAGCCGGT GATAGCAACG
AGTGCAGGCG GTCCGGCGTT TATCGTCGAA GATGGCGTCA GCGGTCTGCT GACCCCGCCT
TCCGACCCGC CGACCCTCGC GCGACACCTT GAGCGCCTGC TGCTGAATGA CGACGAGCGC
GCAACGATGG GCGCTGCGGC ACGGGAACGG GCGCTGCGGT TCGGTTGGGA GCATATTGCG
TGTGACATTC TCGGCATCTA CCGCGACCTG TTGCAGCAGC GCGACCGTCA GGCGCGGGCA
GGGTAG
 
Protein sequence
MRIAMLSVHS SPLARLGGKE AGGMNVYVRE LSREFGRRGI AVDIFTRAQA HDAPTVVQID 
RGVRLIHVRA GPPAPCDKNR LLDYLPEFIG RVRCFADGED LHYDVIHSHY WVSGEAALAL
RRSWGAPVVH MFHTLGAMKN LVARGDQERE TRERVAVEER ILREADAIVA ATPLDRAQMV
WHYAADVGRI RVVPAGVDLR RFQPRDAAMA RTMLDLPPAP HRIILLVARI EPLKGIDALI
EASALLVQRH PEWRDTLTAL IVGGGSEEER AHWNAEQRRL DAIRQRLGIA NVVRFAGAQP
QERLPLYYAA ADVVTMPSHY ESFGMAALEA LACGKPVIAT SAGGPAFIVE DGVSGLLTPP
SDPPTLARHL ERLLLNDDER ATMGAAARER ALRFGWEHIA CDILGIYRDL LQQRDRQARA
G