Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2637 |
Symbol | |
ID | 5209606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3271706 |
End bp | 3272971 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640596239 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001276961 |
Protein GI | 148656756 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.731543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCG CAATGTTGAG CGTTCATAGC AGCCCCCTCG CGCGTCTCGG CGGCAAAGAG GCGGGCGGCA TGAACGTCTA TGTCCGCGAA TTGAGCCGCG AGTTCGGACG TCGCGGCATA GCCGTCGATA TATTCACCCG CGCCCAGGCG CACGACGCAC CGACGGTCGT TCAGATCGAT CGGGGCGTGC GCCTGATCCA TGTGCGCGCC GGTCCACCGG CGCCCTGCGA TAAAAACCGC CTGCTGGACT ATCTGCCGGA GTTCATCGGG CGGGTGCGCT GCTTCGCCGA CGGTGAAGAC CTGCACTACG ACGTCATTCA CAGCCACTAC TGGGTTTCTG GCGAGGCGGC GCTGGCGCTG CGCCGTAGTT GGGGTGCGCC GGTCGTTCAT ATGTTCCATA CGCTCGGCGC GATGAAAAAT CTGGTGGCGC GCGGCGACCA GGAGCGTGAA ACCCGCGAGC GGGTCGCGGT TGAGGAGCGT ATCCTGCGCG AAGCCGACGC GATTGTGGCA GCCACTCCGC TCGACCGGGC GCAGATGGTC TGGCACTACG CCGCCGATGT GGGCAGGATT CGTGTTGTTC CAGCAGGGGT TGATCTGCGC CGCTTTCAGC CGCGCGATGC AGCAATGGCG CGCACAATGC TCGATCTCCC GCCAGCGCCG CACCGCATCA TCCTGCTGGT GGCGCGTATT GAGCCGCTCA AAGGCATCGA TGCGCTGATC GAAGCCAGCG CCCTGCTGGT GCAGCGCCAC CCTGAGTGGC GCGACACGCT GACGGCATTG ATCGTCGGTG GGGGCAGCGA GGAGGAACGG GCGCACTGGA ACGCCGAGCA GCGGCGCCTG GACGCCATCC GGCAGCGGCT TGGGATCGCC AATGTTGTGC GATTCGCCGG CGCGCAGCCG CAGGAACGCC TGCCGCTCTA CTACGCGGCT GCCGATGTCG TTACCATGCC GTCTCATTAC GAGTCATTCG GGATGGCGGC GCTCGAAGCG CTGGCATGCG GCAAGCCGGT GATAGCAACG AGTGCAGGCG GTCCGGCGTT TATCGTCGAA GATGGCGTCA GCGGTCTGCT GACCCCGCCT TCCGACCCGC CGACCCTCGC GCGACACCTT GAGCGCCTGC TGCTGAATGA CGACGAGCGC GCAACGATGG GCGCTGCGGC ACGGGAACGG GCGCTGCGGT TCGGTTGGGA GCATATTGCG TGTGACATTC TCGGCATCTA CCGCGACCTG TTGCAGCAGC GCGACCGTCA GGCGCGGGCA GGGTAG
|
Protein sequence | MRIAMLSVHS SPLARLGGKE AGGMNVYVRE LSREFGRRGI AVDIFTRAQA HDAPTVVQID RGVRLIHVRA GPPAPCDKNR LLDYLPEFIG RVRCFADGED LHYDVIHSHY WVSGEAALAL RRSWGAPVVH MFHTLGAMKN LVARGDQERE TRERVAVEER ILREADAIVA ATPLDRAQMV WHYAADVGRI RVVPAGVDLR RFQPRDAAMA RTMLDLPPAP HRIILLVARI EPLKGIDALI EASALLVQRH PEWRDTLTAL IVGGGSEEER AHWNAEQRRL DAIRQRLGIA NVVRFAGAQP QERLPLYYAA ADVVTMPSHY ESFGMAALEA LACGKPVIAT SAGGPAFIVE DGVSGLLTPP SDPPTLARHL ERLLLNDDER ATMGAAARER ALRFGWEHIA CDILGIYRDL LQQRDRQARA G
|
| |