Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0402 |
Symbol | |
ID | 5207338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 509117 |
End bp | 510601 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594028 |
Product | sugar transferase |
Protein accession | YP_001274783 |
Protein GI | 148654578 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03013] sugar transferase, PEP-CTERM system associated [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.597497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGTC GCAGCGTTCT CAACCATCCA ACACAGCCGT GTGAGATGTC ATTCCATCTG CGCTTGGAGA TTTCCGAGCG TCGGTTGTTG CTGCGCGTTG GTGATCTGGC GTTGACCATG CTCGCCGTCT TCGGAGCGTT GTGGTTCTGG GCGCGTCTGG CGGATCGTGC GCTCAATGTC GCTCTGATCC AATCGCAGTT CAGTTGGTTG GCGCTGATCG GCATTGGCTG GCCCCTCTGG CTGATGCTCG CAGATATGTA CAATCTGCGT CTGGTCGCAC GCATCGGTCC GAGCGTGCGC CGGATTCTGC TTGGCGGTCT GGCGCTGCTG TTCGCGTATC TGGCGCTGTT CTTCGTACTG TCACGTGCGC CGGTCACCGG GATGCTTGCG TCGATTGAAA TTGGCACGCC GCAACTGCGC CTGGCGCCTG CGCTTGCGAT TGTGCTGCTG GTCGCGTCGA TGGCGATCTG GCGTCTGGCA TACATTCGTG TGCTGGGCGC ACCGCACGCG CGACGTCGGT TGCTGATCCT GGGAACGGGT CAGGCAGGTT CGGCGCTGTC GCATGTTATT CTGAAGGGGC ACACCCCATA CTACGAGATT GTCGGGTTTG TCGATGATGC CCCCCTTCCA TCCTGTGACT GCATCGGCAG CGTGCCGGTG CTTGGCGGCG TTGACCGCCT CGGCGATGTC GTCTGGGATC AGCGTGTCGA TGAGATTGTG ATTGCCAGCA GCGATGTCAG CGGCGAACTG CTCCAGTTCT TGATCGACTG CTATGAACAC GGAGTTGCTA TTACGCCGAT GCCGCTGCTG TACGAACGGT TGACCGGGAA GATTGCAGTG GAACACGCTG GCAATCAGTG GCATGTGGCG CTGCCGTTGC AATCACGTCC GACACGGACT GCCGAGGCGG TGTTGAAACG CATGCTCGAT CTGGTCGGCG GTCTGGTTCT GTTTGGGTTG CTGCTTGTTC TGCTGCCATT CATTGCGCTT GCGATTCGTC TTGACACGCC CGGTCCCATC CTGTACCGGC AGCAGCGTGT TGGCTGGCGA GGACGAATCT TTACAGCGCT CAAGTTTCGC TCGATGGTTC AGGATGCCGA ACCGGACGGC GAGGCGCAGT GGGCATCGAA GGACGACTCG CGTGTGACGC GCGTTGGGCG CTGGTTACGG CGCACCCGCC TTGATGAACT ACCGCAGGCG TTGAATGTTC TGCGTGGCGA GATGAGTCTG GTCGGACCAC GTCCCGAACG ACCGGAGTTT GTCGAGCAGT TGCAGCGCGT TATTCCGTTC TACCGCGCGC GACTGGCGAT CAAACCGGGG CTGACAGGGT GGGCGCAGAT CAACTATGGC TATGGCAATA GCATTGAAGC ATCGCTTGCC AAACTCCAGT ACGACCTTTA CTATCTGAAG CATCAGTCGT TCTGGTTCGA TCTGCTCATT CTGGCGCGAA CGGTCTACGT TGTGCTGTTG ATGAAAGGTC AATAG
|
Protein sequence | MSRRSVLNHP TQPCEMSFHL RLEISERRLL LRVGDLALTM LAVFGALWFW ARLADRALNV ALIQSQFSWL ALIGIGWPLW LMLADMYNLR LVARIGPSVR RILLGGLALL FAYLALFFVL SRAPVTGMLA SIEIGTPQLR LAPALAIVLL VASMAIWRLA YIRVLGAPHA RRRLLILGTG QAGSALSHVI LKGHTPYYEI VGFVDDAPLP SCDCIGSVPV LGGVDRLGDV VWDQRVDEIV IASSDVSGEL LQFLIDCYEH GVAITPMPLL YERLTGKIAV EHAGNQWHVA LPLQSRPTRT AEAVLKRMLD LVGGLVLFGL LLVLLPFIAL AIRLDTPGPI LYRQQRVGWR GRIFTALKFR SMVQDAEPDG EAQWASKDDS RVTRVGRWLR RTRLDELPQA LNVLRGEMSL VGPRPERPEF VEQLQRVIPF YRARLAIKPG LTGWAQINYG YGNSIEASLA KLQYDLYYLK HQSFWFDLLI LARTVYVVLL MKGQ
|
| |