Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1415 |
Symbol | |
ID | 5208367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1723578 |
End bp | 1724516 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640595026 |
Product | glycosyl transferase family protein |
Protein accession | YP_001275765 |
Protein GI | 148655560 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00435994 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGATA TCATCATTCC AAACTACAAC GGCGCCGCGT TATTGCCGAC ATGTCTCGAC TCGCTGCGGG CGCAGACGCG GCGCGATTTC TGCGTGGTTG TGGTTGACGA TGGCAGCACG GACGACTCGG TGGCGCTGGT GCGGCGGCGT TACCCCGAAG TGCAGGTGAT TGCCCTGCCG CGCAACCGTG GGCTGGCAGC AGCAGTCAAT GCGGGGATCG AGGCGACCGG CGGCGAGTAT GTGGTGCTGC TGAACAATGA TACCGAAGCG CATCCGCGCT GGCTAGAACA CCTGATTGGC GCACTGGACC GATATCCTGC ATATGCGTTC GCTGCCAGTA AGTTGATGCT CTTCGACCGC CGTGATCATT TTCATTCGGC AGGCGATTAC TATCGTCTGG ACGGTGTTCC GGGGAGTCGC GGGGTCTGGC AGCGTGATAT TGGTCAGTTC GATGTGATGG AAGAGGTATT CGGACCATGT GCGGGCGCTG CGGCATACCG CCGGGCAGCG CTGGAAGAAC TGGCTGAGCA GGGTCGGGTG TTTGATGAAG ACCTGGTCAT GTACTGTGAA GATGTCGATC TGAATGTGCG CGCCAGATTG CGTGGCATGC GCACCGTCTA TGTGCCACGT GCAGTGGTGT ATCATCGATT GAGTGCAACG GGTGGCGGTG CGCTGGCAAG TTACTACTGC GGGCGCAATT TTATGCTTGT ATGGGCAAAG AACATGCCAA CGACGCAAGC GCGGCGCTAC TGGCCCCTTC TGCTCTGGTC ACAGATCGGT TTTGCGTTTC ATTCGATCTG GCATATTCGC GAACCCGCAG CGCGCGCCCG TTTGCGTGGT CAGATAGACG GATTACGGGC ATTGCCGCAG TTTTTGCGCA AACGGCGTCG CAGCCGGGAA AGGTATGCCG ATCGCCTTGT CGTCGAAACA CGTTATTAA
|
Protein sequence | MIDIIIPNYN GAALLPTCLD SLRAQTRRDF CVVVVDDGST DDSVALVRRR YPEVQVIALP RNRGLAAAVN AGIEATGGEY VVLLNNDTEA HPRWLEHLIG ALDRYPAYAF AASKLMLFDR RDHFHSAGDY YRLDGVPGSR GVWQRDIGQF DVMEEVFGPC AGAAAYRRAA LEELAEQGRV FDEDLVMYCE DVDLNVRARL RGMRTVYVPR AVVYHRLSAT GGGALASYYC GRNFMLVWAK NMPTTQARRY WPLLLWSQIG FAFHSIWHIR EPAARARLRG QIDGLRALPQ FLRKRRRSRE RYADRLVVET RY
|
| |