Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3682 |
Symbol | |
ID | 5210661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4610415 |
End bp | 4611407 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640597276 |
Product | glycosyl transferase family protein |
Protein accession | YP_001277987 |
Protein GI | 148657782 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.735798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.011793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCCG TCATCATTGT GACCTGGAAC GGCAGACGAT TCCTGGATGC CTGCCTGCGC GCGGTCACGG CGCAACTTCA CCCCGACGAT GAAATTATTG TCGTCGATAA CGGTTCCATC GATGGCACCG CCGTGTGGCT ACGGCATGCC TGGTCCGCTG TGCGTCTCGT GGCTCTGCCA GCAAATCTGG GGTTCGCTGG CGGCGTCAAC GCCGGCTTGC GCGCCGCGCG TGGTGATCTG CTTCTGCTGC TGAATAATGA CGCATTTGTC GAACCCGGAT GCGTGCCAGC GCTTGTCGAG GCGCTGAGGG ATCACCCGTG CTCTGGCGCG GTTGCTGGCG TGCTGACCTT CGATCATCGC CCTGATCTGG TGGCTTCTGC TGGTATCACC GCCTGTCGCA ACGGTCTGGC GCTCGATCTG TGGACGGGGC GTGCCGTGCG ATCGTTACCC GCTGCACCGC AACCGGTGAT GGGGGCAAGC GGCGGGCTGG CGCTCTACCG GCGGACGATG CTCGACGACA TCGGCTTGAT GGCGCCGGAT TTCTTCAACT ATCTGGAAGA TGTCGATCTG GCGTGGCGCG CACAACTGCG CGGATGGGAA TGCCTGGTCG TTCCTGCGGC GCGGGCGCGG CACATCTACT CCGCCACCGG CGGTCAGGGA TCACCGCTCA AGCAGCGATT GCTGGGACGC AACCGGTTGC GCGCGATCAT TCGTTGTTTT CCTTCGGGTG TTCTGCGTTC CTGCCTGCCG GATATTCTGG CATACGATAC CCTGGCGCTG GCATACGCTG CACTTGCCCG TCAGCCAGCA ATGATCGCCG GGCGCATCGA GGCGCTGCAC GACCGGGCGC AACTCTTGCG CGAACGCCGC GCGATCCAGG CGCGGCGCAC CGTATCGGAA GAAGCGTTTG CCGCCTGGCT CGAACCATCA CCAACGCCAT GGCGAACCCT GCGAACAGCT CGCCGTCTCG ACGCCCTGCT CCGCGACCGT TGA
|
Protein sequence | MISVIIVTWN GRRFLDACLR AVTAQLHPDD EIIVVDNGSI DGTAVWLRHA WSAVRLVALP ANLGFAGGVN AGLRAARGDL LLLLNNDAFV EPGCVPALVE ALRDHPCSGA VAGVLTFDHR PDLVASAGIT ACRNGLALDL WTGRAVRSLP AAPQPVMGAS GGLALYRRTM LDDIGLMAPD FFNYLEDVDL AWRAQLRGWE CLVVPAARAR HIYSATGGQG SPLKQRLLGR NRLRAIIRCF PSGVLRSCLP DILAYDTLAL AYAALARQPA MIAGRIEALH DRAQLLRERR AIQARRTVSE EAFAAWLEPS PTPWRTLRTA RRLDALLRDR
|
| |