Gene Information |
Locus tag | Saro_3226 |
Symbol | |
ID | 3917484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3445570 |
End bp | 3446523 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640446010 |
Product | glycosyl transferase family protein |
Protein accession | YP_498495 |
Protein GI | 87201238 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
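The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 954 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(3445570, 3446523)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values in this record, the two functions reproduce the Gene Length and Protein Length fields.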
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACG TAACGTACAG TGCCGTATCA CCGGAAGTCA CGGTCATCGT GGTCAGCTAC AACACGCGCG ACCTTACGCT GCGTGCGCTG GAAACGCTGT TTGCCAATGC CGGTGACGTC TCGATGCGGG TCGTGGTGTG GGACAATGCC TCGCACGATG GGTCTGCCGA TGCCATCGCC CAGGCGTTCC CCGACATCGA GCTTGTCCGC AGCGACGAGA ACCTCGGTTT CGCGGTTGCC AACAACCGGG TCGCAGAGGC GGCAGATACC GAATGGCTGC TCTTGCTGAA CCCGGACACC GAAACCTATC CCCGCGCGGT GGAGAACATT CTCCGCTTCG GGCGCGAGCA TCCCGAAGCC GGCATCGTCG GCGGCCGGAC GGTCTTTCCG GACGGATCGC TCAACGCCGC CTCCTGCTGG AACACCATGT CGGTGTGGAG CCTGTTCTGC TCGGCCGTGG GCCTGTCGCG CCTGTTTCCG GACACGATTG CCTTCAATCC CGAAGGCATC GGTGGATGGA AGCGTGATTC GGTGCGCCAC GTCGATGTCG TTGTCGGCTG CTTTCTCCTG ATCCGGACGG ACCTGTGGCA CAAGCTCGGC GGGTTCAACC AGCGCTACTT CATGTATGGC GAGGAACACG ACCTGTGCCT GCGCGCGGCG CAGCTCGGTT ACCGTCCGAT GATCACGCCC GATGCCCAGA TCATGCACCT GGTCGGCGCC TCCACTTCGA AGCGCGAAGA GAAGATCGTC CAGCTCATGC GCGCCAAGGC CACGCTGGTG CGCGACCACT GGCGTGGATG GCGCGTTCCG CTGGGGCTCG GCATGCTCTG GCTGTGGATC GCCACGCGCC GGGCCGGCTC CGCGATTGCG GCGACCGTCA GGGGGGAGGC CGCCCGCGCC AGCGTGTGGC GCCGGGTCTG GCGGGATCGC CGGGAATGGC TTTCGGGATA CTGA
|
Protein sequence | MTDVTYSAVS PEVTVIVVSY NTRDLTLRAL ETLFANAGDV SMRVVVWDNA SHDGSADAIA QAFPDIELVR SDENLGFAVA NNRVAEAADT EWLLLLNPDT ETYPRAVENI LRFGREHPEA GIVGGRTVFP DGSLNAASCW NTMSVWSLFC SAVGLSRLFP DTIAFNPEGI GGWKRDSVRH VDVVVGCFLL IRTDLWHKLG GFNQRYFMYG EEHDLCLRAA QLGYRPMITP DAQIMHLVGA STSKREEKIV QLMRAKATLV RDHWRGWRVP LGLGMLWLWI ATRRAGSAIA ATVRGEAARA SVWRRVWRDR REWLSGY
|
| |
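The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino-acid assignments shared by the standard code and translation table 11, which differ only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical; a production workflow would more likely rely on an established library.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order; identical for tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    head = "ATGACCGACG TAACGTACAG TGCCGTATCA"
    print(translate(head))  # first ten residues of the protein
```

Passing the full Gene sequence field to `translate` should yield the Protein sequence field once the block spacing is removed, and `gc_fraction` on the same input can be compared with the GC content field.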