Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3069 |
Symbol | |
ID | 3916683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3288488 |
End bp | 3289687 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445851 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_498338 |
Protein GI | 87201081 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.687268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACA CACCGACCCC GCCCAAGGTC CTGCTGCTGT TGACCTCGCT CCACGGCGGC GGCGCCGAAC GTGTCGCCGT GCATCTGCTG AACCGCCTTC AAGGGCGCTT CGACATGCGC ATGGGCCTGC TCCGCGCCTC GGGCCCCTAC CTCGACCAGG CCGACCGGTC GCGGCTGATA GTGGCGCCGG AAGGCGAGAC GCACTTCAAC TTCGACGGTC CCAATTCCGC CAATTACCGC CCCGGAAAGC TGGTCGGCAG CGCAGTGCGG GCACCGCTCG CATTCCGCAG GATGATCCGC GAAACGCAGC CTGACGTCGT GCTGAGCTTC CTCAAGGGCA CCAACCTGCT GGTCTGGCTG GCGCTGATGA ACATGGGCCG CGCCCGACCG CGCTGGATCG CGCGCGAAGG CAACAACGTG CTGGCCGTCA TCCGCGAGGA AGCGCCCAAC GGCGCCGTGG CGCGGGCATC GCGTGACCTT ACGGCCAAGG CCTATCGGCG GGCCGATGCC GTTCTCGCAA ATTCCACCGA CATGGCCGCG GGGCTGATCA CCGATCTCGA TCTCGATCCC GCGAAGATGC GGATGATCAA CAATCCCATC GACATCGACG GCATACGCGA GGCAGCGGGC GAGAGCCTTC CGGGCGCGCC CAACCGGCCC TTCATCCTGA CCGCGGGCCG GCTCGAATAC CAGAAGGCGC ACGAGGTGCT GCTGCGCGCC TTCGCGCGGA GCGAAGCGTG GCGCACGCAC GCGCTGGTGA TCCTCGGCAA GGGGAGCCGG CTGGGCGAAC TGCACCGCCT CGCCGCACAG CTCGGCATCG GCGAGTACGT GCGCTTCATC GGCTTCGTCC CCAACCCCTA TGCCTGGATG GCGCGCGCCG ATCTGTTCGT GCTGCCTTCG CGGTGGGAAG GATTTCCGAC CGTGGCGGCC GAGGCGATGG CCTGCGGCAC GCCCCTGCTG CTGACCGACT GCAGATTCGG CGCGCGCGAT ATCGTGGAGC CCGGAGTGAC CGGGGAACTG GTGCCAGTGA ACGACGAGGC AGCGCTGGCC ACCGAAATCG CGGCACTGCT GGCTTCGCCG GAGCGGCGCA GTGCGCTGGC ACGGGCCGGA CGCGAGAAGG TGGAACGGTT CAGGCTTGAA CGAATGCTGG AAGCCTACGC TGCCCTCTTC GACGAACAGT TCGCCGCGCG TCGTCGTTAA
|
Protein sequence | MNNTPTPPKV LLLLTSLHGG GAERVAVHLL NRLQGRFDMR MGLLRASGPY LDQADRSRLI VAPEGETHFN FDGPNSANYR PGKLVGSAVR APLAFRRMIR ETQPDVVLSF LKGTNLLVWL ALMNMGRARP RWIAREGNNV LAVIREEAPN GAVARASRDL TAKAYRRADA VLANSTDMAA GLITDLDLDP AKMRMINNPI DIDGIREAAG ESLPGAPNRP FILTAGRLEY QKAHEVLLRA FARSEAWRTH ALVILGKGSR LGELHRLAAQ LGIGEYVRFI GFVPNPYAWM ARADLFVLPS RWEGFPTVAA EAMACGTPLL LTDCRFGARD IVEPGVTGEL VPVNDEAALA TEIAALLASP ERRSALARAG REKVERFRLE RMLEAYAALF DEQFAARRR
|
| |