Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3225 |
Symbol | |
ID | 3917483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3444277 |
End bp | 3445527 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640446009 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_498494 |
Protein GI | 87201237 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.503811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGCG CCGGGTGCCG GCCGAAGTGC CCCAGGGCAC ATCCGGAAGG CCAAGTGACC ACGCTCTTCC ACCCCAGGCC GGACATCCTG CACATCAGCG GCGACTTCCC GGACCCGATC AATCCTTCCA AGACCCCGGT CATCCGCACG CTGCTGGAAA TGACCGACGA TACGTTCGCT CATCGGGTCA TTTCGCTCAA TCGCAAGTCG CCCGGTCCCG TAGCCCTGTC CGCCACGTTG CTGGGACGCT CTCCGCTTTC GCGCATTGTC CCTTTCGAAC GGGGCATGGC AGCCGAGTAC CTGGCCCCGC CGATGGGGCT GCTTCACAAG ACCCTCTTGC ACCGGGTAGC CGACGCGCTG GTCGAACGGC TCGCGGAAGG TCCACGGCCC GCCCTGATCG TGGGCCACAA GCTTTCCATC GAGGGGATCG TGGCGCGCCG GATGGCACGC AGGCTCGACA TTCCGTTCGC GATCTCGATC CAGGGCAACA CCGATACGCG TATCGTGGAT GCCCGCCCGG ACTTGCGCGG CGAACTGCGA CGCGTTTTCC ACGAGGCGTC CGTCGCATTT CCCTTCGCGC CGTGGGCGTT GCGGCGGATC GAGGCACGGC TGGGCCGGCG CCGGGGCCTC ACAACGATGC TTCCCTGCCC TACCGACATC GACGAGCCTG TGGCCCCGGC GATGGACGGC AATCAGCTGG TCTCGGTGTT CCACCTGCGC AACCATGGCC TCAAGAACCT GCGTGGCCTT GCCTTGGCGA TGCGCGAGCT TGCCTCGACC GACCCGGATA TCCGGCTTTC GATCATCGGC GGCGGGAGCG AGGAAGACTT CGGGACATGC CGCGCGATCC TCGGCGACCT GCCAAACGTG GAACTGGCCG GTGCGATGGA CCGGCGCCAA CTGCGCGAAG CGCTCGGGGG GGCGGCAGGC TTCGTCCTGC CCTCGCTGCG CGAAAGCTTC GGGCTCGTTT TCATTGAAGC GCTGTTCTGC GGTCTGCCTG TGGTCTATCC GACCGGCAGG GCAGTGGACG GCTATTTCGA CGGTGAGCCC TTCGCCATCG GCGTCGACCC CCGGCAACCG GGGCGGATCG CGGAGGGTAT GCGCACGCTG GTGCGCGAGC AGGCGCCGCT CAAACGTGCG CTCGCACGCT GGCAACAGGA CGGTCGCGCT CGGCAATTCA CCCGACCGGA AATCGCCGCA TCCTTCGCCG CAGGGCTTAC CGCCGCCGCA GAAGCCGGAA CGCGCCCGTA G
|
Protein sequence | MDGAGCRPKC PRAHPEGQVT TLFHPRPDIL HISGDFPDPI NPSKTPVIRT LLEMTDDTFA HRVISLNRKS PGPVALSATL LGRSPLSRIV PFERGMAAEY LAPPMGLLHK TLLHRVADAL VERLAEGPRP ALIVGHKLSI EGIVARRMAR RLDIPFAISI QGNTDTRIVD ARPDLRGELR RVFHEASVAF PFAPWALRRI EARLGRRRGL TTMLPCPTDI DEPVAPAMDG NQLVSVFHLR NHGLKNLRGL ALAMRELAST DPDIRLSIIG GGSEEDFGTC RAILGDLPNV ELAGAMDRRQ LREALGGAAG FVLPSLRESF GLVFIEALFC GLPVVYPTGR AVDGYFDGEP FAIGVDPRQP GRIAEGMRTL VREQAPLKRA LARWQQDGRA RQFTRPEIAA SFAAGLTAAA EAGTRP
|
| |