Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2049 |
Symbol | |
ID | 3917696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2189261 |
End bp | 2190664 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444801 |
Product | sugar transferase |
Protein accession | YP_497322 |
Protein GI | 87200065 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03013] sugar transferase, PEP-CTERM system associated [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.595995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCG CCGGGCAGAT GATCCGCCTG TTCAAACACT ATATACCGCA TTCGGTCCTG CTGCTGGGGC TGCTGGATTT CATCCTGCTG CTCGGCGCGG GCGAGATCGG CTGGCAGCTT CGCGCCCACC AGATCGGCAT CGATTCCGGC CAGTTCGGCA TCCGCCTGAC GCCGCTGCTG CTGTTCGCGG GGCTGGTGCA GACCGCGATG ATCGCGGTCG GCGTCTATGG TTCGGATGCC CTGCGTTCGA TGCGCTATGC GACCGCGCGC CTGCTGGTCG CCGTAAGCCT TGGCATCATT GCGCTGTCGG TCGTCTATTT CATGCTGCCG GGGCGGACGT TGTGGCGTTC GAATCTGCTC TACGCGATGT TCCTGGCCAT GGGGATGCTC GTGCTCATCC GCCTCCTTCT GGGCGGATTG CTGGGCACGT CGGCCTTCCG CAGGCGCGTC CTGGTCCTTG GCGCGGGAGC GCGGGCCGAA AGGCTGCGCA AGCTCGGAGA GAGGCCTGAA GCCGGTTTCG CCATCGTCGG CTACATCGGC ATGAGCAGCG CTGCACCGAC GGTCGTCGAG GCGATACATC GCGATGCGAT CAACAACCTG ACGCGCTACG TCGAGAACCT TGGCGTCAGC GAGGTGGTCC TCGCGCTCGA GGAGCGGCGG AACGCCTTGC CGCTCAAGGA TCTTCTCAGG ATCAAGACCA CCGGCGTCCA CGTCAACGAC TTCTCCTCCT TCATGGAGCG AGAGACGGGC CGCGTGGACC TCGACACGGT CAATCCGAGC TGGCTGATCT TTTCAGACGG GTTCTCATCG GGCAGGGCGC TGTCGAGCGT GGCAAAGCGC ATCTTCGACA TTGGCGCGAG CCTGCTGCTG CTTGTCGCCA CGTTCCCGGT CATCCTCCTG TTCGCGATGC TGGTGAAGCT CGACAGCAAG GGCCCTGCGT TCTTCCGCCA GACGCGCGTT GGCCTCTACG GTCAGCCGTT CGACCTCATC AAGCTGCGTT CGATGCGCAT GGATGCGGAA GCCAACGGGG CGCAGTTCGC GCAGAAGGAC GATCCTCGCG TGACCCGCAT CGGCCGGATC ATCCGCAAGC TGCGGATCGA TGAACTGCCG CAGGCCTGGA CGGTGCTGAA AGGCGAGATG AGCTTCGTCG GGCCGCGCCC GGAACGTCCC GAGTTCGTGG CCGACCTGGA AGACAAGCTG CCTTATTATG CCGAGCGCCA CATGGTGAAG CCAGGCATCA CTGGCTGGGC GCAGATCAAC TACCCCTATG GCGCGTCCAT CGAGGATTCA CGGCACAAGC TCGAATACGA CCTCTACTAC GCCAAGAACT ACACCCCCTT TCTCGATCTC CTGATCCTGC TCCAGACCTT GCGCGTCGTG CTGTGGCACG AAGGCGCGCG GTGA
|
Protein sequence | MASAGQMIRL FKHYIPHSVL LLGLLDFILL LGAGEIGWQL RAHQIGIDSG QFGIRLTPLL LFAGLVQTAM IAVGVYGSDA LRSMRYATAR LLVAVSLGII ALSVVYFMLP GRTLWRSNLL YAMFLAMGML VLIRLLLGGL LGTSAFRRRV LVLGAGARAE RLRKLGERPE AGFAIVGYIG MSSAAPTVVE AIHRDAINNL TRYVENLGVS EVVLALEERR NALPLKDLLR IKTTGVHVND FSSFMERETG RVDLDTVNPS WLIFSDGFSS GRALSSVAKR IFDIGASLLL LVATFPVILL FAMLVKLDSK GPAFFRQTRV GLYGQPFDLI KLRSMRMDAE ANGAQFAQKD DPRVTRIGRI IRKLRIDELP QAWTVLKGEM SFVGPRPERP EFVADLEDKL PYYAERHMVK PGITGWAQIN YPYGASIEDS RHKLEYDLYY AKNYTPFLDL LILLQTLRVV LWHEGAR
|
| |