Gene Saro_2049 details


Gene Information

Locus tag: Saro_2049
Symbol:
ID: 3917696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2189261
End bp: 2190664
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 63%
IMG OID: 640444801
Product: sugar transferase
Protein accession: YP_497322
Protein GI: 87200065
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03013] sugar transferase, PEP-CTERM system associated
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
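
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2189261, 2190664)
print(length, protein_length_aa(length))  # prints "1404 467"
```

Under those assumptions the result agrees with the listed gene length (1404 bp) and protein length (467 aa).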


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.595995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGCG CCGGGCAGAT GATCCGCCTG TTCAAACACT ATATACCGCA TTCGGTCCTG 
CTGCTGGGGC TGCTGGATTT CATCCTGCTG CTCGGCGCGG GCGAGATCGG CTGGCAGCTT
CGCGCCCACC AGATCGGCAT CGATTCCGGC CAGTTCGGCA TCCGCCTGAC GCCGCTGCTG
CTGTTCGCGG GGCTGGTGCA GACCGCGATG ATCGCGGTCG GCGTCTATGG TTCGGATGCC
CTGCGTTCGA TGCGCTATGC GACCGCGCGC CTGCTGGTCG CCGTAAGCCT TGGCATCATT
GCGCTGTCGG TCGTCTATTT CATGCTGCCG GGGCGGACGT TGTGGCGTTC GAATCTGCTC
TACGCGATGT TCCTGGCCAT GGGGATGCTC GTGCTCATCC GCCTCCTTCT GGGCGGATTG
CTGGGCACGT CGGCCTTCCG CAGGCGCGTC CTGGTCCTTG GCGCGGGAGC GCGGGCCGAA
AGGCTGCGCA AGCTCGGAGA GAGGCCTGAA GCCGGTTTCG CCATCGTCGG CTACATCGGC
ATGAGCAGCG CTGCACCGAC GGTCGTCGAG GCGATACATC GCGATGCGAT CAACAACCTG
ACGCGCTACG TCGAGAACCT TGGCGTCAGC GAGGTGGTCC TCGCGCTCGA GGAGCGGCGG
AACGCCTTGC CGCTCAAGGA TCTTCTCAGG ATCAAGACCA CCGGCGTCCA CGTCAACGAC
TTCTCCTCCT TCATGGAGCG AGAGACGGGC CGCGTGGACC TCGACACGGT CAATCCGAGC
TGGCTGATCT TTTCAGACGG GTTCTCATCG GGCAGGGCGC TGTCGAGCGT GGCAAAGCGC
ATCTTCGACA TTGGCGCGAG CCTGCTGCTG CTTGTCGCCA CGTTCCCGGT CATCCTCCTG
TTCGCGATGC TGGTGAAGCT CGACAGCAAG GGCCCTGCGT TCTTCCGCCA GACGCGCGTT
GGCCTCTACG GTCAGCCGTT CGACCTCATC AAGCTGCGTT CGATGCGCAT GGATGCGGAA
GCCAACGGGG CGCAGTTCGC GCAGAAGGAC GATCCTCGCG TGACCCGCAT CGGCCGGATC
ATCCGCAAGC TGCGGATCGA TGAACTGCCG CAGGCCTGGA CGGTGCTGAA AGGCGAGATG
AGCTTCGTCG GGCCGCGCCC GGAACGTCCC GAGTTCGTGG CCGACCTGGA AGACAAGCTG
CCTTATTATG CCGAGCGCCA CATGGTGAAG CCAGGCATCA CTGGCTGGGC GCAGATCAAC
TACCCCTATG GCGCGTCCAT CGAGGATTCA CGGCACAAGC TCGAATACGA CCTCTACTAC
GCCAAGAACT ACACCCCCTT TCTCGATCTC CTGATCCTGC TCCAGACCTT GCGCGTCGTG
CTGTGGCACG AAGGCGCGCG GTGA
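
A GC content figure such as the one reported for this gene can be recomputed from the nucleotide sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base grouped layout used here, so whitespace has to be stripped first. `gc_percent` is a hypothetical helper name, and the call uses only the first 30 bases of the gene as sample input.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First 30 bases of the gene sequence, used only as sample input.
print(round(gc_percent("ATGGCAAGCG CCGGGCAGAT GATCCGCCTG"), 1))
```

Passing the full gene sequence to the same function gives the value to compare with the GC content field.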
 
Protein sequence
MASAGQMIRL FKHYIPHSVL LLGLLDFILL LGAGEIGWQL RAHQIGIDSG QFGIRLTPLL 
LFAGLVQTAM IAVGVYGSDA LRSMRYATAR LLVAVSLGII ALSVVYFMLP GRTLWRSNLL
YAMFLAMGML VLIRLLLGGL LGTSAFRRRV LVLGAGARAE RLRKLGERPE AGFAIVGYIG
MSSAAPTVVE AIHRDAINNL TRYVENLGVS EVVLALEERR NALPLKDLLR IKTTGVHVND
FSSFMERETG RVDLDTVNPS WLIFSDGFSS GRALSSVAKR IFDIGASLLL LVATFPVILL
FAMLVKLDSK GPAFFRQTRV GLYGQPFDLI KLRSMRMDAE ANGAQFAQKD DPRVTRIGRI
IRKLRIDELP QAWTVLKGEM SFVGPRPERP EFVADLEDKL PYYAERHMVK PGITGWAQIN
YPYGASIEDS RHKLEYDLYY AKNYTPFLDL LILLQTLRVV LWHEGAR
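
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for elongation codons. It does not handle table 11's alternative start codons, which does not matter for this gene because it begins with ATG. `translate` and the constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and trimming the final stop."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = (bases[i:i + 3] for i in range(0, len(bases), 3))
    return "".join(CODON_TABLE[codon] for codon in codons).rstrip("*")


# First 30 bases of the gene; matches the first 10 residues of the protein.
print(translate("ATGGCAAGCG CCGGGCAGAT GATCCGCCTG"))  # prints "MASAGQMIRL"
```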