Gene Saro_3218 details


Gene Information

Locus tag: Saro_3218
Symbol:
ID: 3917476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3435376
End bp: 3436779
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 64%
IMG OID: 640446002
Product: sugar transferase
Protein accession: YP_498487
Protein GI: 87201230
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
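
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for such records, but not stated on this page) and that the final codon is a stop codon. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3435376
END_BP = 3436779


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1404
print(protein_length(length))  # 467
```

Both results match the Gene Length (1404 bp) and Protein Length (467 aa) fields.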


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGGC ACATGCCCAT CAGCGAGACG CAGGACGCGC CTCCGCGCCG CCGCATCACG 
CTACCGCTCG CCCCGCCTCT CGAACAGCGG CGCCTGCAGC TCTACATTGC ACTGCTGCTG
CTTGACGGCG CGGCGATCCT CAACGGCTTC TGCATCGCAA GCTGGCTCTA TCTGGGTCGC
TTCCTCGATG AGACTTCGCT GCTGCACAGC CAGGTCATGC TGCCGATCTA CTGGTCGATC
GCATTGTCGC TGCAAGTCTA CACCCTGACT GCACTGCGGC GTCCGAATTT CGCCCGCGCC
CGCGCTGGCC TCTCGCTCAT CGGCGCCGAA ACCGTGCTGC TCTTCGTCGG CTTTGCAACC
AAGAGCACCG ACAATTTTTC GCGCGTGTCA TCCTTGCTGG GTCTGGGCCT GAGCCTCGTC
TTGCTAATGT GGGTCCGCGC CCTTGTCCGC CCGCTGATCA AGGCGCGTTG CGGCGATGCG
GTTACGAATA CCCTGCTGAT CGATGACGGC GGGACGCCGC TGCGGATTCC CCACGCCTAT
CACATTGACG CGCGGGAACA TCACCTGGCC CCCGATCTGT CGGACCCGCA CATGATGGAC
CGGCTCGGGC TTTACATGAT GAACATGGAC CGGGTCATGG TAAGTTGCCC CCACGATCGC
CGGGCCGCGT GGGCACTCGT ATTCAAGAGC GCGAATGTCT CGGGCGAGAT CGTGGACCCG
GAAGTGAACA TGCTTGGCGT ACTGGGTGCA AGGCGCGAAC GCGGCTACGG CGCGCTGATC
GTGGCAAGCG GCCCGCTGGG CCTGCGCGCC CGCGCGGTCA AGCGCTTGCT CGACCTTGCG
CTCGCAGGTG GCGCGGTTCT GGCGCTCGGG CCAGTGCTGC TCCTGGTGGC GGTGCTGATC
AAGCTGGAGG ACGGAGGCCC CGTGCTGTTC ATCCAGAAGC GCACGGGGCG GGGTAACCGC
TTCTTCCCGA TCTTCAAGTT CCGGTCGATG CGCGTGGAAC GCCTCGATTC AACGGGCTCG
CGCTCGGCAA GCAAGGACGA TGACCGTATC ACGCGGATCG GACGCTTCAT ACGAAGCACG
AGCATCGACG AGTTGCCGCA GCTGTTCAAC GTGCTGCGCG GAGAAATGTC CATCGTCGGC
CCACGCCCGC ACGCCATCGG TTCGCTTGCC GGCGAGAAAC TATTCTGGGA AGTGGACCAC
CGCTACTGGC TGCGTCATTC GCTGAAACCC GGCCTTACCG GCCTGGCCCA GGTGCGCGGC
CTTCGCGGTG CGACCGACAC CGAGACGGAC CTTGCCAACC GTCTGCAGGC CGATCTCGAA
TACCTCGACG GGTGGACGAT CTGGCGCGAC CTCAAGATCA TCGTCAACAC TGCGCGCGTG
CTCGTGCACG ACCGAGCCTT CTGA
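
A GC content figure like the one reported in the Gene Information section can be recomputed from a gene sequence such as the one above. The following is a minimal sketch, not the method the database itself uses; `gc_percent` is a hypothetical helper, and it ignores the spaces and line breaks used to display the sequence in blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other non-ACGT characters are skipped, so the
    sequence can be pasted exactly as displayed in blocks of ten.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_percent("ATGACCAGGC"))  # 60.0
```

Passing the full gene sequence to the same function gives the gene-wide value for comparison with the reported percentage.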
 
Protein sequence
MTRHMPISET QDAPPRRRIT LPLAPPLEQR RLQLYIALLL LDGAAILNGF CIASWLYLGR 
FLDETSLLHS QVMLPIYWSI ALSLQVYTLT ALRRPNFARA RAGLSLIGAE TVLLFVGFAT
KSTDNFSRVS SLLGLGLSLV LLMWVRALVR PLIKARCGDA VTNTLLIDDG GTPLRIPHAY
HIDAREHHLA PDLSDPHMMD RLGLYMMNMD RVMVSCPHDR RAAWALVFKS ANVSGEIVDP
EVNMLGVLGA RRERGYGALI VASGPLGLRA RAVKRLLDLA LAGGAVLALG PVLLLVAVLI
KLEDGGPVLF IQKRTGRGNR FFPIFKFRSM RVERLDSTGS RSASKDDDRI TRIGRFIRST
SIDELPQLFN VLRGEMSIVG PRPHAIGSLA GEKLFWEVDH RYWLRHSLKP GLTGLAQVRG
LRGATDTETD LANRLQADLE YLDGWTIWRD LKIIVNTARV LVHDRAF
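
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCAGGC ACATGCCCAT CAGCGAGACG"))  # MTRHMPISET
```

The output matches the first ten residues of the protein sequence listed above.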