Gene Saro_2335 details


Gene Information

Locus tag: Saro_2335
Symbol:
ID: 3915680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2482011
End bp: 2483528
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 69%
IMG OID: 640445091
Product: lipopolysaccharide biosynthesis
Protein accession: YP_497606
Protein GI: 87200349
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
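
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 2482011
end_bp = 2483528
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 1518 0 505
```

Under these assumptions the reported gene length (1518 bp) and protein length (505 aa) agree with the coordinates.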


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.104875
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGCA TCTACGAGGA AGTGCGCATC GCGCTCCACG GCATATGGCA CCGGCGCTGG 
ATCGCGCTCG GCCTGGCCTG GGGCGTGTGC CTGCTCGGTT GGCTGGCGGT GGCGATGGTG
CCGAACAGCT ATGAATCGAA GGCCCGCATC TTCGTCCAGA TCGACGACGT GCTGGCCGAT
CAGATCGGCA TCGGCGGCGA CCGCAAGCGC GACATCGAGC GCGTGCGCCA GACGCTGACC
AGCGCGGTCA ACCTGGAGAA GGTCATCCGT GCAACCCGGC TGGGCGACAA GGTGACCAGC
GACAAGCAGA TGCAGGCGGC GGTCGAGAGC CTGGGCAAGC ACGTGACCGT CGTCAGCCAG
CAGGACAACC TGTTCGAGAT CACCGGGACG GCCGGTGGCG GCGGGCGCAA CGATGCCGAG
AACGCCCGGC TGGCGCAGGA CATCGTCCAG AAGATGATCG ACATCTTCCG CGAGGAAAAC
CTGGCCGGCG GGCGCGGCGA GATGACCGAT ACGCTGGCCT TCATGGACCA GCAGCTCACC
GACCGGAAGA AGGCGCTGGA AGAGGCCGAG CTCAAGCGCA CCGAGTTCGA ATCGAAGAAC
GCCGGCTTCA TCCCCGGCGC GGGGTCGCTC ACCACCCGGC TCGAGGCGGC GCGCAACCAG
ATGCGCGACG TCGAATCGAA CCTGCTGGCG GCGCAGAGCG CGCTCGCCTC GATCAGCGGC
CAGCTTTCGG GCACGCCGCA GACCATCGCC ATTCCGGGCG TGTCGGGTGG CGCCAAGGGT
GCCCTGGCGC AGGCGCAGGC CGACCTTGGC GCGATGCGCG CGCGCGGGCT TACCGACAGC
CATCCGGACG TGATTTCCGC CAAGGCCCAG ATCGCCAACC TGCAGAAGGC GGCGGCGGGC
GAGGGTGCTG GCGGCGGTAC GCCCAATCCG GCCTACAGCT CGCTGCTTTC GATCAAGGCC
GATCGCGAGG CGAGCGTGGT CTCGCTCCAG TCGCGCAAGG CCTCGCTCCA GGCCGAGATC
GCCCAGCTCA CCGCGCAGCA GTACAGCGAA CCGACCGTGG CCGCCGAAGC CGCGCGCATC
AACCGCGACT ACGACGTGCT CAAGGAGCAG TACGACAAGC TCCTGCGCGA CCGCGAGCAG
CTGCGCCTGC GCGGCCAGGT CGAGACGGAG CGCGACGCGG TGAAGTTCGA GGTGATCGAC
CCGCCGACGA TGCCGCGCGG GCCTGCGGCC CCCGACCGGC CGCTGCTGCT GTTCCTCGTG
CTGATCGTCG GCGTGGGCGC GGGTTGCGGC GGGGCATTTG CCTTCGGCCA GCTCAGGTCC
GCCTATTCGA CCACGGGGCA GCTCGAACGC GCGACCGGGC TTCCGGTCCT CGGCGGAATT
TCCGAGGCCA TCACCCGCAC CGCAAGGGCT GAGCGCGCAC GCCGCCTCAA GTGGTTCGCG
GGTGGTGTGG CAGGCCTGTT CGTCGTCCTC TTCGTCCTCC TCGGCGTCGA GATGCTGAAG
CGCGGCATGG TGGCCTGA
 
Protein sequence
MTSIYEEVRI ALHGIWHRRW IALGLAWGVC LLGWLAVAMV PNSYESKARI FVQIDDVLAD 
QIGIGGDRKR DIERVRQTLT SAVNLEKVIR ATRLGDKVTS DKQMQAAVES LGKHVTVVSQ
QDNLFEITGT AGGGGRNDAE NARLAQDIVQ KMIDIFREEN LAGGRGEMTD TLAFMDQQLT
DRKKALEEAE LKRTEFESKN AGFIPGAGSL TTRLEAARNQ MRDVESNLLA AQSALASISG
QLSGTPQTIA IPGVSGGAKG ALAQAQADLG AMRARGLTDS HPDVISAKAQ IANLQKAAAG
EGAGGGTPNP AYSSLLSIKA DREASVVSLQ SRKASLQAEI AQLTAQQYSE PTVAAEAARI
NRDYDVLKEQ YDKLLRDREQ LRLRGQVETE RDAVKFEVID PPTMPRGPAA PDRPLLLFLV
LIVGVGAGCG GAFAFGQLRS AYSTTGQLER ATGLPVLGGI SEAITRTARA ERARRLKWFA
GGVAGLFVVL FVLLGVEMLK RGMVA
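
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is written in the usual NCBI ordering (bases T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGACCAGCA TCTACGAGGA AGTGCGCATC GCGCTCCACG GCATATGGCA CCGGCGCTGG"
dna = clean(first_row)

print(len(dna))        # prints: 60
print(translate(dna))  # prints: MTSIYEEVRIALHGIWHRRW
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein.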