Gene Information |
Locus tag | Saro_2335 |
Symbol | |
ID | 3915680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2482011 |
End bp | 2483528 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445091 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_497606 |
Protein GI | 87200349 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
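The coordinates and lengths in the table above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the reported 1518 bp implies this) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2482011, 2483528)
print(length)                  # 1518
print(protein_length(length))  # 505
```

Both values match the Gene Length and Protein Length rows of the record.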
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.104875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCA TCTACGAGGA AGTGCGCATC GCGCTCCACG GCATATGGCA CCGGCGCTGG ATCGCGCTCG GCCTGGCCTG GGGCGTGTGC CTGCTCGGTT GGCTGGCGGT GGCGATGGTG CCGAACAGCT ATGAATCGAA GGCCCGCATC TTCGTCCAGA TCGACGACGT GCTGGCCGAT CAGATCGGCA TCGGCGGCGA CCGCAAGCGC GACATCGAGC GCGTGCGCCA GACGCTGACC AGCGCGGTCA ACCTGGAGAA GGTCATCCGT GCAACCCGGC TGGGCGACAA GGTGACCAGC GACAAGCAGA TGCAGGCGGC GGTCGAGAGC CTGGGCAAGC ACGTGACCGT CGTCAGCCAG CAGGACAACC TGTTCGAGAT CACCGGGACG GCCGGTGGCG GCGGGCGCAA CGATGCCGAG AACGCCCGGC TGGCGCAGGA CATCGTCCAG AAGATGATCG ACATCTTCCG CGAGGAAAAC CTGGCCGGCG GGCGCGGCGA GATGACCGAT ACGCTGGCCT TCATGGACCA GCAGCTCACC GACCGGAAGA AGGCGCTGGA AGAGGCCGAG CTCAAGCGCA CCGAGTTCGA ATCGAAGAAC GCCGGCTTCA TCCCCGGCGC GGGGTCGCTC ACCACCCGGC TCGAGGCGGC GCGCAACCAG ATGCGCGACG TCGAATCGAA CCTGCTGGCG GCGCAGAGCG CGCTCGCCTC GATCAGCGGC CAGCTTTCGG GCACGCCGCA GACCATCGCC ATTCCGGGCG TGTCGGGTGG CGCCAAGGGT GCCCTGGCGC AGGCGCAGGC CGACCTTGGC GCGATGCGCG CGCGCGGGCT TACCGACAGC CATCCGGACG TGATTTCCGC CAAGGCCCAG ATCGCCAACC TGCAGAAGGC GGCGGCGGGC GAGGGTGCTG GCGGCGGTAC GCCCAATCCG GCCTACAGCT CGCTGCTTTC GATCAAGGCC GATCGCGAGG CGAGCGTGGT CTCGCTCCAG TCGCGCAAGG CCTCGCTCCA GGCCGAGATC GCCCAGCTCA CCGCGCAGCA GTACAGCGAA CCGACCGTGG CCGCCGAAGC CGCGCGCATC AACCGCGACT ACGACGTGCT CAAGGAGCAG TACGACAAGC TCCTGCGCGA CCGCGAGCAG CTGCGCCTGC GCGGCCAGGT CGAGACGGAG CGCGACGCGG TGAAGTTCGA GGTGATCGAC CCGCCGACGA TGCCGCGCGG GCCTGCGGCC CCCGACCGGC CGCTGCTGCT GTTCCTCGTG CTGATCGTCG GCGTGGGCGC GGGTTGCGGC GGGGCATTTG CCTTCGGCCA GCTCAGGTCC GCCTATTCGA CCACGGGGCA GCTCGAACGC GCGACCGGGC TTCCGGTCCT CGGCGGAATT TCCGAGGCCA TCACCCGCAC CGCAAGGGCT GAGCGCGCAC GCCGCCTCAA GTGGTTCGCG GGTGGTGTGG CAGGCCTGTT CGTCGTCCTC TTCGTCCTCC TCGGCGTCGA GATGCTGAAG CGCGGCATGG TGGCCTGA
|
Protein sequence | MTSIYEEVRI ALHGIWHRRW IALGLAWGVC LLGWLAVAMV PNSYESKARI FVQIDDVLAD QIGIGGDRKR DIERVRQTLT SAVNLEKVIR ATRLGDKVTS DKQMQAAVES LGKHVTVVSQ QDNLFEITGT AGGGGRNDAE NARLAQDIVQ KMIDIFREEN LAGGRGEMTD TLAFMDQQLT DRKKALEEAE LKRTEFESKN AGFIPGAGSL TTRLEAARNQ MRDVESNLLA AQSALASISG QLSGTPQTIA IPGVSGGAKG ALAQAQADLG AMRARGLTDS HPDVISAKAQ IANLQKAAAG EGAGGGTPNP AYSSLLSIKA DREASVVSLQ SRKASLQAEI AQLTAQQYSE PTVAAEAARI NRDYDVLKEQ YDKLLRDREQ LRLRGQVETE RDAVKFEVID PPTMPRGPAA PDRPLLLFLV LIVGVGAGCG GAFAFGQLRS AYSTTGQLER ATGLPVLGGI SEAITRTARA ERARRLKWFA GGVAGLFVVL FVLLGVEMLK RGMVA
|
| |
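The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining the blocks, computing GC content, and translating the coding sequence follows. It assumes the listed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between the ten-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
dna = join_blocks("ATGACCAGCA TCTACGAGGA AGTGCGCATC")
print(translate(dna))  # MTSIYEEVRI
```

Passing the full gene sequence through `join_blocks` and then `translate` or `gc_fraction` gives values that can be compared against the Protein sequence and GC content rows.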