Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0101 |
Symbol | |
ID | 3915987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 104030 |
End bp | 105130 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640442826 |
Product | myo-inositol-1-phosphate synthase |
Protein accession | YP_495384 |
Protein GI | 87198127 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1260] Myo-inositol-1-phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCAA TCAAGGTCGC CGTGATCGGC GTGGGCAACT GCGCTAGCTC GCTGGTGCAG GGGGTTGCCT ACTATCGCAA CAACAACTCG TCGCAGGGCC TCATCCATGA CCGGATCGGC GGCTATGGCG CGGGTGACGT GGACTTCGTG CTCGGCGTCG ACGTCGATGC GCGCAAGGTC GGCAAGGACA TCGCCGAGGC GATCTTCGCC GCGCCGAACA ACACCACCGT GTTCCAGCCC AACGTTCCGC CGACCGGCGC GAAGGTCATC ATGGGCCGCG TCTCGGACGG CGTCGCCCCG CACATGACCA CCGTTGGCGA CAAGGGCTTC ATCGTTTCCG ATCAGCCCGA GGCCACCCAG GCCGACATCG TCAAGGCGCT GAAGGATTCG GGCGCGGAAG TTCTCCTCAA CTTCCTCCCC GTCGGTTCGC AGAACGCCAC CGAATTCTAC ATGGAATGCG CGCTTGAAGC CGGCGTTGCG GTCGTCAACT GCATGCCCGT GTTCATCGCA TCGACCCCGG AATGGGAAGC GAAGTTCCGC GAGAAGCGCA TCCCGATCGT CGGCGACGAC ATCAAGGCGC AGGTCGGCGC CACGATCGTC CACCGCGTCC TGTCGAGCCT GTTCGCCGCC CGCGGCGTGA ACGTCGAGCG CACCTACCAG CTCAACACCG GCGGCAACAC CGACTTCATG AACATGCTCG ACCGCCAGCG TCTGGGCAGC AAGAAGGAAT CGAAGACCGA GGCAGTGCAG GCCATGCTTG CCCAGCGCCT CGACGACGAG AACATCCACG TCGGCCCGTC GGACTATGTT CCTTGGCAGA AGGACAACAA GCTGTGCTTC CTCCGTCTGG AAGGCGCGCA GTGGGGCAAC GTGCCCATGA ATCTCGAGCT TCGTCTCTCG GTCGAGGACA GCCCGAACTC CGCAGCTTGC GTCATGGACG CGATCCGTTG CTGCAAGGTT GCGCTGGACC GCGGTGAAGG CGGTGCGCTG ATCGGCCCGT CGGCCTACTT CTGCAAGCAC CCGCCGCAGC AGTTCAACGA CGACGTCGCC GCGCAGATGG TCGAGGAATA TGCCTCGGTC GAAAAGCTGG CCGCCGAATA A
|
Protein sequence | MKPIKVAVIG VGNCASSLVQ GVAYYRNNNS SQGLIHDRIG GYGAGDVDFV LGVDVDARKV GKDIAEAIFA APNNTTVFQP NVPPTGAKVI MGRVSDGVAP HMTTVGDKGF IVSDQPEATQ ADIVKALKDS GAEVLLNFLP VGSQNATEFY MECALEAGVA VVNCMPVFIA STPEWEAKFR EKRIPIVGDD IKAQVGATIV HRVLSSLFAA RGVNVERTYQ LNTGGNTDFM NMLDRQRLGS KKESKTEAVQ AMLAQRLDDE NIHVGPSDYV PWQKDNKLCF LRLEGAQWGN VPMNLELRLS VEDSPNSAAC VMDAIRCCKV ALDRGEGGAL IGPSAYFCKH PPQQFNDDVA AQMVEEYASV EKLAAE
|
| |