Gene Saro_0101 details

Gene Information

Locus tag: Saro_0101
Symbol:
ID: 3915987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 104030
End bp: 105130
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 64%
IMG OID: 640442826
Product: myo-inositol-1-phosphate synthase
Protein accession: YP_495384
Protein GI: 87198127
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
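
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(104030, 105130)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results agree with the Gene Length and Protein Length fields of this record.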


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCAA TCAAGGTCGC CGTGATCGGC GTGGGCAACT GCGCTAGCTC GCTGGTGCAG 
GGGGTTGCCT ACTATCGCAA CAACAACTCG TCGCAGGGCC TCATCCATGA CCGGATCGGC
GGCTATGGCG CGGGTGACGT GGACTTCGTG CTCGGCGTCG ACGTCGATGC GCGCAAGGTC
GGCAAGGACA TCGCCGAGGC GATCTTCGCC GCGCCGAACA ACACCACCGT GTTCCAGCCC
AACGTTCCGC CGACCGGCGC GAAGGTCATC ATGGGCCGCG TCTCGGACGG CGTCGCCCCG
CACATGACCA CCGTTGGCGA CAAGGGCTTC ATCGTTTCCG ATCAGCCCGA GGCCACCCAG
GCCGACATCG TCAAGGCGCT GAAGGATTCG GGCGCGGAAG TTCTCCTCAA CTTCCTCCCC
GTCGGTTCGC AGAACGCCAC CGAATTCTAC ATGGAATGCG CGCTTGAAGC CGGCGTTGCG
GTCGTCAACT GCATGCCCGT GTTCATCGCA TCGACCCCGG AATGGGAAGC GAAGTTCCGC
GAGAAGCGCA TCCCGATCGT CGGCGACGAC ATCAAGGCGC AGGTCGGCGC CACGATCGTC
CACCGCGTCC TGTCGAGCCT GTTCGCCGCC CGCGGCGTGA ACGTCGAGCG CACCTACCAG
CTCAACACCG GCGGCAACAC CGACTTCATG AACATGCTCG ACCGCCAGCG TCTGGGCAGC
AAGAAGGAAT CGAAGACCGA GGCAGTGCAG GCCATGCTTG CCCAGCGCCT CGACGACGAG
AACATCCACG TCGGCCCGTC GGACTATGTT CCTTGGCAGA AGGACAACAA GCTGTGCTTC
CTCCGTCTGG AAGGCGCGCA GTGGGGCAAC GTGCCCATGA ATCTCGAGCT TCGTCTCTCG
GTCGAGGACA GCCCGAACTC CGCAGCTTGC GTCATGGACG CGATCCGTTG CTGCAAGGTT
GCGCTGGACC GCGGTGAAGG CGGTGCGCTG ATCGGCCCGT CGGCCTACTT CTGCAAGCAC
CCGCCGCAGC AGTTCAACGA CGACGTCGCC GCGCAGATGG TCGAGGAATA TGCCTCGGTC
GAAAAGCTGG CCGCCGAATA A
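
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming GC content means the percentage of G and C bases over the whole sequence; how this page derives or rounds its own figure is not stated. The helper `gc_content` is a hypothetical name, and it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (block separators, line breaks) is removed first.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases from the gene sequence above: 2 G + 2 C out of 10.
print(gc_content("ATGAAGCCAA"))  # 40.0
```

Passing the full gene sequence instead of the first block gives the value to compare against the GC content field.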
 
Protein sequence
MKPIKVAVIG VGNCASSLVQ GVAYYRNNNS SQGLIHDRIG GYGAGDVDFV LGVDVDARKV 
GKDIAEAIFA APNNTTVFQP NVPPTGAKVI MGRVSDGVAP HMTTVGDKGF IVSDQPEATQ
ADIVKALKDS GAEVLLNFLP VGSQNATEFY MECALEAGVA VVNCMPVFIA STPEWEAKFR
EKRIPIVGDD IKAQVGATIV HRVLSSLFAA RGVNVERTYQ LNTGGNTDFM NMLDRQRLGS
KKESKTEAVQ AMLAQRLDDE NIHVGPSDYV PWQKDNKLCF LRLEGAQWGN VPMNLELRLS
VEDSPNSAAC VMDAIRCCKV ALDRGEGGAL IGPSAYFCKH PPQQFNDDVA AQMVEEYASV
EKLAAE
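
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block separators and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAGCCAA TCAAGGTCGC CGTGATCGGC"))  # MKPIKVAVIG
```

The ten residues printed match the first block of the protein sequence. Translating the full gene sequence and comparing the result with the 366 listed residues extends the same check to the whole record.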