Gene Saro_1809 details

Gene Information

Locus tag: Saro_1809
Symbol: pgi
ID: 3918368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1907699
End bp: 1909222
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 67%
IMG OID: 640444550
Product: glucose-6-phosphate isomerase
Protein accession: YP_497083
Protein GI: 87199826
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0166] Glucose-6-phosphate isomerase
TIGRFAM ID:
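
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions about the database's conventions, not something the record states.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1907699
end_bp = 1909222
gene_length_bp = 1524
protein_length_aa = 507

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1524 bp spans 508 codons, of which 507 encode amino acids.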


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.850311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTTG CCGAAGCGGA AACGTACTGG ACGGCGCTGG CCGGGCTGCC GCGCCCGACG 
CTCAAGGAGC TGTTCTCCGA TGCCGGCCGT CTCGATCGCT ACGCCGCTAC GCTCGACCTG
CCGGGCGGCC CGATCCGCTT CGACTGGTCC AAGACGCACC TCTCCGCCGA AGTGGAAGCG
GTGTTCGCCG CGCTTGCATC GGCGATGGAC TTCGAAGGCC GCCGCGCCGC GCTGATCGAG
GGCGCGAAAA TCAACAACAC CGAAGGCCGC GCGGCCGAGC ACACCGCTCA GCGCGGCATC
GGCAACGAGG CCAGCGTCGA GGAAGCCGAG GCGCTCCACG CCCGCATGCG CATGCTGGTC
GACGCGATCC ACGCCGGTGC GCTTGGCGAA GTGCGCAGCC TGATCCACAT TGGCATCGGC
GGTTCGGCGC TCGGGCCGGC GCTGGCGATC GACGCACTGA CCCGCGACGG CGCGAAGGTG
GCCGTCCACG TCGTGTCGAA CATCGACGGC TGCGCGCTCG AAGCCGCTAT GAAGGCCTGC
GATCCGGCAA CGACGATGAT CGCCGTTGCC TCCAAGACCT TCACCACGAC CGAGACGATG
ACCAACGCTG CCTCGGCGCT TGAATGGCTG CGCGAGAACG GCGTTGCCGA TCCCTATGGC
CAGGTCGTTG CGCTCACCGC CGCGCCCGAG AAGGCGGTCG AGTGGGGCGT CGACGAAACC
CGCGTCCTGC CGTTCTCCGA AACCGTGGGC GGGCGTTACT CGCTGTGGTC GTCGATCGGT
TTCCCGGTCG CGATGGCGCT GGGGTGGGAA GGCTTCGCCG CGTTCCTCGA CGGTGCGGCG
GCTATCGATC GCCATTTCAT CGACGCAGAC CTGGCCGGCA ACGTCGTCGT TCGCGCCGCC
TTTGCCGATC TCTATTACAC TCAGGTTCGC GGGTGCCAGA CGCGTGCGGT CTTCGCCTAT
GACGAACGCC TCGCGCTTCT GCCGGACTAT CTCCAGCAGC TCGAAATGGA ATCAAACGGC
AAGCGCGTCC TCGCCGATGG CTCTCCACTT ACACGGCCAA GCGCGCCGGT TACCTGGGGC
GGCGTCGGGA CCGATGCACA GCATGCCGTG TTCCAGCTCC TGCACCAGGG TACGCACTTG
ATTCCGGTCG ATTTCCTTGC CGTCAAGACG CAGGGCCACG ACCTCGACCC GGCGCATCAC
CAGATCCTGC TTTCCAACTG CTTCGCCCAG GGTGCTGCGC TCATGGCCGG CAAGGCGAGC
GATGACGGCG CGCGTGCCTA TCCCGGCGAC CGTCCTTCCG CGACGATCCT GTGCGACGAT
CTCAACCCCG CGACGCTCGG CGCGCTGATC GCCTTCCACG AGCATCGCAC GTTCGTCTCT
GCGGTGATGC TCGGCATCAA TCCCTTCGAC CAGTTCGGCG TCGAACTGGG CAAGGCCATT
GCCAAGCAGA TCGAGTCTGG CGGCGGCGAA GGCTTCGATC CGTCGACCGA AGCACTCCTG
GCAGCGGTTG GCCTCGCCGG CTGA
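
The gene sequence above is printed in blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the hypothetical helpers `clean_sequence` and `gc_percent` below (names chosen for illustration, not part of any database tool) strip the whitespace and compute GC content. Only the first line of the sequence is used as a sample; pasting the full sequence in its place should give a value that can be compared with the GC content field of this record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = clean_sequence(
    "ATGACGGTTG CCGAAGCGGA AACGTACTGG ACGGCGCTGG CCGGGCTGCC GCGCCCGACG"
)
print(len(sample), round(gc_percent(sample), 1))
```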
 
Protein sequence
MTVAEAETYW TALAGLPRPT LKELFSDAGR LDRYAATLDL PGGPIRFDWS KTHLSAEVEA 
VFAALASAMD FEGRRAALIE GAKINNTEGR AAEHTAQRGI GNEASVEEAE ALHARMRMLV
DAIHAGALGE VRSLIHIGIG GSALGPALAI DALTRDGAKV AVHVVSNIDG CALEAAMKAC
DPATTMIAVA SKTFTTTETM TNAASALEWL RENGVADPYG QVVALTAAPE KAVEWGVDET
RVLPFSETVG GRYSLWSSIG FPVAMALGWE GFAAFLDGAA AIDRHFIDAD LAGNVVVRAA
FADLYYTQVR GCQTRAVFAY DERLALLPDY LQQLEMESNG KRVLADGSPL TRPSAPVTWG
GVGTDAQHAV FQLLHQGTHL IPVDFLAVKT QGHDLDPAHH QILLSNCFAQ GAALMAGKAS
DDGARAYPGD RPSATILCDD LNPATLGALI AFHEHRTFVS AVMLGINPFD QFGVELGKAI
AKQIESGGGE GFDPSTEALL AAVGLAG
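
The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It builds the codon table from the amino acid string used for NCBI translation table 11, which assigns the same amino acids as the standard code, and stops at the first stop codon. The `translate` function is a hypothetical helper written for this example. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest), paired with
# the amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGACGGTTG CCGAAGCGGA AACGTACTGG"))  # MTVAEAETYW
print(translate("GCCGGCTGA"))                         # AG
```

Both fragments match the corresponding ends of the protein sequence listed above: it starts with MTVAEAETYW and ends with AG, with the final TGA codon acting as the stop.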