Gene Saro_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1968 
Symbol 
ID3917284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2085758 
End bp2087725 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content64% 
IMG OID640444716 
Producttransketolase 
Protein accessionYP_497242 
Protein GI87199985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00341468 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGCG AAACCACCGC GCTGGCGCCC ATGGCCAACG CGATCCGTGC GCTTGCCATG 
GACGCGGTCC AGGCAGCGAA TTCGGGACAC CCTGGCATGC CGATGGGCAT GGCCGATGTC
GCGACGGTGC TGTGGACGCA GTTTCTGAAG CACGATCCCG CCGCGCCGAA GTGGTCGGAT
CGGGACCGCT TCATTCTGTC GGCCGGCCAC GGCTCCATGC TGATCTACGC GCTGCTTCAC
CTGTCGGGCT ACGAAAGCCC GACGATGGAC GACATCCGCA AGTTCCGCCA GCTCGACAGC
GTCTGCGCCG GTCACCCCGA GAACTTCCTT ATTCCCGGCG TTGAATGCAC CACCGGTCCG
CTCGGGCAGG GGCTTGCCAT GGCAGTCGGC TTCGCAATGG CCGAACGCCA TCTTAATGCC
CAGTTCGGCA GCGACCTGGT CGACCACAAG ACGTGGGTCA TTGCTGGCGA CGGCTGCCTC
ATGGAAGGCA TCAACCACGA AGCGGTCGGC CTTGCAGGCA CACTCAAGCT TGGTCGCCTC
AACGTGCTGT GGGACGACAA CAAGATTACC ATTGACGGTG ACACATCGCT TTCGACAAGC
GAGGACATTC TTGGCCGGTA TGCCGCGTCG GGTTGGCACG TCACCTCGTG CGACGGCCAT
GATTTTGCAG ACATCGCACG CGCGCTGGCA GAAGCACAGG CCGATCCGCG CCCGTCGCTT
GTTGCCTGCC GCACCGTGAT CGGAAAGGGT GCGCCCAACA AGCAGGGCGG GCACAACGTC
CATGGCGCAC CGCTCGGTGC GGACGAGATC GCTGCCGCGC GTGAGTATCT TGGCTGGACC
GCGGCTCCGT TCGAAGTCCC GGCCGACATC CTCGCCAACT GGCGCTCCTC GGCAGAAGCC
GGAAAGACCG CGCGAGCCGA ATGGGAAAAG CGGGCTGCAG CCAATCCCAA TGCCGCCGAA
CTGGCCCGCC GCATGGCCGG GGAACTCCCC GCCCAGACCG GTTTCGACGC CTATATCCAG
TCGCTGATCG CCAGCCCGCC CAAGGTCGCC ACCCGCAAGT CGAGCGAGAT GGCTCTCGAG
GCCTTCACCG CGAACGTGCC CGAGATGGTG GGTGGTTCGG CTGACCTCAC CGGCTCGAAC
AACACGAAGA CAAAGTCGAC CGCGCCGTTC ACGCCGGAAA GCTATGACGG ACGCTACGTC
TACTACGGCA TCCGCGAATT CGGCATGGCT GCCGCGATGA ACGGCATGGC GCTGCATGGC
GGGATCATCC CGTATGGCGG CACGTTCCTC GTGTTCTCCG ACTACTGTCG CAACGCCGTG
CGCATGTCTG CGCTCCAGCA CGTGCGAGCG ATCTATGTGT TCACTCACGA TTCGATCGGC
CTTGGCGAAG ACGGCCCGAC CCATCAGCCG GTCGAGCATG TCATGTCGCT GCGCATGATC
CCGAACCTGC TGGTATTCCG TCCTGCGGAT GCCATCGAGA CCGCTGAAGC GTGGGCTATC
GCGCTTGCAA ACAAGGACCG TCCGTCGGTC CTCGCGCTGA CCCGCCAAAA TCTGCCGCCG
GTCCGCTTCG ACGCCGAGAT GAAGAGCGCA AAGGGCGCCT ATCGCCTTGT TGCCGCGCAG
GCCGACCGCA AGGTTGTGCT GCTGGCTACC GGTTCGGAAG TGGAAGTCGC GATCAAGGTC
GCGGCCGAAC TGGAAGCCAA GGGTCTGGGC GCCGACGTCG TTTCGGTTCC ATGCTGGGAA
CTGTTCGACG AACAGGACGC GGCCTACAAG GCTGACCTGC TGCCCGCCGA CGCACTCAAG
GTCTCGGTGG AGGCAGGCGT GACTCTGGGG TGGCAGAAGT ACATCGGCGA TGGCCTGGCC
ATCGGCATCG ACACGTTCGG CGCGTCGGCC CCGGCCGAAG TGCTGTTCGA CCACTTTGGC
CTCACGGCTG AAAAGATTGT CCCGCAGATT CTCGCGCGGG TTTCGTAA
 
Protein sequence
MTRETTALAP MANAIRALAM DAVQAANSGH PGMPMGMADV ATVLWTQFLK HDPAAPKWSD 
RDRFILSAGH GSMLIYALLH LSGYESPTMD DIRKFRQLDS VCAGHPENFL IPGVECTTGP
LGQGLAMAVG FAMAERHLNA QFGSDLVDHK TWVIAGDGCL MEGINHEAVG LAGTLKLGRL
NVLWDDNKIT IDGDTSLSTS EDILGRYAAS GWHVTSCDGH DFADIARALA EAQADPRPSL
VACRTVIGKG APNKQGGHNV HGAPLGADEI AAAREYLGWT AAPFEVPADI LANWRSSAEA
GKTARAEWEK RAAANPNAAE LARRMAGELP AQTGFDAYIQ SLIASPPKVA TRKSSEMALE
AFTANVPEMV GGSADLTGSN NTKTKSTAPF TPESYDGRYV YYGIREFGMA AAMNGMALHG
GIIPYGGTFL VFSDYCRNAV RMSALQHVRA IYVFTHDSIG LGEDGPTHQP VEHVMSLRMI
PNLLVFRPAD AIETAEAWAI ALANKDRPSV LALTRQNLPP VRFDAEMKSA KGAYRLVAAQ
ADRKVVLLAT GSEVEVAIKV AAELEAKGLG ADVVSVPCWE LFDEQDAAYK ADLLPADALK
VSVEAGVTLG WQKYIGDGLA IGIDTFGASA PAEVLFDHFG LTAEKIVPQI LARVS