Gene Information |
Locus tag | Saro_1968 |
Symbol | |
ID | 3917284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2085758 |
End bp | 2087725 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444716 |
Product | transketolase |
Protein accession | YP_497242 |
Protein GI | 87199985 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
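The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2085758, 2087725)
print(length_bp)                  # 1968
print(protein_length(length_bp))  # 655
```

Both values agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields of this record.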
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00341468 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGCG AAACCACCGC GCTGGCGCCC ATGGCCAACG CGATCCGTGC GCTTGCCATG GACGCGGTCC AGGCAGCGAA TTCGGGACAC CCTGGCATGC CGATGGGCAT GGCCGATGTC GCGACGGTGC TGTGGACGCA GTTTCTGAAG CACGATCCCG CCGCGCCGAA GTGGTCGGAT CGGGACCGCT TCATTCTGTC GGCCGGCCAC GGCTCCATGC TGATCTACGC GCTGCTTCAC CTGTCGGGCT ACGAAAGCCC GACGATGGAC GACATCCGCA AGTTCCGCCA GCTCGACAGC GTCTGCGCCG GTCACCCCGA GAACTTCCTT ATTCCCGGCG TTGAATGCAC CACCGGTCCG CTCGGGCAGG GGCTTGCCAT GGCAGTCGGC TTCGCAATGG CCGAACGCCA TCTTAATGCC CAGTTCGGCA GCGACCTGGT CGACCACAAG ACGTGGGTCA TTGCTGGCGA CGGCTGCCTC ATGGAAGGCA TCAACCACGA AGCGGTCGGC CTTGCAGGCA CACTCAAGCT TGGTCGCCTC AACGTGCTGT GGGACGACAA CAAGATTACC ATTGACGGTG ACACATCGCT TTCGACAAGC GAGGACATTC TTGGCCGGTA TGCCGCGTCG GGTTGGCACG TCACCTCGTG CGACGGCCAT GATTTTGCAG ACATCGCACG CGCGCTGGCA GAAGCACAGG CCGATCCGCG CCCGTCGCTT GTTGCCTGCC GCACCGTGAT CGGAAAGGGT GCGCCCAACA AGCAGGGCGG GCACAACGTC CATGGCGCAC CGCTCGGTGC GGACGAGATC GCTGCCGCGC GTGAGTATCT TGGCTGGACC GCGGCTCCGT TCGAAGTCCC GGCCGACATC CTCGCCAACT GGCGCTCCTC GGCAGAAGCC GGAAAGACCG CGCGAGCCGA ATGGGAAAAG CGGGCTGCAG CCAATCCCAA TGCCGCCGAA CTGGCCCGCC GCATGGCCGG GGAACTCCCC GCCCAGACCG GTTTCGACGC CTATATCCAG TCGCTGATCG CCAGCCCGCC CAAGGTCGCC ACCCGCAAGT CGAGCGAGAT GGCTCTCGAG GCCTTCACCG CGAACGTGCC CGAGATGGTG GGTGGTTCGG CTGACCTCAC CGGCTCGAAC AACACGAAGA CAAAGTCGAC CGCGCCGTTC ACGCCGGAAA GCTATGACGG ACGCTACGTC TACTACGGCA TCCGCGAATT CGGCATGGCT GCCGCGATGA ACGGCATGGC GCTGCATGGC GGGATCATCC CGTATGGCGG CACGTTCCTC GTGTTCTCCG ACTACTGTCG CAACGCCGTG CGCATGTCTG CGCTCCAGCA CGTGCGAGCG ATCTATGTGT TCACTCACGA TTCGATCGGC CTTGGCGAAG ACGGCCCGAC CCATCAGCCG GTCGAGCATG TCATGTCGCT GCGCATGATC CCGAACCTGC TGGTATTCCG TCCTGCGGAT GCCATCGAGA CCGCTGAAGC GTGGGCTATC GCGCTTGCAA ACAAGGACCG TCCGTCGGTC CTCGCGCTGA CCCGCCAAAA TCTGCCGCCG GTCCGCTTCG ACGCCGAGAT GAAGAGCGCA AAGGGCGCCT ATCGCCTTGT TGCCGCGCAG GCCGACCGCA AGGTTGTGCT GCTGGCTACC GGTTCGGAAG TGGAAGTCGC GATCAAGGTC GCGGCCGAAC TGGAAGCCAA GGGTCTGGGC GCCGACGTCG TTTCGGTTCC ATGCTGGGAA CTGTTCGACG AACAGGACGC GGCCTACAAG GCTGACCTGC TGCCCGCCGA CGCACTCAAG 
GTCTCGGTGG AGGCAGGCGT GACTCTGGGG TGGCAGAAGT ACATCGGCGA TGGCCTGGCC ATCGGCATCG ACACGTTCGG CGCGTCGGCC CCGGCCGAAG TGCTGTTCGA CCACTTTGGC CTCACGGCTG AAAAGATTGT CCCGCAGATT CTCGCGCGGG TTTCGTAA
|
Protein sequence | MTRETTALAP MANAIRALAM DAVQAANSGH PGMPMGMADV ATVLWTQFLK HDPAAPKWSD RDRFILSAGH GSMLIYALLH LSGYESPTMD DIRKFRQLDS VCAGHPENFL IPGVECTTGP LGQGLAMAVG FAMAERHLNA QFGSDLVDHK TWVIAGDGCL MEGINHEAVG LAGTLKLGRL NVLWDDNKIT IDGDTSLSTS EDILGRYAAS GWHVTSCDGH DFADIARALA EAQADPRPSL VACRTVIGKG APNKQGGHNV HGAPLGADEI AAAREYLGWT AAPFEVPADI LANWRSSAEA GKTARAEWEK RAAANPNAAE LARRMAGELP AQTGFDAYIQ SLIASPPKVA TRKSSEMALE AFTANVPEMV GGSADLTGSN NTKTKSTAPF TPESYDGRYV YYGIREFGMA AAMNGMALHG GIIPYGGTFL VFSDYCRNAV RMSALQHVRA IYVFTHDSIG LGEDGPTHQP VEHVMSLRMI PNLLVFRPAD AIETAEAWAI ALANKDRPSV LALTRQNLPP VRFDAEMKSA KGAYRLVAAQ ADRKVVLLAT GSEVEVAIKV AAELEAKGLG ADVVSVPCWE LFDEQDAAYK ADLLPADALK VSVEAGVTLG WQKYIGDGLA IGIDTFGASA PAEVLFDHFG LTAEKIVPQI LARVS
|
| |
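The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGACACGCG AAACCACCGC GCTGGCGCCC"))  # MTRETTALAP
```

The printed peptide matches the first block of the protein sequence in this record. Passing the complete gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.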