Gene Saro_3559 details

Gene Information

Locus tag: Saro_3559
Symbol:
ID: 5077708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 175909
End bp: 176922
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 66%
IMG OID: 640481283
Product: transketolase, central region
Protein accession: YP_001165945
Protein GI: 146275785
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
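
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical, not fields of any database API.

```python
# Values copied from the record above.
start_bp = 175909
end_bp = 176922
gene_length_bp = 1014
protein_length_aa = 337

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, codon_count, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.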


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00351586
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA TGATGTACCG CGACGCCGTC GTCTCGACGA TCGCAGAGGA AATGGAGCGC 
GACGAGAACG TCGTCATGCT CGGTGAAGAC ATTGTCGGCG GCATGGGCAC GCCGGGTGGG
CCCGAGGCCA TCGGCGGCAT CTGGTCGACC TCCACCGGCC TGTTCGGCAA GTTCGGCGCG
GACCGCGTGA TCGACACGCC GATCTCGGAA AGCGCGATCA TGGGCGCGGC GGCAGGCCTT
GCGCTTTCGG GCAAGCGCCC GATCGCCGAG CTGATGTTCG CCGACTTCAT CGGCGTTTCC
CTCGACCAGA TCTGGAACCA GCTCGCCAAG TTCCGCTACA TGTTCGGCGG CAAGACCAAG
TGCCCGGCAG TGATCCGCAT GGCCTATGGC GCGGGCTACA ACGCCGCCGC GCAGCATAGC
CAGGCGGTCC ACCAGATCCT GACCGGCATG CCGGGCCTCA AGGTGGTCAT GCCGACCACG
CCTGCCGACG TGAAGGGCCT GCTGCGCACC GCGATCCGCG ACGACGATCC GGTGATCTTC
CTCGAGCACA AGGCGCTCTA CGGCGTTTCC GGCGAAGTGC CCGACGATCC GGACTTCATG
ATCCCGTTCG GCCACGCCCG CCTTTCGCGC GCCGGCCAGG ACGTGACGAT CGTCTCGACC
GGCCTGCTGC TGGGATTCTG CGAGGCGGTG GCCGACAAGC TTGCCGCCGA GGGCATCGGC
TGCGACGTGA TCGACCTGCG CACCACCAGC CCGATCGACG AGGAAACGAT CCTCGATTCG
GTCGAGGTGA CCGGCCGCCT CGTCGTCGTC GACGAAGCGC CGCCCCGGTG CAGCCTTGCG
TCCGACATCT GTGCGACAGT TGCCGAAAAG GGCTTCGCCG CGCTCAAGGC TCCGCCGCAG
GCGGTCAACC CGCCACACAC CCCGATCCCG TTCGCGCGTG AGCTGGAATC TGCCTACCTT
CCTTCGGTCG ACAAGATCGA AGCGGCGGTG CGCAAGGTTC TGGCTTACCG CTGA
 
Protein sequence
MAKMMYRDAV VSTIAEEMER DENVVMLGED IVGGMGTPGG PEAIGGIWST STGLFGKFGA 
DRVIDTPISE SAIMGAAAGL ALSGKRPIAE LMFADFIGVS LDQIWNQLAK FRYMFGGKTK
CPAVIRMAYG AGYNAAAQHS QAVHQILTGM PGLKVVMPTT PADVKGLLRT AIRDDDPVIF
LEHKALYGVS GEVPDDPDFM IPFGHARLSR AGQDVTIVST GLLLGFCEAV ADKLAAEGIG
CDVIDLRTTS PIDEETILDS VEVTGRLVVV DEAPPRCSLA SDICATVAEK GFAALKAPPQ
AVNPPHTPIP FARELESAYL PSVDKIEAAV RKVLAYR
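
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the sketch builds the standard codon table. The function and variable names are hypothetical, and only the first and last rows of the gene sequence are translated here to keep the example short.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence above; each full row holds 60 bases,
# so every row starts on a codon boundary.
first_row = "ATGGCAAAAA TGATGTACCG CGACGCCGTC GTCTCGACGA TCGCAGAGGA AATGGAGCGC"
last_row = "CCTTCGGTCG ACAAGATCGA AGCGGCGGTG CGCAAGGTTC TGGCTTACCG CTGA"

print(translate(first_row))  # MAKMMYRDAVVSTIAEEMER
print(translate(last_row))   # PSVDKIEAAVRKVLAYR
```

The two outputs match the first 20 and last 17 residues of the protein sequence listed above. Passing the full gene sequence to the same function translates the complete coding region.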