Gene Saro_3687 details

Gene Information

Locus tag: Saro_3687
Symbol:
ID: 5077835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 321553
End bp: 322593
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 66%
IMG OID: 640481410
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_001166072
Protein GI: 146275912
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
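
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the record states neither convention explicitly, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 321553
end_bp = 322593

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1041
print(protein_length)  # 346
```

Under these assumptions the computed values agree with the reported Gene Length (1041 bp) and Protein Length (346 aa).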


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCGA ATTTCAACGT GGAAGCGGGC GACAAGCTCT ACATCCAGGA CGTCACCCTG 
CGCGACGGCA TGCACGCGGT GCGCCACATG TACGGCATCG ACCATGTCCG CTCGATCGCG
TCCGCGCTCG ACAAGGCCGG CGTCGATGCG ATCGAGGTCG CCCACGGTGA CGGCCTTTCG
GGAGCCAGCT TCAACTACGG CTTCGGCGCC CACACCGACT GGGAATGGCT GGAGGCCGTG
GCCGACGTGC TGGAGAAGAG CGTCCTCACC ACGCTCATCC TTCCCGGCGT CGGCACCGTC
GAGGAACTGC GCCGCGCCTA TGACATCGGC GTCCGCTCGG TCCGCGTCGC GACCCACTGC
ACCGAGGCCG ACGTCAGCAA GCAGCACATC GGCATCGCCC GCGATCTCGG CATGGACGTG
TCGGGCTTCC TGATGATGAG CCACATGATC GAACCCGAAG CGCTGGCGCA GCAGGCATCG
CTGATGGAAA GCTACGGCGC GCAATGCGTC TATGTCACCG ACAGCGGCGG CGCGCTCGAC
ATGGACGGCG TGAAGGCCCG CCTCGAAGCC TATGACCGCG TACTCAAGCC AGAAACCCAG
CGCGGCATCC ACGCCCACCA CAACCTCGCG CTCGGCGTCG CTAACTCGAT CGTCGCGGCG
CAATGTGGCG CGGTGCGCAT CGACGCCTCG CTGACCGGCA TGGGTGCGGG TGCGGGCAAT
GCGCCGCTCG AAGTTTTCAT CGCCGCCGCC GACCGCAAGG GCTGGAACCA CGGCTGCGAC
GTGATGATGC TGATGGACGC GGCCGAAGAT CTCGTCCGGC CGCTGCAGGA CCGCCCGGTC
CGCGTCGACC GCGAGACTTT GGCGCTCGGC TATGCGGGGG TCTACTCCAG CTTCCTGCGT
CACGCCGAGA AGGCGGCTGA GACCTATGGC CTCGATACGC GCACGATCCT CGTCGAACTG
GGTCGCCGCA AGATGGTCGG CGGCCAGGAA GACATGATCG TCGACGTCGC GCTCGACATG
CTCAAGGAAC AGCAGGCCTG A
 
Protein sequence
MTSNFNVEAG DKLYIQDVTL RDGMHAVRHM YGIDHVRSIA SALDKAGVDA IEVAHGDGLS 
GASFNYGFGA HTDWEWLEAV ADVLEKSVLT TLILPGVGTV EELRRAYDIG VRSVRVATHC
TEADVSKQHI GIARDLGMDV SGFLMMSHMI EPEALAQQAS LMESYGAQCV YVTDSGGALD
MDGVKARLEA YDRVLKPETQ RGIHAHHNLA LGVANSIVAA QCGAVRIDAS LTGMGAGAGN
APLEVFIAAA DRKGWNHGCD VMMLMDAAED LVRPLQDRPV RVDRETLALG YAGVYSSFLR
HAEKAAETYG LDTRTILVEL GRRKMVGGQE DMIVDVALDM LKEQQA
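
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database or any particular library.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as a short demonstration.
first_line = "ATGACATCGA ATTTCAACGT GGAAGCGGGC GACAAGCTCT ACATCCAGGA CGTCACCCTG"
dna = clean_sequence(first_line)

print(len(dna))                   # 60
print(round(gc_percent(dna), 1))  # GC percentage of this one line only
```

Passing the complete gene sequence through the same two helpers should give a value comparable to the GC content field reported in the record, assuming that field was computed over the gene sequence alone.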