Gene Saro_3291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3291 
Symbol 
ID3915938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3508428 
End bp3510692 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content65% 
IMG OID640446076 
Productmalic enzyme 
Protein accessionYP_498560 
Protein GI87201303 
COG category[C] Energy production and conversion 
COG ID[COG0280] Phosphotransacetylase
[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.442539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAGG AAAGCAACGT CCGCTTCACC GAGCGCGAGG CGCTTTTCTA CCATAACACG 
ATCCGCCCGG GTAAGATCGA GATCATCGCC TCGAAGCCGA TGGCGACGCA GCGCGATCTC
AGCCTGGCCT ATTCGCCTGG CGTTGCGGTG CCGGTACGCG CGATCGCGGA AGATCCGTCC
ACGGCGTACG ACTATACCGC GAAGGGCAAT CTTGTCGCGG TCATCTCCAA TGGCACCGCA
ATCCTCGGCC TCGGCAATCT GGGCGCGCTT GCTTCCAAGC CGGTGATGGA AGGCAAGGCG
GTGCTGTTCA AGCGCTTTGC CGACGTCGAT TCGATGGACA TCGAACTGGC GACCGAAGAT
GCCGATGCCT TCATCGAGGC GGTCGCCCTG CTCGAACCGA CGTTCGGCGG CATCAACCTT
GAAGACATCA AGGCGCCTGA GTGCTTCATC ATCGAACAGG CGCTGAAGGA GCGCCTCAAG
ATCCCGGTCA TGCACGACGA CCAGCACGGC ACCGCGATCA TCTCCGCTGC TGGCCTGCTT
AACGCCTGCC ACCTGACCGG TCGCCGGCTC GAGGACGTGA AGGTCGTCGT GAACGGTGCG
GGGGCGGCGG CAATCGCCTG CACCGCGCTG ATCAAGGCGA TGGGCGTGCG CCACGACAAC
GTGATCATGT GCGACCGGTC CGGCGTGATC TACCGTGGCC GCGAATCCGG CATGGACCAG
TGGAAGAGCG CGCATGCGGT CGACACCACG GCGCGCAGCC TCGAGGATGC GCTCGTCGGG
GCGGACATCT TCCTGGGTCT TTCCGCCGCG GGCGCGCTCA AGCCCGAATG GGTGCTGAAG
ATGGCGCCGC AGCCGATCAT CTTCGCGATG GCCAATCCGG ACCCGGAAAT CACGCCGCCC
GATGCCAAGG CCGTCCGCCC GGATTGCATC GTCGCGACGG GCCGGTCCGA CTATCCCAAC
CAGGTCAACA ACGTCCTTGG TTTCCCCTTC ATCTTCCGCG GCGCGCTCGA TGTGCGGGCG
ACCGCGATCA ACGAAGAGAT GAAGATCGCC GCGGCAGAGG CCATCGCCCA GCTCGCGCGC
GAGCCGGTGC CCGAGGAAGT GGCTGCCGCC TATGGCATGA ACCACACCTT CGGGCTGGAC
TACATCATTC CCGCGCCGTT CGATCCGCGC CTGATGGAAG TCGTCTCCTC CGCCGTCGCC
AAGGCGGCGA TGGATTCGGG CGTCGCGCAG AAGCCTATCG AGGACTTCGA CGCCTATCGC
ACCAGCCTCA AGGCGCGTCT CAACCCGACG ACTTCGGTCC TGACCAACGT CTTCGCCACG
GCCAAGGACA ACCCCAAGCG CGTCGTCTTC GCTGAAGCCG AGAACGAGGT GGTGCTGCGC
GCCGCGATCC AGTTCAAGGA TTTCGGTTAT GGCACGCCGG TGCTGGTGGG CCGCACCCAG
CCGGTTCTCG ACCTGCTGAC CGAACTTGGC GTCAGCGACC CGTCGTCCTA CGAGATCCAC
AACTCGGCGG TTTCGCCGCT GGTGCCGGAG ATGGTCGAGT ACCTCTACGA ACGGCTCAAG
CGTCGCGGCT ATCTCGAGCG TGACGTCAGA CGCATGGTCA ACCAGGACCG CAACGTCTTC
GCCTCGCTGC TCGTCGCGCT TGGACATGGC GATGCGCTCA TCACCGGCAT GACGCGCACC
TTCGCGCAGT CGATGAAGGA GGTACGCCGC GTCCTCGATC CGAAGCCGGG GCACCTGCCC
TTCGGCATCC ACCTGATGGT GGCGAAGAAC TACACCGTGT TCCTCGCCGA CACGACGGTG
AACGAGCGTC CCTCGGCCGA AGACCTCGCG CACATCGCGC GCGAGACCGC CGCCGTTGCC
CGCCGCATGG GCCACGAGCC GCGCGTGGCG TTCCTGTCCT ATTCGACCTT CGGCAATCCC
TATGGCCGCT GGCTCGATTC GATCCGGGGC GCGGTGGCGA TCCTCGACGC CGAGAATCCC
GGCTTCGAGT ATGAGGGCGA AATGGCGCCG GACGCCGCGC TCAACCCCAA GGTCATGGCG
AACTATCCGT TCAGCCGCCT CTCGGGCCCG GCCAACGTGC TGATCATGCC TGGGCTGCAA
TCGGCCAACA TCTCGGCCAA GCTGCTGCGC GAATTGGCGG GCAATGCGGT GATCGGGCCG
ATGCTGCTGG GCATGGAGAA GCCGGTGCAG ATCGCGCCGA TGACGTCGAT CGCGCCCGAC
ATCCTGACCC TGGCCGTGCT GGCAGCGGCG GGCATCGTCG GCTGA
 
Protein sequence
MSEESNVRFT EREALFYHNT IRPGKIEIIA SKPMATQRDL SLAYSPGVAV PVRAIAEDPS 
TAYDYTAKGN LVAVISNGTA ILGLGNLGAL ASKPVMEGKA VLFKRFADVD SMDIELATED
ADAFIEAVAL LEPTFGGINL EDIKAPECFI IEQALKERLK IPVMHDDQHG TAIISAAGLL
NACHLTGRRL EDVKVVVNGA GAAAIACTAL IKAMGVRHDN VIMCDRSGVI YRGRESGMDQ
WKSAHAVDTT ARSLEDALVG ADIFLGLSAA GALKPEWVLK MAPQPIIFAM ANPDPEITPP
DAKAVRPDCI VATGRSDYPN QVNNVLGFPF IFRGALDVRA TAINEEMKIA AAEAIAQLAR
EPVPEEVAAA YGMNHTFGLD YIIPAPFDPR LMEVVSSAVA KAAMDSGVAQ KPIEDFDAYR
TSLKARLNPT TSVLTNVFAT AKDNPKRVVF AEAENEVVLR AAIQFKDFGY GTPVLVGRTQ
PVLDLLTELG VSDPSSYEIH NSAVSPLVPE MVEYLYERLK RRGYLERDVR RMVNQDRNVF
ASLLVALGHG DALITGMTRT FAQSMKEVRR VLDPKPGHLP FGIHLMVAKN YTVFLADTTV
NERPSAEDLA HIARETAAVA RRMGHEPRVA FLSYSTFGNP YGRWLDSIRG AVAILDAENP
GFEYEGEMAP DAALNPKVMA NYPFSRLSGP ANVLIMPGLQ SANISAKLLR ELAGNAVIGP
MLLGMEKPVQ IAPMTSIAPD ILTLAVLAAA GIVG