Gene Saro_0885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0885 
Symbol 
ID3917971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp939719 
End bp941863 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content66% 
IMG OID640443619 
Productmalate synthase G 
Protein accessionYP_496164 
Protein GI87198907 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATGG GTGAAATGAA CGGACGTATC GAGCGCAGCG GGCTTCAGGT CGACGCGAAG 
CTGGCCACTT TCATCGACGG CGAGGTTCTC GGCCCGCTCG GCATTCAGGC CGACGCATTC
TGGCAGGGCT TCGCGGCGCT TGTCGGCCAC TTTACCCCGG TAAACCGCCA GCTTCTGGTG
AAGCGCGATT CCCTCCAGGC GCTGATCGAT TCCTGGCACC GCGAGCGTCG GGGCAAGCCC
ATCCACGCGC ACGAGTACCG CGCCTTCCTG AGCGAGATCG GCTATCTCGT GCCCGAGCCG
GACGACTTCC TGATCGGAAC CGAAAACGTC GACCGAGAGA TCGCGTCGAT GGCGGGGCCG
CAGCTCGTCG TGCCGGTCCT CAACGACCGG TTTGTCCTTA ATGCCGCGAA CGCGCGCTGG
GGCAGCCTGT ATGACGCATG GTATGGCACC GACGCGCTCG ATGCGCCGCC GGCCCGTCCG
GGTGGTTACG ACAAGGAACG CGGGGCCGCG GTCGTCGCCG CCGGACGCGC ATTCCTCGAC
CTGACGTTCC CTCTCGACAG CGCGAGCTGG TCCGACTGGG ATGGCGAAGG CGTGCCGCCG
CTGTGCGACG CGGACCAGTT CGCCGGGACG AAGCCCGGGG GCATCCTGCT TTGCAACAAT
GGCCTGCACG TCGAGATCGT GCTCGACCGC GCGCACCCGG TCGGCGCAGA CGACAAGGCG
GGCATTGCCG ACATCGTGAT GGAAGCGGCG CTGACGACCA TCGTCGATCT CGAGGATTCG
GTTGCGGCGG TCGACGCCGA GGACAAGCTG CTGGCCTATC GCAACTGGCT GGGCCTGATG
CGCGGCGACC TGGAGGCGAG CTTCCAGAAG GGCGGCAGGA CGCTGACCCG CGCGCTGGAG
GCTGACCGCG AGTGGACTTC GGCCAAGGGC GAGCCGCTGA CCCTGCCGGG GCGCAGCCTG
CTGTTCGTGC GCAACGTCGG ACACCTGATG ACCAACCCTG CGATCCTGCT TCCCGATGGC
AGCGAGATTC CCGAAGGGAT CATGGATGCG GTTGTGACTT CGGCCATCGC GATGCACGAC
ATCAGGGGGC TGGGCCGCCA TCGCAACAGC CGCGCGGGCA GCATCTATAT CGTGAAGCCC
AAGATGCACG GGCCGGAAGA GGTCGGCTTC ACCAACGATC TGTTCAACGC GGTCGAGGAC
CTGCTCGGGC TGGATCGCCA CACGGTCAAG GTCGGGGTGA TGGACGAGGA GCGCCGCACT
TCGGCCAACC TCGCCGCGTG CATCCGCGCG GTGAAGGACC GTATCGTCTT CATCAACACC
GGCTTCCTCG ACCGTACCGG CGACGAGATC CACACCTCGA TGCAGGCGGG GCCGATGATC
CGCAAGGGTG CGATCAAGGG GTCGGGCTGG ATCGCCGCCT ACGAGAAGCG CAACGTCTGC
ATCGGCCTTG CGCACGGGCT TTCCGGCAAG GCGCAGATCG GCAAGGGCAT GTGGGCCGCG
CCCGACATGA TGCACGACAT GATGCAGCAG AAGATCGCGC ACCCGAAGAC CGGCGCGAAC
ACCGCCTGGG TGCCTTCGCC CACGGCAGCC ACGCTGCATG CCATGCACTA CCACCAGGTC
GCGGTGTTCG ACGTGCAGCG CGAAGTGGCG CAGGAGAAGA CGCCGGGGCT CGACGCGCTG
CTCGCGATTC CGCTGGCGGA GGGGACCAAC TGGTCGGCCG AGGAAGTGCG CGAGGAGCTG
GACAACAACG CGCAAGGCCT GCTCGGCTAT GTCGTGCGCT GGATCGACCA GGGCGTCGGC
TGTTCGAAGG TGCCCGACAT CAACGACGTC GGCCTGATGG AAGACCGCGC GACGCTGCGC
ATTTCCAGCC AGCACATGGC CAACTGGTTG CTGCACGGCG TGGCCACCGG TGAGCAGGTG
ATGGACAGCC TCAAACGCAT GGCGGCCAAG GTCGATGCGC AGAACGCCGG CGATCCGCTC
TATGAGCCGA TGGCCGGACG CTGGGAGGAA AGCTTTGCCT TCCGCGCCGC CTGCGATCTC
GTGTTCAACG GGGTGGAACA GCCCAACGGC TATACCGAGC CGCTGCTGCA CGCGTGGCGC
CTGAAGAAGA AGGCGGCGGT GGGGAAGGTG GCCGAACCGG CCTGA
 
Protein sequence
MDMGEMNGRI ERSGLQVDAK LATFIDGEVL GPLGIQADAF WQGFAALVGH FTPVNRQLLV 
KRDSLQALID SWHRERRGKP IHAHEYRAFL SEIGYLVPEP DDFLIGTENV DREIASMAGP
QLVVPVLNDR FVLNAANARW GSLYDAWYGT DALDAPPARP GGYDKERGAA VVAAGRAFLD
LTFPLDSASW SDWDGEGVPP LCDADQFAGT KPGGILLCNN GLHVEIVLDR AHPVGADDKA
GIADIVMEAA LTTIVDLEDS VAAVDAEDKL LAYRNWLGLM RGDLEASFQK GGRTLTRALE
ADREWTSAKG EPLTLPGRSL LFVRNVGHLM TNPAILLPDG SEIPEGIMDA VVTSAIAMHD
IRGLGRHRNS RAGSIYIVKP KMHGPEEVGF TNDLFNAVED LLGLDRHTVK VGVMDEERRT
SANLAACIRA VKDRIVFINT GFLDRTGDEI HTSMQAGPMI RKGAIKGSGW IAAYEKRNVC
IGLAHGLSGK AQIGKGMWAA PDMMHDMMQQ KIAHPKTGAN TAWVPSPTAA TLHAMHYHQV
AVFDVQREVA QEKTPGLDAL LAIPLAEGTN WSAEEVREEL DNNAQGLLGY VVRWIDQGVG
CSKVPDINDV GLMEDRATLR ISSQHMANWL LHGVATGEQV MDSLKRMAAK VDAQNAGDPL
YEPMAGRWEE SFAFRAACDL VFNGVEQPNG YTEPLLHAWR LKKKAAVGKV AEPA