Gene Information |
Locus tag | Saro_0885 |
Symbol | |
ID | 3917971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 939719 |
End bp | 941863 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443619 |
Product | malate synthase G |
Protein accession | YP_496164 |
Protein GI | 87198907 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
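The coordinate and length fields above are consistent with each other. The sketch below checks the arithmetic under two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive positions, and that the CDS span includes the stop codon. The function name `feature_lengths` is hypothetical and only for illustration.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1      # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


print(feature_lengths(939719, 941863))    # (2145, 714)
```

With the values from this record the result matches the listed Gene Length (2145 bp) and Protein Length (714 aa).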
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATGG GTGAAATGAA CGGACGTATC GAGCGCAGCG GGCTTCAGGT CGACGCGAAG CTGGCCACTT TCATCGACGG CGAGGTTCTC GGCCCGCTCG GCATTCAGGC CGACGCATTC TGGCAGGGCT TCGCGGCGCT TGTCGGCCAC TTTACCCCGG TAAACCGCCA GCTTCTGGTG AAGCGCGATT CCCTCCAGGC GCTGATCGAT TCCTGGCACC GCGAGCGTCG GGGCAAGCCC ATCCACGCGC ACGAGTACCG CGCCTTCCTG AGCGAGATCG GCTATCTCGT GCCCGAGCCG GACGACTTCC TGATCGGAAC CGAAAACGTC GACCGAGAGA TCGCGTCGAT GGCGGGGCCG CAGCTCGTCG TGCCGGTCCT CAACGACCGG TTTGTCCTTA ATGCCGCGAA CGCGCGCTGG GGCAGCCTGT ATGACGCATG GTATGGCACC GACGCGCTCG ATGCGCCGCC GGCCCGTCCG GGTGGTTACG ACAAGGAACG CGGGGCCGCG GTCGTCGCCG CCGGACGCGC ATTCCTCGAC CTGACGTTCC CTCTCGACAG CGCGAGCTGG TCCGACTGGG ATGGCGAAGG CGTGCCGCCG CTGTGCGACG CGGACCAGTT CGCCGGGACG AAGCCCGGGG GCATCCTGCT TTGCAACAAT GGCCTGCACG TCGAGATCGT GCTCGACCGC GCGCACCCGG TCGGCGCAGA CGACAAGGCG GGCATTGCCG ACATCGTGAT GGAAGCGGCG CTGACGACCA TCGTCGATCT CGAGGATTCG GTTGCGGCGG TCGACGCCGA GGACAAGCTG CTGGCCTATC GCAACTGGCT GGGCCTGATG CGCGGCGACC TGGAGGCGAG CTTCCAGAAG GGCGGCAGGA CGCTGACCCG CGCGCTGGAG GCTGACCGCG AGTGGACTTC GGCCAAGGGC GAGCCGCTGA CCCTGCCGGG GCGCAGCCTG CTGTTCGTGC GCAACGTCGG ACACCTGATG ACCAACCCTG CGATCCTGCT TCCCGATGGC AGCGAGATTC CCGAAGGGAT CATGGATGCG GTTGTGACTT CGGCCATCGC GATGCACGAC ATCAGGGGGC TGGGCCGCCA TCGCAACAGC CGCGCGGGCA GCATCTATAT CGTGAAGCCC AAGATGCACG GGCCGGAAGA GGTCGGCTTC ACCAACGATC TGTTCAACGC GGTCGAGGAC CTGCTCGGGC TGGATCGCCA CACGGTCAAG GTCGGGGTGA TGGACGAGGA GCGCCGCACT TCGGCCAACC TCGCCGCGTG CATCCGCGCG GTGAAGGACC GTATCGTCTT CATCAACACC GGCTTCCTCG ACCGTACCGG CGACGAGATC CACACCTCGA TGCAGGCGGG GCCGATGATC CGCAAGGGTG CGATCAAGGG GTCGGGCTGG ATCGCCGCCT ACGAGAAGCG CAACGTCTGC ATCGGCCTTG CGCACGGGCT TTCCGGCAAG GCGCAGATCG GCAAGGGCAT GTGGGCCGCG CCCGACATGA TGCACGACAT GATGCAGCAG AAGATCGCGC ACCCGAAGAC CGGCGCGAAC ACCGCCTGGG TGCCTTCGCC CACGGCAGCC ACGCTGCATG CCATGCACTA CCACCAGGTC GCGGTGTTCG ACGTGCAGCG CGAAGTGGCG CAGGAGAAGA CGCCGGGGCT CGACGCGCTG CTCGCGATTC CGCTGGCGGA GGGGACCAAC TGGTCGGCCG AGGAAGTGCG CGAGGAGCTG GACAACAACG CGCAAGGCCT GCTCGGCTAT GTCGTGCGCT GGATCGACCA GGGCGTCGGC 
TGTTCGAAGG TGCCCGACAT CAACGACGTC GGCCTGATGG AAGACCGCGC GACGCTGCGC ATTTCCAGCC AGCACATGGC CAACTGGTTG CTGCACGGCG TGGCCACCGG TGAGCAGGTG ATGGACAGCC TCAAACGCAT GGCGGCCAAG GTCGATGCGC AGAACGCCGG CGATCCGCTC TATGAGCCGA TGGCCGGACG CTGGGAGGAA AGCTTTGCCT TCCGCGCCGC CTGCGATCTC GTGTTCAACG GGGTGGAACA GCCCAACGGC TATACCGAGC CGCTGCTGCA CGCGTGGCGC CTGAAGAAGA AGGCGGCGGT GGGGAAGGTG GCCGAACCGG CCTGA
|
Protein sequence | MDMGEMNGRI ERSGLQVDAK LATFIDGEVL GPLGIQADAF WQGFAALVGH FTPVNRQLLV KRDSLQALID SWHRERRGKP IHAHEYRAFL SEIGYLVPEP DDFLIGTENV DREIASMAGP QLVVPVLNDR FVLNAANARW GSLYDAWYGT DALDAPPARP GGYDKERGAA VVAAGRAFLD LTFPLDSASW SDWDGEGVPP LCDADQFAGT KPGGILLCNN GLHVEIVLDR AHPVGADDKA GIADIVMEAA LTTIVDLEDS VAAVDAEDKL LAYRNWLGLM RGDLEASFQK GGRTLTRALE ADREWTSAKG EPLTLPGRSL LFVRNVGHLM TNPAILLPDG SEIPEGIMDA VVTSAIAMHD IRGLGRHRNS RAGSIYIVKP KMHGPEEVGF TNDLFNAVED LLGLDRHTVK VGVMDEERRT SANLAACIRA VKDRIVFINT GFLDRTGDEI HTSMQAGPMI RKGAIKGSGW IAAYEKRNVC IGLAHGLSGK AQIGKGMWAA PDMMHDMMQQ KIAHPKTGAN TAWVPSPTAA TLHAMHYHQV AVFDVQREVA QEKTPGLDAL LAIPLAEGTN WSAEEVREEL DNNAQGLLGY VVRWIDQGVG CSKVPDINDV GLMEDRATLR ISSQHMANWL LHGVATGEQV MDSLKRMAAK VDAQNAGDPL YEPMAGRWEE SFAFRAACDL VFNGVEQPNG YTEPLLHAWR LKKKAAVGKV AEPA
|
| |
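The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own method. It builds the codon table from the compact NCBI layout (bases in the order T, C, A, G), whose amino acid assignments are the same for translation table 11 and the standard code. It does not handle alternative start codons, which table 11 permits, because this gene starts with ATG. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same for tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                     # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGACATGG GTGAAATGAA CGGACGTATC"))  # MDMGEMNGRI
print(translate("GCCGAACCGG CCTGA"))                  # AEPA
```

The two fragments translate to the first ten and last four residues of the listed protein sequence.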