Gene Saro_2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2267 
Symbol 
ID3916583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2406158 
End bp2407828 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content64% 
IMG OID640445021 
Product2-isopropylmalate synthase 
Protein accessionYP_497538 
Protein GI87200281 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00970] 2-isopropylmalate synthase, yeast type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGC TGAAGAACCC CTCGGCCAAG TACCGTCCCT TCCCGCAGAT CGACCTGCCC 
GATCGCCAGT GGCCCAGCCG CACGATCACC AAGGCCCCGC GCTGGCTCTC GACCGACATG
CGCGACGGCA ACCAGTCGCT GATCGATCCG ATGGATGCCG AAAAGAAGAG CCGTTTCTTC
GATCTGCTGC TCAAGGTGGG CATCAAGGAA ATCGAGGTCG GCTTCCCCTC TGCCGGCGCG
ACCGAGTACG ATTTCATCCG CGGGCTGGTC GATTCCGGCC GCATTCCCGA TGACGTCTTC
GTCCAGGTCC TCACGCAGTC CCGCGAGGAC CTGATCAAGA CGTCGTTCGA AAGCCTCGCC
GGCGCGAAGC AGGCCATCGT CCACGTCTAC AACGCGGTTT CGCCGCTGTG GCGCCAGGTC
GTGTTCGGCA TGGAAAAGTC GGACGTGAAG GACATCGCCA TCGCCGGTGC CAAGCACCTG
CGCGATCAGG CCGCTCGCTT CCCGCAGACC GACTGGCACT TCGAATACAG CCCGGAGACG
TTCTCCACTG CCGAACTCGA TTTCAGCATT GAATGCTGCG AGGCGGTCAT GGAAATTCTC
CAGCCGACGG TCGAAAAGCC GATCATCCTC AACCTGCCCG CAACGGTCGA GGCGGCGACG
GCCAACATCT ATGCGGACCA GATCGAATAC TTCTGCCGCA ATCTGCCGGG CCGCGACCGC
GCGGTGATCT CGCTGCACAC CCACAACGAC CGCGGCACCG GCGTCGCCGC CGCCGAGCTC
GGCCTGATGG CGGGCGCGGA CCGCGTCGAG GGTTGCCTGT TCGGCAATGG CGAACGCACG
GGCAATTGCT GCCTCGTTAC CGTCGGCCTC AACATGTACA CGCAAGGGGT CGATCCGGAG
CTTGACTTCT CGGACATCGA CGAAGTGATC CAGACGGTTG AATACTGCAA CCAGCTCCCC
GTCCATCCGC GCCATCCCTA TGGCGGCGAA CTGGTCTTCA CCGCGTTTTC CGGCAGCCAC
CAGGACGCCA TCAAGAAGGG CTTCGCCGCG CAGGAAAAGC GCAATGACGA ACTCTGGTCG
GTGCCCTACC TGCCGATCGA CCCGGCCGAT CTCGGCCGCA GCTACGAAGC TGTGATCCGC
GTCAATTCGC AATCCGGCAA GGGCGGCTTT GCCTGGGTCC TGGAACAGGA CCAGGGCCTC
AAGCTGCCCA AGAAGATGCA GGCGCACTTC TCGCGCCACG TGCAGGAACT GGCCGACGAA
CTGGGCCGCG AACTGCAGGC GGCAGATATC TGGGGCGTCT TCCGCAAGGC CTATCGCCTC
GACGCGCCGC AGCATCTGCA ACTGATCGAC TACGAAGAAA CGCGCGGTGC CGACGGGACG
CGCATCTTCG CTGGCAAGAT CGAGGTCGAT GGCAAGGTCC AGTCCGTGTC CGGTCGTGGC
AATGGCCTGA TTTCCTCGGT CGTGGCCACG CTGCGCGACG GCTTCGGCGT CGAGATCGAC
ATAACCGACT ATTCCGAACA TGCGATGGGC GCCGGCAGCA ACGCTCGCGC GGCCGCCTAC
GTCGAATGCA GGACCGCAGA TGGCCGCACC ATCTGGGGCG TGGGGATCGA TGAAGACGTC
GCCACGGCCA GCGTCCGCGC CGTGCTCAGC GCGGCCAACG CGGTCGGGTG A
 
Protein sequence
MTMLKNPSAK YRPFPQIDLP DRQWPSRTIT KAPRWLSTDM RDGNQSLIDP MDAEKKSRFF 
DLLLKVGIKE IEVGFPSAGA TEYDFIRGLV DSGRIPDDVF VQVLTQSRED LIKTSFESLA
GAKQAIVHVY NAVSPLWRQV VFGMEKSDVK DIAIAGAKHL RDQAARFPQT DWHFEYSPET
FSTAELDFSI ECCEAVMEIL QPTVEKPIIL NLPATVEAAT ANIYADQIEY FCRNLPGRDR
AVISLHTHND RGTGVAAAEL GLMAGADRVE GCLFGNGERT GNCCLVTVGL NMYTQGVDPE
LDFSDIDEVI QTVEYCNQLP VHPRHPYGGE LVFTAFSGSH QDAIKKGFAA QEKRNDELWS
VPYLPIDPAD LGRSYEAVIR VNSQSGKGGF AWVLEQDQGL KLPKKMQAHF SRHVQELADE
LGRELQAADI WGVFRKAYRL DAPQHLQLID YEETRGADGT RIFAGKIEVD GKVQSVSGRG
NGLISSVVAT LRDGFGVEID ITDYSEHAMG AGSNARAAAY VECRTADGRT IWGVGIDEDV
ATASVRAVLS AANAVG