Gene Saro_1170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1170 
Symbol 
ID3916467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1210614 
End bp1212050 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content68% 
IMG OID640443906 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_496449 
Protein GI87199192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.910867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCA CCGCACCCCG CACGCTCTAC CAGAAGATCT GGGACGCCCA CGTCGTCGAA 
CGCCGTGATG ATGGCACCTG CCTCATCTAC ATCGACCGTC ACCTCGTCCA CGAAGTGACC
AGCCCGCAGG CCTTCGAGGC GCTTCGCGCC GCCGGCCGCA AGGTGCGCCG TCCCGATCTC
ACGCTTGCGG TGCCAGACCA CAACCTGCCG ACCACCGCGC GCCGTACCGC CGATGGCCGG
CGCGTGCCCA TCGCCGATCC CGAATCGGCC CAGCAGCTCG AGGCGCTGGA GCGCAACGCC
CCCGAATTCG GCATCCGCTA TATCGGCGAT GCCGATGACG AGCAGGGCAT CGTCCACGTC
GTCGGCCCGG AACAGGGCTT CTCGCTCCCC GGCGCGACGA TCGTCTGCGG CGACAGCCAC
ACCGCCTGCC ACGGCGGCCT GGGTGCGCTG GCCTTCGGCA TCGGCACGAG CGAGGTCGAG
CACGTCCTCG CCACGCAGAC CCTGCTGCTC AAGCAGTCGA AGACGATGGA AGTGCGCGTC
GAGGGCGAAC TGACCCCGGG CGTCACGGCC AAGGATGTCG TCCTGCACAT CACCGGCGTG
CTCGGCGCGG CTGGCGGCAC CGGCTCGGTC ATCGAGTACA CCGGCTCCGT CATCCGCGAC
CTGTCGATCG AGGGTCGCCT GACCATCTCC AACATGGCGA TCGAGCACGG CGCGCGCGCG
GGCCTTTGCG CTCCGGACGA AAAGACCTTC GCCTATCTCA AGGGCCGTCC CTACGCGCCC
AGGGGCGAGG ACTGGGACAA GGCCGTCGCG TGGTGGAAGA GCCTCGCGAC AGATCCCGGC
GCGACCTATG ACAAGGTCGT CGTGATCGAC GCGAAGGACA TCGCTCCTTC CGTCACCTGG
GGCACCAGCC CGGAAGACGT GCTGCCGATC TCCGGCCTCG TCCCCGCGCC TGAATCCTTC
GCAGATCCCT CCAAGCAGGA AGCCGCCCGC GCGAGCCTCG AATACATGGG CCTCGTTCCC
GGCCAGCGCA TGGAGGACGT CGAGGTGCAG AACATCTTCA TCGGCTCGTG CACCAACAGC
CGCATCGAGG ACATGCGCGC CGCTGCCGCG ATCCTGAAGG GCCGCAAGAA GGCGGACAAC
GTGAAGTGGG CCATCGTGGT GCCCGGCTCG GGGCTGGTGA AGAAGCAGGC GGAAGAGGAA
GGCCTCGACC GCGTGTTCAT CGAAGCCGGC TTCGAATGGC GCGAGCCCGG ATGTTCGGCC
TGTCTCGGCA TGAACCCGGA CAAGGTGCCA GCGGGCGAAC GCTGCGCTTC GACCTCCAAC
CGCAACTTCG TCGGCCGCCA GGGCCCCGGC GCGCGCACGC ACCTCGTCAG CCCGGCGATG
GCGGCGGCCG CTGCCGTTAC CGGCAGGCTG ACGGACGTGC GCAAGCTGAT GGCCTGA
 
Protein sequence
MSSTAPRTLY QKIWDAHVVE RRDDGTCLIY IDRHLVHEVT SPQAFEALRA AGRKVRRPDL 
TLAVPDHNLP TTARRTADGR RVPIADPESA QQLEALERNA PEFGIRYIGD ADDEQGIVHV
VGPEQGFSLP GATIVCGDSH TACHGGLGAL AFGIGTSEVE HVLATQTLLL KQSKTMEVRV
EGELTPGVTA KDVVLHITGV LGAAGGTGSV IEYTGSVIRD LSIEGRLTIS NMAIEHGARA
GLCAPDEKTF AYLKGRPYAP RGEDWDKAVA WWKSLATDPG ATYDKVVVID AKDIAPSVTW
GTSPEDVLPI SGLVPAPESF ADPSKQEAAR ASLEYMGLVP GQRMEDVEVQ NIFIGSCTNS
RIEDMRAAAA ILKGRKKADN VKWAIVVPGS GLVKKQAEEE GLDRVFIEAG FEWREPGCSA
CLGMNPDKVP AGERCASTSN RNFVGRQGPG ARTHLVSPAM AAAAAVTGRL TDVRKLMA