Gene Saro_3828 details


Gene Information

Locus tag: Saro_3828
Symbol:
ID: 5077976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 482563
End bp: 483726
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 68%
IMG OID: 640481551
Product: muconate and chloromuconate cycloisomerase
Protein accession: YP_001166213
Protein GI: 146276053
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR02534] muconate and chloromuconate cycloisomerases
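
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 482563
end_bp = 483726
gene_length_bp = 1164
protein_length_aa = 387

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one untranslated stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length)          # 1164
print(computed_protein_length)  # 387
```

Both computed values agree with the Gene Length and Protein Length fields of the record.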


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.126456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCGC TCGCCAACCC CCAGATTCTC GGCATCGAGA CGATTCTTCT CGATCTGCCG 
ACCATCCGTC CGCACGTGCT GGCCATGGCC ACGATGCACG CCCAGACGAT CTGCCTGGTT
CGCCTGACCT GCTCCGATGG TATCGTCGGA TTGGGCGAGG CGACCACGAT CGGCGGCCTC
GCATATGGCC CGGAAGCCCC GGAAACGATC AAGACCGCCA TCGACACCTA CTTCGCCCCG
CTTCTTGCCG GGCAGGATGC CACGCGCCCC GCCGCGGCCA TGGCGCTCGT CGCCCGCCAC
GTCGTCGGGA ATCACTTCGC CAAGTGCGCG ATCGAGACCG CGCTGCTCGA CGCACAGGGC
AAGCGACTTG GCCTTCCGGT CAGTGAACTC CTTGGCGGCC GCCGCGTGGA TTCGCTACCG
GTGCTCTGGA CGCTCGCCAG CGGCGATACC GCGCGCGACA TCGCAGAGGC GGAGCAGATG
CTCGACACGC GCCGACACGA CGCGTTCAAG CTCAAGATCG GCAAGCGCCC GATCGAACAG
GATGTCGCCC ATGTCGGCGC GATCAAGGCT GCGCTCGGCG ACCGGGCTTC GGTCCGCGTC
GACGTCAACA TGGCGTGGGA CGAACCCACG GCGCGGCGCG GTCTTGCCAT GCTGGCCGAT
GCGGGATGCG ACCTCGTCGA ACAACCGATC ATCCGCCACA ACCGCGATGG CATGGCCCGC
CTCGTCGCGC TGGGGCTGGT CCCGGTCATG GCTGACGAGA GCCTTACCGG TCCGGCCAGC
GCGATGGACT TCGCCCGCGC CGCCGCCGCC GATGTCTTCG CCGTGAAGAT CGAGCAGTCC
GGCGGGCTCG ATGCGGCGCG CGCGGTGGCG CAGATCGGCG ATGCGGCCTG CATCGGCCTT
TATGGCGGCA CCATGCTCGA AGGCGCCATC GGTACCATCG CATCGGCCCA CGCTTTTGCC
ACTTTCCCGG CGTTGAAGTG GGGCACCGAA CTTTTCGGCC CGCTCCTCCT CACGGAGGAA
ATCCTCGAGC GCCCCCTCAC CTACGCCGAT TTCTCGCTCG AAGTGCCGGC GGGGCCCGGT
CTCGGCATCG CCCTCGACGA AGACCGCGTC GAACACTTCC GGCGCGATCG CACCGCAACC
CAGTTCGCCT TGCAAGGAGC CTGA
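
A GC content figure such as the one reported for this gene is the percentage of G and C bases in the sequence. The sketch below shows one way it could be computed from a listing in the 10-base block layout used above; `gc_content` is a hypothetical helper, not a function of the source database, and the exact rounding used by the record is not known.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence listing above (60 bases).
first_row = "ATGACCGCGC TCGCCAACCC CCAGATTCTC GGCATCGAGA CGATTCTTCT CGATCTGCCG"
print(f"{gc_content(first_row):.1f}%")
```

A single row need not match the whole-gene figure; passing the full 1164-base listing to the same function gives the value for the complete gene.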
 
Protein sequence
MTALANPQIL GIETILLDLP TIRPHVLAMA TMHAQTICLV RLTCSDGIVG LGEATTIGGL 
AYGPEAPETI KTAIDTYFAP LLAGQDATRP AAAMALVARH VVGNHFAKCA IETALLDAQG
KRLGLPVSEL LGGRRVDSLP VLWTLASGDT ARDIAEAEQM LDTRRHDAFK LKIGKRPIEQ
DVAHVGAIKA ALGDRASVRV DVNMAWDEPT ARRGLAMLAD AGCDLVEQPI IRHNRDGMAR
LVALGLVPVM ADESLTGPAS AMDFARAAAA DVFAVKIEQS GGLDAARAVA QIGDAACIGL
YGGTMLEGAI GTIASAHAFA TFPALKWGTE LFGPLLLTEE ILERPLTYAD FSLEVPAGPG
LGIALDEDRV EHFRRDRTAT QFALQGA
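
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First 30 and last 24 bases of the gene sequence listed above.
first_30 = "ATGACCGCGC TCGCCAACCC CCAGATTCTC"
last_24 = "CAGTTCGCCT TGCAAGGAGC CTGA"
print(translate(first_30))  # MTALANPQIL
print(translate(last_24))   # QFALQGA*
```

The two outputs match the first ten and last seven residues of the protein sequence, with the trailing `*` marking the TGA stop codon.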