Gene Saro_0238 details


Gene Information

Locus tag: Saro_0238
Symbol:
ID: 3917587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 246695
End bp: 248005
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 67%
IMG OID: 640442963
Product: peptidase M20
Protein accession: YP_495520
Protein GI: 87198263
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
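
The coordinates and lengths listed in this section are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(246695, 248005)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```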


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.489939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTTGC CGATGGGTCT GGTTGCCGTT CTCGCGCTCG TCGGAACCGC CGCACATGCA 
GCGCCCAAGG GTGCCGAGGC ACGCATGATT GCCACCGTCG ATGCCGAGCA GGCGCGCACG
TTGAGGTTCC TCGAAGTGAT GGTCAACCAG AACTCGGGCA GCCGCAACCT CGAAGGCGTG
CGAAGGTTGC GCGACATTGT CGTGCCCGAA TTCACCGCGC TGGGCTTCAC GTCCCGCTGG
ATCCCGATGG AACGGACCGG CCGGGCCGGG CACCTCGTCC TCACCCACAA GGGCCGCCAA
GGCGCGAAGA AGCTTCTGCT GATCGGCCAC CTCGATACCG TGTTCGAACC TGACTCCCCC
TTCCAGACCT ATGTCCTGAA CGGCGAAAAG GCGACCGGCC CTGGCGTCGG CGATGACAAG
GGTGGCATCG CCGTGATCCT CGCTGCGGTC CGCGCCATGA ACGCTGCAGG AACGCTGAAG
GGCGCCAGCA TCGAAGTCTT CCTTACCGGC GACGAAGAGG AGGCAGGCTC TCCCACCGAA
GTCGCCCGCG CCGATCTCGT TGCCGCCGCC AGGGCCGCCG ACGTCGCGCT GGATTTCGAA
GGCCTCTCCA GAGAGAACGG CCGCGACATG GGCTCGATCG CCCGCCGATC CTCGCAAAGC
TGGTCTTTGA CGGTCGAGGC GAAGTCCGGC CACTCCAGCG GCGTCTGGGG CGCAAACGCG
GGCGATGGCG CGATCTATGC CGCCGCGAAG ATCGTGAATG CCTTCCGCAC CGAACTGCCC
GAACCCTGGC TTACCCTCAA CGTCGGCCTG ATCGCGGGCG GGGCGGAGGC AGAGGTCGCC
GAGGACAACG CCCACGTCTC GGCACAGGGC AAGACCAATA TCATACCGGG CGAGGTCATC
GCCCGCGGAG ACCTGCGCAC CCTCAGTCCC GAACAGAACC GCGCCGCCAT GCGCAAGATG
GAGGAGATCG TCGGCAGGCC CTACCCCGGC GTCACCTCAG CCCGCATCGC ATTTAGCGAA
GGCTACCCGC CCATGGCCCC GACCGAAGGC AACAAGGCGT TGCTGGCCCG CCTGAATCAG
GTCAACGCCA CGCTTGGCTT GCCCGAAATG CAGCCGCTCG ATCCGATGAA GCGCGGGGCC
GGGGACATCA GCTTCGTCGC GGAATACATC GACGGCCTCG TCGGCCTCGG CCCGCACTCC
ACCGGCGATC ACGCGCCGGG CGAAACGGTC GACGTCCCCA GCATCTGGAC CCAGGCCAAG
CGCGCCGCCC TGCTGATGAC CCGGCTCTCG GCGGAGAAGT CCGCGCGGTG A
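
The gene sequence is laid out in blocks of ten bases separated by spaces, so the whitespace has to be stripped before anything is computed from it. The following is a minimal sketch of how a GC content figure could be recomputed from such a listing, assuming GC content means the percentage of G and C bases over the whole sequence. Both helper names are hypothetical.

```python
def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as laid out in the listing.
example = clean_sequence("ATGCGTTTGC CGATGGGTCT")
print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Passing the full listing through `clean_sequence` first gives a value that can be compared with the rounded percentage reported for the gene.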
 
Protein sequence
MRLPMGLVAV LALVGTAAHA APKGAEARMI ATVDAEQART LRFLEVMVNQ NSGSRNLEGV 
RRLRDIVVPE FTALGFTSRW IPMERTGRAG HLVLTHKGRQ GAKKLLLIGH LDTVFEPDSP
FQTYVLNGEK ATGPGVGDDK GGIAVILAAV RAMNAAGTLK GASIEVFLTG DEEEAGSPTE
VARADLVAAA RAADVALDFE GLSRENGRDM GSIARRSSQS WSLTVEAKSG HSSGVWGANA
GDGAIYAAAK IVNAFRTELP EPWLTLNVGL IAGGAEAEVA EDNAHVSAQG KTNIIPGEVI
ARGDLRTLSP EQNRAAMRKM EEIVGRPYPG VTSARIAFSE GYPPMAPTEG NKALLARLNQ
VNATLGLPEM QPLDPMKRGA GDISFVAEYI DGLVGLGPHS TGDHAPGETV DVPSIWTQAK
RAALLMTRLS AEKSAR
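
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (table 11 shares these with the standard code and differs in its permitted start codons) and assuming translation stops at the first stop codon. `translate` is a hypothetical helper written for illustration, not a library call.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(listing: str) -> str:
    """Translate a DNA listing (whitespace allowed) up to the first stop codon."""
    seq = "".join(listing.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last four codons of the gene sequence.
print(translate("ATGCGTTTGC CGATGGGTCT GGTTGCCGTT"))  # MRLPMGLVAV
print(translate("TCCGCGCGGTGA"))                      # SAR
```

The two outputs match the first block and the last three residues of the protein sequence.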