Gene Saro_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0074 
Symbol 
ID3917673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp74760 
End bp76301 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content71% 
IMG OID640442799 
Productpeptidase M23B 
Protein accessionYP_495357 
Protein GI87198100 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCGC TGGCCGCCGC AGGCGGTGCG GGGGGCTTCG GTCTCCGCAC GACCCCGCCC 
TCGCGCGCCG CTTCCCCGGC CAGCGCATCG CGGCTCCCGG CTTACCTGAA CGGCCTCGAA
CGACACCTGA CGCGACACTG GCGTCCCTTG GCCGCGCGAA TCGAGCGCTG GTTCGCCGCA
AGGGACCTGG CGCCCGACCT TGCCGAGGAC ATCGGCAGCG CCCGCTGGTT CCGCGGCTTC
GGGACCATGC TCGCGCTGGG CGCGGCGGCG GTATTCTTCT GGCCGGATTT CGCCCCGCTC
CAGGCCGCGC CCCTGACCGC GCTCGATTCC GCCGCGACCG ACGAATTTCG CGTCCAGGGC
ATCCGCCCCC TGGCCTACGG CGCCGACAAC GGGCGGCGGA TGAACGCGAC GGCTGCGGTC
ATCCCGCTCG CCGCCGCGCC CGAGCGGCCG AGCGTCGACC TGGCAGCGAC GCTGGGCGCC
GGGGACAGCT TCCCCCGGAT GCTCCAGCGC GCCGGCCTGT CCAGCGTGGA TATCGTCCGC
GTCCTCGACC TTGTGGGCGG GCAGGTCAGC CCTGGCACCA TTCCGGCCGG AACGCGCTTT
GCCATCCGGC TCGGTGCGCG CACCTCTCCC GCGCAGCCGC GCCCGCTGGA ACAGCTTTCC
TTCCGCCCGC GCTTCGACCT GGCGCTCGAC GTCCATCGCA GCGGCGGCGG GCTGACGCTC
GCATCCAATG CCATCGCGGT CGACACAACG CCGATGCGCG TGCGCGGGAT CGTCGGGCAA
AGCCTCTACC GTTCCGCCCG CGCGGCCGGC GCGCCGCCTA CGGCGGTGCA GGACTACCTT
CGCGCCATCG ACCAGCACAT GGCCTTCGAG GAGATCGCGC CGGGCGACGA ATTCGACCTC
GTCTTTGCCA ACCGCCGCGC CTCGAGCGGC GAGCAGCAGC CGGGCGACCT GATCTATGCG
GGCGTGGTCC GCGCCGGAAA GCCGGTGCTG CAACTTCTGC GCTGGGGCAA TGACGGTGGC
TTTTATTCGC CGCAGGGCAT GGCCGAGGGC GCGCAGGAGC GGGAGAGCTT CCTGGGCGCG
CCGGTCAACG GGCGCATCAC GTCGGGCTAT GGCGCGCGCC GCCATCCCAT CCTCGGGTAC
GTGCGGATGC ATGCCGGGAT CGATTTTGCC GCAGGCTGGG GCGCGCCGAT CTACGCCGCG
ACGGACGGAC GCGTGACCTT TGCCGGATGG CACGGCGGAC ACGGCAACTA CGTCCGGCTC
GATCACGGCG GCGGCATCGG CACGGGATAT GGCCACATGA GCAGGATCGC GGTTGCGCCC
GGCATGAGCG TGCGGCGCGG ACAGGTGATC GGCTATGTCG GGTCGAGCGG GCTCTCGACC
GGACCGCATC TCCATTACGA GATGTATCGC GGCGGGCAGA CGGTGAATCC GCTGTCGATG
GGGGCGATCA CCACGCGCGC CACGGTCGAT CCGGCGCAGC TTGCCGCGTT CCGGGCCAGG
CTGGCGCAAG TCACCGCGAT CCACGCCAAC GCGCTTCGGT GA
 
Protein sequence
MMALAAAGGA GGFGLRTTPP SRAASPASAS RLPAYLNGLE RHLTRHWRPL AARIERWFAA 
RDLAPDLAED IGSARWFRGF GTMLALGAAA VFFWPDFAPL QAAPLTALDS AATDEFRVQG
IRPLAYGADN GRRMNATAAV IPLAAAPERP SVDLAATLGA GDSFPRMLQR AGLSSVDIVR
VLDLVGGQVS PGTIPAGTRF AIRLGARTSP AQPRPLEQLS FRPRFDLALD VHRSGGGLTL
ASNAIAVDTT PMRVRGIVGQ SLYRSARAAG APPTAVQDYL RAIDQHMAFE EIAPGDEFDL
VFANRRASSG EQQPGDLIYA GVVRAGKPVL QLLRWGNDGG FYSPQGMAEG AQERESFLGA
PVNGRITSGY GARRHPILGY VRMHAGIDFA AGWGAPIYAA TDGRVTFAGW HGGHGNYVRL
DHGGGIGTGY GHMSRIAVAP GMSVRRGQVI GYVGSSGLST GPHLHYEMYR GGQTVNPLSM
GAITTRATVD PAQLAAFRAR LAQVTAIHAN ALR