Gene Saro_0805 details

Gene Information

Locus tag: Saro_0805
Symbol:
ID: 3915859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 855966
End bp: 857228
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 69%
IMG OID: 640443536
Product: membrane dipeptidase
Protein accession: YP_496084
Protein GI: 87198827
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
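
The start, end, gene length, and protein length fields above are mutually consistent, which a short check can confirm. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the listed length matches that reading) and that the unspliced CDS ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(855966, 857228)
print(length, protein_length_aa(length))  # prints: 1263 420
```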


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGCC TAACCCTTGC CGCACTCGCC GCGCTGGCTG CCCTGCCGGT CGCTGCCGCC 
ACGCCCCGCA GCCCAGAGGC GGTTGCCGCC GCCGCCCTGA AAAGCGCACC GGTGTTCGAC
GGGCACAACG ACGTGCCGGA GCAGTTGCGC GACCGGCGCA AGGACGTGCT TGAAGGCTTC
GATTTCCGCG ACACCCGCGC GACGGGCGAT GCGGCAACGG GGACGCCGCC GATGATGACC
GACATAGCCC GGATGCGTGC GGGACGGGTC GGGGCGCAGT TCTGGTCGGT CTACGTCTCG
GCCAACTTGC CCGAGGCGCA GGCAGTGCAG GCGACGCTCG AGCAGATCGA CGTTACCCGG
CGGCTGATCG CGCAGTATCC GGCGGACATG CAGTTCTGCA CTGACAGCCG GTGCGTGGAC
GAGGCGTGGA AGCGCGGCCG GATCGCGTCG TTGATCGGCA TGGAAGGGGG GCATTCGATT
GGCGGATCGC TGGGCGTGCT GCGGCAGATG CACGCGCTTG GCGCGCGCTA CATGACGCTG
ACGCATTTCC GGAACACGGC CTGGGCCGAC AGCGCCACCG ACGCGCCCCA ACATGACGGG
CTGACCCCGT TCGGCGAGAA GGTGGTGCGA GAGATGCAGC GGCTGGGCAT CCTGGTGGAC
CTTGCCCACG TGAGCGAGGC GACAATGCGC GACGTACTGG CGCTGGGTGG TCCCCCGCCC
ATCGTCAGTC ATTCCAACGC GCGGGCCATC AACCACCACG CCCGCAACGT CTCGGACGAG
ACGCTGAAGG CCATCGGCGC GGCGGGCGGG ATCGTGATGG TGAATTTCTA CCCGCCGTAT
GTGCTCGAAG CGGCCCGCCA GTGGAGCGCC GCGCGCGATG CGGAGGTCGC GCGGACCAAG
TCGCTCAACC GGGGCGATCC CTCGGCTGAG AAGGCCGCGC TGGATGCCTG GGACAAGGCC
AACCCCATGC CGCGCGGCAG CGTGCAGGAT GTGGCCGATC ATGTCGATCA TATTGCCCGA
CTGACCGGCA CGGACCACGT GGGGCTGGGC GGCGACCTCG ACGGGGTGGA AGCCACCATT
GCCGGTCTCG ACGATGCCGC GAGCTATCCC GCGCTGTTCG TGGAATTGGC AAGGCGGGGC
TGGTCGCAGG GCGATCTCGA GAAGCTGGCC AACGGCAACA TGATGAGGGT CCTGCGGGTG
GCCGAAGCCT ATGCCAGCCA GCACCGGGCC GACGGGCCGC TGGAAAGTCC GGTCGGCTTC
TGA
 
Protein sequence
MKRLTLAALA ALAALPVAAA TPRSPEAVAA AALKSAPVFD GHNDVPEQLR DRRKDVLEGF 
DFRDTRATGD AATGTPPMMT DIARMRAGRV GAQFWSVYVS ANLPEAQAVQ ATLEQIDVTR
RLIAQYPADM QFCTDSRCVD EAWKRGRIAS LIGMEGGHSI GGSLGVLRQM HALGARYMTL
THFRNTAWAD SATDAPQHDG LTPFGEKVVR EMQRLGILVD LAHVSEATMR DVLALGGPPP
IVSHSNARAI NHHARNVSDE TLKAIGAAGG IVMVNFYPPY VLEAARQWSA ARDAEVARTK
SLNRGDPSAE KAALDAWDKA NPMPRGSVQD VADHVDHIAR LTGTDHVGLG GDLDGVEATI
AGLDDAASYP ALFVELARRG WSQGDLEKLA NGNMMRVLRV AEAYASQHRA DGPLESPVGF
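
The sequences above are printed in space-separated blocks of ten, so they need cleaning before reuse. The following is a minimal sketch, not the database's own tooling, of how one might strip that layout, compute the GC fraction, and translate the gene. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
print(translate("ATGAAGCGCC TAACCCTTGC"))  # prints: MKRLTL
```

Applying `gc_fraction(clean_sequence(...))` to the full gene sequence gives a value to compare against the listed GC content, and `translate` on the full gene can be compared against the listed protein sequence.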