Gene Saro_0290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0290 
Symbol 
ID3916227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp312028 
End bp313980 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content65% 
IMG OID640443019 
Productglycyl aminopeptidase 
Protein accessionYP_495572 
Protein GI87198315 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGA CGCTCGCGGC AGCACCGCTC CTTCTCGCCC TTTCCGTTTC AACTCAGGCT 
CTTGCGGCGA ACTCTGCGCC GATGGCCGTG CCGATCGCGC AGACCGTGCC GGACGCGCAG
GACGTGGCCT ATCCGGGCAC AATGACGCTC GACATCGACG CGTCGGACGT CGTGCGCGGG
GCCTATCGCG TGACCCAGAC CATTCCGGTG GTGGCGGGCG CGAAGGAGCT GATCCTGCTT
TACCCGCAGT GGCTGCCCGG CAACCACGGG CCGCGCGGGC CGCTGGCCGA ACTCGTCGGC
GTGCAGTTCT TCGTCGACGG CAAGCCGGTG GAATGGAAGC GCGACCGGGT CGAGGTCTAT
GCCTTCCACG TCACCCTGCC GGCCGGCGCG AAGGAGGTCG TGGCCAAGCT GATCCACACC
TCGCCGCTGC AATCTTCGGA AGGGCGCATT ACGATGACAC CCGAGATGCT CAACCTGCAG
TGGGAGAAGA TGAGCCTCTA TCCCGCCGGT CACTATGTCC GCCGCATTCG CGTGAAGCCG
ACCGTAACCC TGCCGCAGGG CTGGACGCCC GCCACCGCGC TCGACGGAAT GAGCATGAGC
GGCAACCGCG CGACCTGGGC CGAAACCGAC TACGAGACGC TGGTCGATTC GCCGATCTTC
GCGGGCAAGA ACTTCCGCAA GTGGGACCTC GGGCAGAACG TCACGCTCAA CGTCGTCTCC
GACAAGCCGG AGCAGCTCGA GGCCAAGCCC GAGCATATCG CCGCGCACAA GGCGCTGGTC
GAGGAAGCGC GGATCGCGTT CGGCGCGAAC CATTTCGACC ACTACGAGTT CCTGCTGGCG
CTGTCGGACA AGATCGGCGG CATCGGGCTG GAACATCACC GGTCGAGCGA GAACCAGCTC
GAACCCGAGG CCTTCACCGA ATGGGCCAAG CAGGAATGGG ACCGCAATCT GCTGCCGCAC
GAGTATTCGC ACTCGTGGTC GGGCAAGTTC CGCCGTCCGT CGCGCCTGTG GACGCCCGAC
TATCGCCAGC CGATGCAGGG CGACCTGCTG TGGACCTACG AAGGGCAGGA CCAGTTCTGG
GGCGCGGTGC TGGCCAGCCG TTCGGGCATG CAGGGCAAGG ACATGGTGCT GGGCATGCTG
GCGGCATGGG CGGGCGGCTT CACCCAGCAG CCGGGCCGCG AATGGCGTTC GGTCGAGGAT
ACCGGGTTCG ACCCGGTCTT CGCCTCGCGC AAGCCGAAGC CCTACTCGTC GCTGGCCCGC
AACGAGGACT ACTACACCGA AGGCGCGCTG GTCTGGCTGG AAATCGACCA GATCCTTCGC
GAAGGCACCG GCGGCAAGAA GTCCATCGAC GACTTCGCCA AGTCGTTCTT CGGCATGAAT
CCGGGGGACT GGGGCCAGAT CCCGTTCGAG GTGGACGAGA TCGTCACCAA GCTGAACGCG
CTTTATCCCT ATGACTGGGC CAAGCTGATC GACACCCGCA TCAACCAGCC GGGCCAGCCC
GCGCCGCTGA ACGGGATCGA GAAGGGCGGC TACAAGCTGG TCTGGAAGGA AGAGCCCAAT
CCCTACATGA AGGCGGCGAT CGATTTCGGC AAAGGCCTGA GCCTTTCCAA CTCGATCGGC
ATTTCGCTCG ACAAGGACGG CAAGGTCACC GGCACGCGCT GGGACAGCCC GGCCTTCAAT
GCGGGAATCG TGACCGGTAC GCAGATCATG GCGGTGAACG GCACCGCCTA TAGCGCGGAT
GACCTCAAGA AGGCGATCAC CGCAGCCAAG GGTGACAAGG GCCAGCCGCT CGAACTGCTG
GTCAAGCGCG GCAGCCGGTT CGAGACCGTG AAGCTCGATT ACCGGGATGG CCTGCGTTAT
CCGTGGCTCG AGCGCGTGGC GCCGGGCAAG GCGCCGACCG GGCTCGACCT GCTGCTCGAA
CCCCGGCGCC CCGGCGCGGC GAAGAAGAAG TAA
 
Protein sequence
MFKTLAAAPL LLALSVSTQA LAANSAPMAV PIAQTVPDAQ DVAYPGTMTL DIDASDVVRG 
AYRVTQTIPV VAGAKELILL YPQWLPGNHG PRGPLAELVG VQFFVDGKPV EWKRDRVEVY
AFHVTLPAGA KEVVAKLIHT SPLQSSEGRI TMTPEMLNLQ WEKMSLYPAG HYVRRIRVKP
TVTLPQGWTP ATALDGMSMS GNRATWAETD YETLVDSPIF AGKNFRKWDL GQNVTLNVVS
DKPEQLEAKP EHIAAHKALV EEARIAFGAN HFDHYEFLLA LSDKIGGIGL EHHRSSENQL
EPEAFTEWAK QEWDRNLLPH EYSHSWSGKF RRPSRLWTPD YRQPMQGDLL WTYEGQDQFW
GAVLASRSGM QGKDMVLGML AAWAGGFTQQ PGREWRSVED TGFDPVFASR KPKPYSSLAR
NEDYYTEGAL VWLEIDQILR EGTGGKKSID DFAKSFFGMN PGDWGQIPFE VDEIVTKLNA
LYPYDWAKLI DTRINQPGQP APLNGIEKGG YKLVWKEEPN PYMKAAIDFG KGLSLSNSIG
ISLDKDGKVT GTRWDSPAFN AGIVTGTQIM AVNGTAYSAD DLKKAITAAK GDKGQPLELL
VKRGSRFETV KLDYRDGLRY PWLERVAPGK APTGLDLLLE PRRPGAAKKK