Gene Saro_0990 details

Gene Information

Locus tag: Saro_0990
Symbol:
ID: 3915772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1031323
End bp: 1034208
Gene Length: 2886 bp
Protein Length: 961 aa
Translation table: 11
GC content: 66%
IMG OID: 640443724
Product: peptidase M16-like
Protein accession: YP_496269
Protein GI: 87199012
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
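
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. This is a minimal sketch that assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1031323, 1034208)
print(length, protein_length(length))  # prints: 2886 961
```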


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAATTC GTACCACGCG CGCCCGGCGC ATAGGCCTGC TTCTCCCGCT CCTGCTGCTG 
ACTGCCGCGC CGATTGCGGC GCGCGCACCG GAAAGCGCCG GCGTTCCCTG GCTGTACAAG
GGGTCGGACG TTCCGCAGGA CAAGGCCTGG ACTTTCGGCG TCCTGCCCAA CGGCATACGC
TATGCCGTGC GCCACAACGG CGTCCCGCCC GAGCAGGTCT CCATTCGCGT CCTCGTCGAT
GCCGGCTCGA TGTACGAGAC CGAAAGCCAG CGCGGCTATG CGCACCTGAT CGAGCACCTG
ACCTTCCGTG AATCGAAATA CCTCAAGGAA GGCGAGGCGA TTCCGACCTG GCAGCGCCTT
GGCGCGACCT TCGGCAGCGA CACGAACGCC GAGACCAGCC CGACGCAGAC CGTCTACAAG
CTCGACATCC CGAACGCGAC CGATGCCAAG CTGGACGAGA CGTTCAGGCT CCTGTCCGGA
ATGATTACCG CGCCGATCTT CACCGACCAC GGCGTGAAGA CCGAAGTCCC CATCGTGCTT
GCCGAAATGC GCGAGCGCAC GAGCCCGCAA TCGCGCGTGC TGGACGAGAC GCGTGGCCTG
TTCTTCAAGG GGCAGCTTCT TGCCTCGCGC AATCCCATCG GCACGGTGCA GACGCTCGAG
GCGGCGAACG CGGCCGCGGT CAAGGCATTC CACGACAAGT GGTACCGGCC CGACAACACC
GTGATCGTGG TCGCCGGCGA TGCCGATCCC GCTGCTCTGG TGGCACGGAT CAAGCAGTCG
TTCGGCGGCT GGAAGGCCAC CGGCAAGAAG CCGCTTCAGC CGGATTTCGG CAAGCCTCTG
GCGCCTGCCG GTGCGGACCC GAAGAATCCG GTTGGCGAGG CAAAGGTGCT CGTCGAACCC
GATCTTCCGC GCATCATCAA CTGGGCGATC CTGCGTCCAT GGGTCAAGGT CAACGACACG
ATCCAGTACA ACCAAGGGCT GATGATAGAC CGGCTGGCGC TGGCCTTGAT CAATCGCCGG
CTTGAAGCCC GCGCCCGGGG CGGCGGCAGC TATCTGGTCG CATCGGTTGA CGAGATGAAG
CAGGAACTGT CACGCTCGGC CGATGCTACG GTCGTGACCG TGACACCGCT TGGCGAGGAC
TGGAAGGGCG CGGTCAAGGA CGTGCGAGCG GTCATTGCCG ATGCCCTTGC CACGCCCCCC
TCGCAGGAGG AGATCGACCG CGAGGTCGCC GAGTTCGAGG TGGCCTTCAA GGTCTCGGTC
GAGACGCAGA CAACAATCGC CGGATCGAAG GCGGCGGACG ACATCGTCAA TGCTGTCGAC
ATCCGCGAGA CAGTCGCCAA TCCCGACACG GTCTACGACA TCTTCAAGCG GTCGATCCCG
CTGTTCAGGC CCCAGGCGGT GCTCGATCAC ACGCGCGGCC TGTTCAAGGG GACGGTCGTC
CGTCCGCTCA TGATCACGCC GAAGGCCGGG GAGGCCGACG AGGCCTCGCT TCGAGCAGCG
CTGACGGCCC CGGTCGATGC CGCATCGGGA AGCCGCGTCG CGGCAAACGG CCTCAAGTTC
TCGGACCTGC CGGCTGTCGG CGTGCCCGGC ACGGTCGCCA CGGCGCGGCC GATCGGCTTG
CTCGGCATCG AGCAGATAGA ACTGTCTAAT GGGGTCAAGG TCCTGCTCTG GCCGAACGAC
GCCGAGCCGG GCCGGATCAT CATCAAGGCG CGCTTCGGTG GCGGCTATTC GGCCATCTCC
CCGCAGGACG CGGTCTATGG GCCGCTCGCG GAGGTGGCGC TGATGGACAG CGGCATCGGC
GAACTCGGGC GCGACGATCT CGATCGCCTG GCCACCGGCC GCAAGCTCAG CCTCGATTTC
GACATCGACG ATACTGCGTT CGTCATGTCC TCGGACACGC GCCCGGCGGA CCTTCAGGAC
CAGCTCTATC TCATGGCGGC CAAGCTTGCG ATGCCGCGCT GGGACCCGAA CCCGGTGCTG
CGCGCCAAGG CGGCGGCGAA GCTCCAGTAC GAAAGCTACA ACTCCGCCCC GATGGCCGTG
CTCAACCGCG ACCTGACGTG GCTGCTGCGC GATGGCGATC CGCGTTATGC CACGCCCAAC
CCGGCGGAGC TTGACCGGGC GACCCCCGAA GGCTTCCGCA AGACCTGGGA GCAACTCCTT
GCGCAGGGGC CGATCGAGAT CGACATGTTC GGCGATTTCA CGCGCGAACA GGCGCTTGCG
GCGCTTGAAA AGACCTTTGG CGCGCTTCCG GCGCGACAGC CGGCGCCTGC CTCCACGCTG
GCGCCGTCGA TCCCGGCGCA CAATGCCGAA CCCCTTGTGC TGACGCATCG CGGCGATCCC
TCGCAGGCTG CCGCTGTCGT TGCATGGCCC ACGGGCGGGG GGCAGGCGGG GGTCCGCGAG
AGCCGCCAGC TCGAGATTCT CGCCCAGATC TTCAACAATC GCCTGTTCGA CGCGATGCGC
GAGAAGGTGG GGGCGAGCTA CGCGCCGCAA GTGGGATCGA GCTGGCCGCT TGATCTGCCC
TCGGGCGGCT ATATCGCGGC GACCGTGCAA GTGCGTCCGG GCGATTTCGA GACCTTCTTT
GCCGCGGCCG ACAAGATCGC CGCCGACCTC GTCGCCACGC CTCCGACTGC CGACGAGATC
GCGCGCGTTA CCGAACCGCT CAAGCAGCTC ATCACCCGCG CCAGCACCGG CAACGGCTTC
TACATGTTCC AGCTTGAAGG GGCCGCCAAC GATCCGCGCA AGATCGCCGC GATCCGGACC
ATCCTCAACG ACTACAGCCA GACCACGCCT GAACGGATGC AGGCGCTGGC CGAGCGTTAT
CTGCGCAAGG ACAAGAGCTG GCGGCTCGAG GTGGTGCCGG AAAAGGCGGG GCAGGCGACG
CCCTGA
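
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before its length or GC content can be checked against the fields listed earlier. The sketch below shows one way to do that; the helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence, so its GC value is not expected to match the figure reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, for demonstration only.
demo = clean_sequence("ATGCGAATTC GTACCACGCG")
print(len(demo), gc_fraction(demo))
```

Pasting the complete gene sequence into `clean_sequence` should yield a string whose length can be compared with the reported gene length.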
 
Protein sequence
MRIRTTRARR IGLLLPLLLL TAAPIAARAP ESAGVPWLYK GSDVPQDKAW TFGVLPNGIR 
YAVRHNGVPP EQVSIRVLVD AGSMYETESQ RGYAHLIEHL TFRESKYLKE GEAIPTWQRL
GATFGSDTNA ETSPTQTVYK LDIPNATDAK LDETFRLLSG MITAPIFTDH GVKTEVPIVL
AEMRERTSPQ SRVLDETRGL FFKGQLLASR NPIGTVQTLE AANAAAVKAF HDKWYRPDNT
VIVVAGDADP AALVARIKQS FGGWKATGKK PLQPDFGKPL APAGADPKNP VGEAKVLVEP
DLPRIINWAI LRPWVKVNDT IQYNQGLMID RLALALINRR LEARARGGGS YLVASVDEMK
QELSRSADAT VVTVTPLGED WKGAVKDVRA VIADALATPP SQEEIDREVA EFEVAFKVSV
ETQTTIAGSK AADDIVNAVD IRETVANPDT VYDIFKRSIP LFRPQAVLDH TRGLFKGTVV
RPLMITPKAG EADEASLRAA LTAPVDAASG SRVAANGLKF SDLPAVGVPG TVATARPIGL
LGIEQIELSN GVKVLLWPND AEPGRIIIKA RFGGGYSAIS PQDAVYGPLA EVALMDSGIG
ELGRDDLDRL ATGRKLSLDF DIDDTAFVMS SDTRPADLQD QLYLMAAKLA MPRWDPNPVL
RAKAAAKLQY ESYNSAPMAV LNRDLTWLLR DGDPRYATPN PAELDRATPE GFRKTWEQLL
AQGPIEIDMF GDFTREQALA ALEKTFGALP ARQPAPASTL APSIPAHNAE PLVLTHRGDP
SQAAAVVAWP TGGGQAGVRE SRQLEILAQI FNNRLFDAMR EKVGASYAPQ VGSSWPLDLP
SGGYIAATVQ VRPGDFETFF AAADKIAADL VATPPTADEI ARVTEPLKQL ITRASTGNGF
YMFQLEGAAN DPRKIAAIRT ILNDYSQTTP ERMQALAERY LRKDKSWRLE VVPEKAGQAT
P
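
The protein sequence can be spot-checked against the gene sequence by translating codons directly. The sketch below relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; it does not handle the alternative start codons that table 11 allows, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGCGAATTC GTACCACGCG CGCCCGGCGC"))  # prints: MRIRTTRARR
```

The printed residues match the first ten residues of the protein sequence, and the last four codons of the gene (GCG ACG CCC TGA) translate to ATP followed by a stop, matching its final residues.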