Gene Saro_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1904 
Symbol 
ID3917125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2017265 
End bp2018746 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content64% 
IMG OID640444648 
Productpeptidase M48, Ste24p 
Protein accessionYP_497178 
Protein GI87199921 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.429469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTG CCCGCCGTTC GCGCGCCGCT TTCCTGACCG TCACCGCAAT CGTCGTCGCC 
GCATTGCCGG CGTGTGCCAG CCAGACCGGC AGTCCCGCAA CCGCGCAGGG CACGGCCCAG
GCGATCAGTG CGGCTGACAG AAAGCAGGGG GCAGAAGCCC ATCCGCAGCT TCTTCAGGAA
TTCGGCGGAG CGATGACCGG GCCGCAGGCT GCCTACGTCG AAACCGTGGG CAAGAACATC
GCTGTCCAGT CCGGCCTCAG CAACGCGCGA GGCGATTTCA CGGTCACCCT GCTCAATTCG
CCGGTCAACA ACGCCTTCGC GATTCCCGGC GGCTACGTTT ACGTCACACG CCAGCTCGCA
GCGTTGATGA ACAACGAGGC AGAGCTTGCC GGCGTGCTCG GACACGAGGT TGGTCACGTT
GCCGCAAGGC ACGCAGCCAA GCGGCAAAGC ACCGCAACGC GCAATTCCCT TCTCGGTGCG
CTCGGCACGA TCCTTTCCGG GGCGATTCTC GGGAACAGTG CGCTGGGCCA GCTCGGCCAG
CAGATTTTCT CGACCGGCAG CCAGCTACTG ACGCTGAAGT ATTCGCGCGG GCAGGAGACC
GAGGCGGACA ATCTCGGCAT CACCTATCTC CAGCGTGCCG GATACGATCC GCGCGCGATG
TCTACCGTTC TGCAGAGCCT GGCCAACCAG AATGCACTCG ACGCCAGCCT CAAGGGCACG
TCGAACCAGG TTCCGGAGTG GGCCAGTACC CACCCGGATC CGGCCTCGCG CGTGCGCGCC
GCGCTGACCC GGGCCGGTTC GACGCCGGGC CGCACGAATC GGGACGCCTT TCTCGCGGGG
ATCAACGGCG TGACCTATGG CGATGACCCT GCGCAGGGCA TCGTCGATGG ATCGCGCTTC
ACCCATCCGG GCTTCCGCAT GGCATTCCAG GCGCCAAACG GGTTCTATCT GGTAAACGGC
ACCCGCGCTG TCTCGATCGC GGGGCAGAGC GGCAAGGGCG AACTGGCGAC CGCTACCTAC
AGCGGCAATC TGTCGACCTA CGTAAAGACG GTGTTCGCCG GCCTTTCCAG CCAGCAGCAG
CTTCAGCCGG ATTCGATTCA GACGACGACG GTGAATGGCA TCCCGGCAGC CTTTGGCACC
GCCCGCGTCA ACAGCGGCAA TGGTCAGGTC GACGTTGTCG TCTTCGCCTA TGAATGGGGG
CCGCAGCAGG CGTTCCATTT CTCAACGATC AGCCAGGCCG GGCAGTCGGG GCAGTTCGAT
TCGATGTTCC GTTCGATGCG GCGGATCAGC GCCAGCGAAG CGACTGCGGT CAAGCCGCGC
AAGCTTTCGG TCGTGACTGT AAGGTCGGGC GACACCCTGC AGAGCCTGGC GAGCAGGATG
GCCTATACCG ACAAGCCGAT GGAACGTTTC CTCGTGCTCA ACGGGCTCAC CTCGTCGAGC
AGGCTGACGG CGGGGCAGAA GGTCAAGCTA GTGGTCTATT GA
 
Protein sequence
MPFARRSRAA FLTVTAIVVA ALPACASQTG SPATAQGTAQ AISAADRKQG AEAHPQLLQE 
FGGAMTGPQA AYVETVGKNI AVQSGLSNAR GDFTVTLLNS PVNNAFAIPG GYVYVTRQLA
ALMNNEAELA GVLGHEVGHV AARHAAKRQS TATRNSLLGA LGTILSGAIL GNSALGQLGQ
QIFSTGSQLL TLKYSRGQET EADNLGITYL QRAGYDPRAM STVLQSLANQ NALDASLKGT
SNQVPEWAST HPDPASRVRA ALTRAGSTPG RTNRDAFLAG INGVTYGDDP AQGIVDGSRF
THPGFRMAFQ APNGFYLVNG TRAVSIAGQS GKGELATATY SGNLSTYVKT VFAGLSSQQQ
LQPDSIQTTT VNGIPAAFGT ARVNSGNGQV DVVVFAYEWG PQQAFHFSTI SQAGQSGQFD
SMFRSMRRIS ASEATAVKPR KLSVVTVRSG DTLQSLASRM AYTDKPMERF LVLNGLTSSS
RLTAGQKVKL VVY