Gene Saro_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0294 
Symbol 
ID3916231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp316744 
End bp318090 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content70% 
IMG OID640443023 
Productmicrocin-processing peptidase 1 
Protein accessionYP_495576 
Protein GI87198319 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTCAC CCGCCGAAGC CCGTGAGCGC TGCGAAGCGC TGGTCGAACG GGCACGCCGC 
GCCGGTGCAG ATGCAGGCGA TGCCGTCTAC ATCGCCAGTG GCTCCGAATC GGTGCAGGTC
CGCCTCGGCG CGCTCGAGGA CGTGGAACGC TCCGAATCCG AGCACATCGG CCTGCGCGTC
TTCGTCGGCG GCGCTTCAGC CTCCATCGGA TCGACCGACC TCGGCGATGC CGCGCTTGAC
GAACTGGCCA GCCGCGCGGT TGCCATGGCC CGACTGGCCC CCGCCGACAA GTTCGCAGGC
CTCGCGCCCG AAGACATGCT GTTTCGCGGC CCTGTGCCCG ATCTCGACCT CGACGACGCC
ACCGAACGCA GCCCCCAGGA CCTGCGCCGC CTGGCCGAGG AAGCGGAAGA TGCGGCGCGC
GCCATCGCGG GCGTGACCAA CAGCGAGGGC GGAAGCGCCA GCGCGGGGCG CGGCCTTTTC
GCACTTGCCA CCAGCCACGG CTTTTCCGGC GCCTACGCCG CATCGAGCCA CAGCATTTCC
GCCAGCGTCG TTGCCGGCGA AGGCAGCGCG ATGCAGCGCG ACTATTCCTG GCGCAGCACG
CGCCACGCGG CAGACCTGCT GCCCCCGGCC CGGATCGGCG CTGAAGCGGG CGAGCGCGCG
GTCCGCCGCC TCAACCCCGG TCGGGTGAAG AGCGGCCAGG TGCCCGTCGT GTTCGACCCG
CGCGTCGCCA ACAGCCTTGT CGGACACCTC CTCGGCGCCA TGTCGGGTGC ATCGATCGCC
CGCCGCGCCA GCTTCCTTCT GGACCGGGAC GGCGCCCAGC TGTTCGACAG CGCGATCACC
ATTTCGGACG ACCCCCTGTC CATTCGCGGC ATGCGCTCGC GCCCGTTCGA CGGCGAAGGC
CTGCCAACCG CGCCGCGCAA GCTGGTGGAC GCGGGCAAGC TGACCGGCTG GCTGATGGAT
ACCGCCGCTG CCCGGCAACT CGGCAGCCGC CCCACCGGCC ACGCATCGCG CGGGGCGTCC
GGCGCGCCGC ACGTCACCGC GAGCAACGTG GTCCTCGAAC CCGGCACGGT GACCCCGGCT
GAACTGATGG CCGACATCGC CGACGGGGTC TATGTGACCG AACTGATCGG CCAGGGCGTG
AATGCCGTCA CGGGCGACTA CAGCCGCGGC GCATCGGGCT TTCGGATCGT GAACGGCGAA
ATCGCCGAGG CGATTGCCGA ATTCACCGTG GCAGGCAACC TCATCGACAT GTTCGCCGCG
CTTACCGCAG CCAACGATCT CGAAGTCTAT CGCGGCATCG ACACGCCGAC CCTGCGCGTG
GACGGGATGA GCATCGCCGG CGACTGA
 
Protein sequence
MLSPAEARER CEALVERARR AGADAGDAVY IASGSESVQV RLGALEDVER SESEHIGLRV 
FVGGASASIG STDLGDAALD ELASRAVAMA RLAPADKFAG LAPEDMLFRG PVPDLDLDDA
TERSPQDLRR LAEEAEDAAR AIAGVTNSEG GSASAGRGLF ALATSHGFSG AYAASSHSIS
ASVVAGEGSA MQRDYSWRST RHAADLLPPA RIGAEAGERA VRRLNPGRVK SGQVPVVFDP
RVANSLVGHL LGAMSGASIA RRASFLLDRD GAQLFDSAIT ISDDPLSIRG MRSRPFDGEG
LPTAPRKLVD AGKLTGWLMD TAAARQLGSR PTGHASRGAS GAPHVTASNV VLEPGTVTPA
ELMADIADGV YVTELIGQGV NAVTGDYSRG ASGFRIVNGE IAEAIAEFTV AGNLIDMFAA
LTAANDLEVY RGIDTPTLRV DGMSIAGD