Gene Saro_3060 details


Gene Information

Locus tag: Saro_3060
Symbol:
ID: 3916674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3277719
End bp: 3279596
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 67%
IMG OID: 640445842
Product: signal peptide peptidase SppA, 67K type
Protein accession: YP_498329
Protein GI: 87201072
COG category: [O] Posttranslational modification, protein turnover, chaperones
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
    [TIGR00706] signal peptide peptidase SppA, 36K type
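
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
start_bp = 3277719
end_bp = 3279596

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the "Gene Length" (1878 bp) and "Protein Length" (625 aa) fields.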


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTTG CCCGTTCCGT GTGGAAGATC CTCGTCGCCA TCAAGGACGG TCTCGTCCTC 
CTGTTCCTGC TGCTGTTCTT TGCGGGCCTA TATGCCGTGC TCGCGATGCG CCCCGCGCCC
GGCATGGTGC GCGAAGGGGC GCTCTACCTG CCGCTCGAAG GCCGCGTAGT GGAAGAGGCG
ACCGAAATCT CGCCCGCCGA CTTCCTGACC GGCCAGACCC CGGCGCCGGA ATATGTCGAA
CGCGACCTGA CCCGTGCGAT CGAGGCTGCC GCCACGGACA AGCGCATCAA GGCCGTGGTA
CTCGATCTCG AATCCTTCGG CGGCGGCTCT GCCGTCCATC TTTCAAGCAT CGGCGCCGCG
ATGGACAGGG TCCGCGCCGC CAGGAAGCCG GTCTTCGTTC GCTCGCTCAT CTATACCGAC
GATGCGATGC AACTGGCCGC ACATGCGTCG GAAGCGTGGG TCGATCCGAT GGGCGGCGTC
GCGATCACCG GGCCGGGCGG CACGATGCTG TTCTATAAGG GGCTGATCGA CAAGCTGAAG
GCCAACGTCC ACATCTTCAA GGTCGGCACC TACAAGAGCG CGGTCGAACC CTACATGCGC
GCGCAATCCT CGCCCGAGGC GCGCGAGGCG ATGCAGGCGG TCTATGGCGC GATCTGGCAG
AACTGGCAGG ACGAAGTGCG CAAGGCCCGG CCCAAGGCCG ACCTTGCCCT CGCGACGACC
GACCCGGCGA AGTGGGTCGC AGACAATGGC GGCGATGCCG CGCAGGCCGC GCTGAAATCC
GGCCTCATCG ACCGCGTCGG TGACCGCATC GCTTTCGGTA AGCGCGTGGC CGAGGTTGTC
GGCGCGGACG ATCAGGGACC GCTGGGTTCG TTCAAGGCGA CCGAGCTTCC TGTCTATCTG
GCCGACAAGC CCGCCGCCAA GGACGGCAGT GCTATCGCGG TGGTGACCGT GGCCGGCGAG
ATCGTCGATG GTGATGCCGG ACCCGGCGTT GCGGGCGGCG ACCGTATCGC GGACCTCATC
GACGGCGTGA CACAGGCCGA CGACTACGCC GGCCTCGTGC TGCGCGTCGA TTCGCCCGGC
GGCTCGGTCA TGGCCTCGGA ACGCATCCGG GCGGCGGTGG AGCGGGTCAG GGCCAAGGGC
CTGCCGGTGG CGGTTTCGAT GGGCAGCGTC GCGGCCAGCG GCGGCTACTG GGTTTCGACG
CCCGCGCAGC GCATCTTTGC AGAGCCCAGC ACGATCACTG GTTCGATCGG CGTGTTCGCC
GTCCTGCCGA CGTTCGAGCA GACCCTGCCG CAGTATGGGG TGACGACCGA ACAGGTCCGC
ACCACCCCGC TTTCCGGCCA GCCCGATCTC CTCGGTGGTC TCAACCCTCA GGTCTCGGCG
CTCATGCAGG GGCAGGTCGA ACAGACCTAC ACCCGCTTCC TCGGCCTCGT CGCCAAGGCG
CGCGGCAAAT CCCCTGCCGA CATCGACAGG ATCGCGCAGG GACGCATCTG GGACGGCGGT
ACTGCACGCC AATTGGGCCT GGTCGACCAG TTCGGCGGGC TGGACGACGC GGTCGCATGG
GTCGCCAAGC AGGCCAAGGC CGACAAGTGG CACGCCGAAT ACGTCGAGGA CGAACCCAGC
CCTGTCGCCC AGTTCCTCCG CCAGATGGAA ACCGGCGAGG CACCCGAGGC AAGAGCGCAC
GATCTTGCAG GCGCGCTGGC AGCGCAGCAG CAGGGCCTTG TCGCGCGCAT GCAGGCGGAC
CTGCTGCGGC TGATCGAAGG CGGCGGGATC AAGGCTTATT GCCTGGAGTG TGCAAGCGAT
GATCGGGGCA CCGCGAATAT GCGGCAGCGG AAGGCCGATT TGGGGCTTCT GGCCTGGCTC
GGACGGCTGG TTGCCTGA
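
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be normalized before any computation such as the "GC content" field. The sketch below shows one way to do that. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first row of the sequence is pasted in as sample input; reproducing the listed 67% would require pasting the full sequence.

```python
def normalize(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    bases = normalize(seq)
    if not bases:
        return 0.0
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied as displayed (blocks of ten bases).
first_row = "ATGAGCTTTG CCCGTTCCGT GTGGAAGATC CTCGTCGCCA TCAAGGACGG TCTCGTCCTC"

print(len(normalize(first_row)), "bases in the first row")
print(f"GC fraction of the first row: {gc_fraction(first_row):.2%}")
```

The figure for a single row will generally differ from the whole-gene value, since GC content varies along the sequence.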
 
Protein sequence
MSFARSVWKI LVAIKDGLVL LFLLLFFAGL YAVLAMRPAP GMVREGALYL PLEGRVVEEA 
TEISPADFLT GQTPAPEYVE RDLTRAIEAA ATDKRIKAVV LDLESFGGGS AVHLSSIGAA
MDRVRAARKP VFVRSLIYTD DAMQLAAHAS EAWVDPMGGV AITGPGGTML FYKGLIDKLK
ANVHIFKVGT YKSAVEPYMR AQSSPEAREA MQAVYGAIWQ NWQDEVRKAR PKADLALATT
DPAKWVADNG GDAAQAALKS GLIDRVGDRI AFGKRVAEVV GADDQGPLGS FKATELPVYL
ADKPAAKDGS AIAVVTVAGE IVDGDAGPGV AGGDRIADLI DGVTQADDYA GLVLRVDSPG
GSVMASERIR AAVERVRAKG LPVAVSMGSV AASGGYWVST PAQRIFAEPS TITGSIGVFA
VLPTFEQTLP QYGVTTEQVR TTPLSGQPDL LGGLNPQVSA LMQGQVEQTY TRFLGLVAKA
RGKSPADIDR IAQGRIWDGG TARQLGLVDQ FGGLDDAVAW VAKQAKADKW HAEYVEDEPS
PVAQFLRQME TGEAPEARAH DLAGALAAQQ QGLVARMQAD LLRLIEGGGI KAYCLECASD
DRGTANMRQR KADLGLLAWL GRLVA
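
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below is a minimal illustration, not the database's own method. It assumes the standard codon assignments, which to the best of my knowledge are also the ones used by translation table 11 (that table differs mainly in which start codons it allows). The function name `translate` is hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; stop codons are rendered as '*'."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence, copied as displayed.
first_row = "ATGAGCTTTG CCCGTTCCGT GTGGAAGATC CTCGTCGCCA TCAAGGACGG TCTCGTCCTC"

print(translate(first_row))
```

The 60 bases of the first row give 20 residues, which correspond to the first two blocks of the protein sequence (MSFARSVWKI LVAIKDGLVL).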