Gene Saro_1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1837 
Symbol 
ID3918397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1935151 
End bp1937463 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content70% 
IMG OID640444579 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_497111 
Protein GI87199854 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00570496 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGCGT GGAAGTTCGC GGGGCAATCG GCCCTTTTGG TTCTGGCCTT GGCCCTGACA 
GGCTGCGGTG GAGGCGGCGG AGTAGCGAGC ACGCCGACGC CGACGCCGAC GCCCAGCCCG
ACCCCATCGC CGGCCCCCAC GCCAACGCCG ACGCCCACGC CCACGCCCAC GCCTACGCCG
GTTTCCTACA ATACCGCCGA ATATCGCAAT TCGGACGGCC CCGTGTTCCA CGGCGCGATT
ACCGCGTGGG AAGCGGGGGC ATCCGGCGCG GGGACGACGA TTGCCATCGT GGACAGCGGG
ATCGATACCG CCAACAGCGA GTTCGCGGGC CGGATCTCGT CAGCATCGGC GGACGTCGCC
GGATCGCGCG GCGTAAGGAA CACCGGGGAC GATCACGGAA CGCAGGTCGC GCTCGTCGCG
GCGGCAGCGC GCAACGATTA CGGAGTGATG GGCATCGCCT GGGGCGCCAC AGTACAGATG
CTGCGCGCGG ATACACCCGG AACCTGCGCA AGTGCGGATG GGTGCACATT CTCCGACACG
AACATCGCCG AAGGAATCGA CGCAGCGGTT GGAGCGGGCG CCCGGGTGGT CAATCTCTCG
CTTGGTGGCG AAGCGGCGGA TACAAACCTG CGCGCCGCCG TCGCCCGCGC CACGTCCGCC
GGGCTGATCG TGGTGGTATC AGCCGGCAAC GACGGGCAGA GCACCGATCC GGCCATTGAC
CCGAACAACC CCGACCCCTT CGCGGTGTCG CTGCTCCAGG CAGGCGGTGG CAACGTGATC
ATCGTCGGAT CGGTCAGCTC CGCCGGGCTG GTCTCGGACT TCAGCAACCG TGCAGGCTCG
TCGGCATCCT CGTTCCTCAT GGCCCAAGGC GAGACGATCT GCTGCGTGTA CGAAGGCAGC
CAGATCCGCG TGGTCGACGG TTTCCGCACG GCCGTGAATG GAACAAGCTT CGCCGCGCCC
CAGGTCTCGG GCGCGGTCGC TCTTCTCGCC CAGGCGTTCC CGAACCTCAC GGGCGCGCAA
ATCGTCAGCC TCCTGCTGAG TACGGCGCGC GAGGGCGGGG CAATCGGCAC CGACCCTATC
TACGGGCGCG GCATTCTCGA TATCGCGCGA GCCTTTTCGC CGCAGGGTAG CACTGGCCTC
GCCGGAACCA GCGTCGCCGT CTCGCTCGAC AGCGTTGCGG GCACGACTTC GGGCGCGATG
GGCGATGCGG TTTCGGGGGC CTCGCTCGGC ACGGTCATGC TCGACGGCTA CGGCAGGGCC
TATGCTCTCG ATCTCGGACG GGGGCTCCGT TTCGGTCAGG CGCGTCAGCC CTTGCTGGAG
GCGCTGCGGG CCGGGGCGCG GCCAACCGGG TTCGAGGCCG GCGACGTGTC GCTCGCCTTC
TCCGTGGACC GGCGGTTCGG GGCCGTGCCG TTGCGCCTTG CGCCGGGCGA GCGGGAGCAG
GCTCGCGTGC TGGCATCGAC GCTGGTGACG CGGATCGGGA ACGCGCGCGA ACTCGCGCTC
GGATGGAACA CATCCTCCGA TGCGCTGGCG GCACGTTTGC AGAAACGGCG CGAGGCGCAG
TTCCTCGTCG CCGGTACGGC CAATGGCATG TTCGAGCGTC CCGAGCTTGG CTTCGCGGCA
CGGCAGAGGT TCGGCGGCAT GGGCGTCACC GTCAGCGGCG GTCGCAGCCT TGTCTGGCGG
CCCGAAGCAT TGCGCGACCG TCGCGAGGAC AGGCTGGCGC GCCTTGGCGT GGCGTTGGAT
GGCCACGGGG GCGATGCCCT CGACTGGCGG CTTGGCCTGG GCCTCATGCG CGAGGAGCGT
ACCGTGCTGG GCGCACGCTT CGCCGATGCG CTGGGCGGCG GCGGTGCCAC GACCGGCACG
ATCGAGCCCG GCGTTACCTG GCGGCCGATG CGCGGATGGC ACGTGGGCGC AACGGGCAGC
CTGGGCTTCA CCCGGGTCGC GGGAACCTCT GTGGCGCCCG GCGGGTCGAG GCTGGTGTCG
TCGTCGTGGG CGTTCGACGT TGCGCGCGAT GGCGCCCTGA TCGAGGGCGA CCGCCTTGGC
CTGCGCCTGT CGCAGCCGTT GCGGGTGGAA AGCGGCGGGT TGGCGCTGAA CCTTCCGGTC
GCTTGGGACT ATGCGACGCA GACTGCCCGT TTTGGCCGCG TCCCTTTATC CCTTGCGCCA
AAGGGGCGCG AGCTTGACGC GGAAGTGGCG TGGTCGGCGC GGCTGTGGGA CGGCGCCTTT
TCCGCCAGCC TGTTCTGGCG GCGCGATCCG GGGCACTTCG CCGCCGCGCC CGACGATGCC
GGCGGGGCGG TGCGCTGGAC GGCGGGGTTC TAG
 
Protein sequence
MIAWKFAGQS ALLVLALALT GCGGGGGVAS TPTPTPTPSP TPSPAPTPTP TPTPTPTPTP 
VSYNTAEYRN SDGPVFHGAI TAWEAGASGA GTTIAIVDSG IDTANSEFAG RISSASADVA
GSRGVRNTGD DHGTQVALVA AAARNDYGVM GIAWGATVQM LRADTPGTCA SADGCTFSDT
NIAEGIDAAV GAGARVVNLS LGGEAADTNL RAAVARATSA GLIVVVSAGN DGQSTDPAID
PNNPDPFAVS LLQAGGGNVI IVGSVSSAGL VSDFSNRAGS SASSFLMAQG ETICCVYEGS
QIRVVDGFRT AVNGTSFAAP QVSGAVALLA QAFPNLTGAQ IVSLLLSTAR EGGAIGTDPI
YGRGILDIAR AFSPQGSTGL AGTSVAVSLD SVAGTTSGAM GDAVSGASLG TVMLDGYGRA
YALDLGRGLR FGQARQPLLE ALRAGARPTG FEAGDVSLAF SVDRRFGAVP LRLAPGEREQ
ARVLASTLVT RIGNARELAL GWNTSSDALA ARLQKRREAQ FLVAGTANGM FERPELGFAA
RQRFGGMGVT VSGGRSLVWR PEALRDRRED RLARLGVALD GHGGDALDWR LGLGLMREER
TVLGARFADA LGGGGATTGT IEPGVTWRPM RGWHVGATGS LGFTRVAGTS VAPGGSRLVS
SSWAFDVARD GALIEGDRLG LRLSQPLRVE SGGLALNLPV AWDYATQTAR FGRVPLSLAP
KGRELDAEVA WSARLWDGAF SASLFWRRDP GHFAAAPDDA GGAVRWTAGF