Gene Saro_0005 details


Gene Information

Locus tag: Saro_0005
Symbol:
ID: 3916047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3946
End bp: 5124
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 67%
IMG OID: 640442730
Product: peptidase M48, Ste24p
Protein accession: YP_495288
Protein GI: 87198031
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0501] Zn-dependent protease with chaperone function
TIGRFAM ID:
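
The coordinates and lengths listed above are consistent with one another, and the following Python sketch checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3946, 5124)
print(length)                  # 1179
print(protein_length(length))  # 392
```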


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTCG ATCCTGCAAC CGAAGCGGCG CGGCTGGTAG ACAGTCTTGG CGCCGAGCAA 
CTGGCGCTGG CGGCGTCCTA CACGCGTGGC AACGAATGGC TGACCGTGGC CGGCGTAGCC
ATATCGCTGG TGCTAGCGTG GATCATCGTG CGCCTGCGTG TGCTGGACTG GCTGGCGGCC
AGGGTCGCCC GCTGGCCGCG CGTAGCGGCG ACATTCACGG TATCCTTCGG CTTCTTCCTG
ATCGCCGACG TGCTGCGCCT GCCCGTCACC GTCTGGACCG ACTGGTGGCG CGAAAAGTCC
TACGACATGA CCGACCAGCC GCTGGGCGAT TTCCTGTTCC AGTACGCGCT CGCCGGAGTG
ATCGAGTTCG CGATAAGCGC GATCTTCGTG GTGGGCGTGT TCTGGCTGGT GCGGCGATCG
CCGCGCCGCT GGTGGCTGTG GACGGGCGGA CTGGCCGGAG GCGGAGCGGC GGCGTTGCTA
CTGCTTGGCC CGGCGTTGAT CCAGCCGATG TTCAACACCT TCCAGCCAGT GCCTCCGGGG
CAGGTGCGCA CCGCGCTCGA GGCGATTGCG GATGACGTTG GCATCCCGCA CGACCGCATC
TTCATGTACG ACGGCTCCCG CCAGTCAGCC AACTTCACCG CGAACGTGTC GGGCATCGGC
CCTGCGGCCC GGATCGCTAT TGCCGACGTC GCGCTCAAGT CCGCCTCGCT CGACGAAGTG
CGCGCCGTGA CCGCGCACGA GGCGGGACAC TACAAGCTGG GCCACGTCTG GCGGCACCTC
GTTGTCATGC CGCTGATCGC GGTGCTGGTG GCATTCCTCA TCGGCCGACT CTACCCGTGG
ACGGCGCAAA GGCTGGGCGC CACGGCGCCG CTGGGCGATC CTGTGGGGCT GCCGGTGTTC
ATGGCGCTCG TATCTGTCCT GACGCTGTTC ACCCTGCCCG CGGTCAACAG CCTCACCCGC
ATGGGCGAGG CAGAGGCCGA CGCCTTTGCC ATGCAGACAG TGGGCCTGCC CGACGCCATG
GCGGGAGCCT TGCTCAAGAC CGCCGAATAC CGCTATCCCC GGCCACATCC GCTGGAGGAA
GCGATCTTCT ATACCCACCC TTCGGTCGAA CGTCGCATCG AGGCGGCGAT GGCCTGGAAG
GCAGGGCGTG GCGGTGCGAA CATGACCGGG GTCCAATGA
 
Protein sequence
MALDPATEAA RLVDSLGAEQ LALAASYTRG NEWLTVAGVA ISLVLAWIIV RLRVLDWLAA 
RVARWPRVAA TFTVSFGFFL IADVLRLPVT VWTDWWREKS YDMTDQPLGD FLFQYALAGV
IEFAISAIFV VGVFWLVRRS PRRWWLWTGG LAGGGAAALL LLGPALIQPM FNTFQPVPPG
QVRTALEAIA DDVGIPHDRI FMYDGSRQSA NFTANVSGIG PAARIAIADV ALKSASLDEV
RAVTAHEAGH YKLGHVWRHL VVMPLIAVLV AFLIGRLYPW TAQRLGATAP LGDPVGLPVF
MALVSVLTLF TLPAVNSLTR MGEAEADAFA MQTVGLPDAM AGALLKTAEY RYPRPHPLEE
AIFYTHPSVE RRIEAAMAWK AGRGGANMTG VQ
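
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content is the fraction of G and C bases in the gene. The Python sketch below shows one way to reproduce both from the space-separated blocks above. It assumes that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons, which is harmless here because the gene starts with ATG. The helper names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first position varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final nine bases of the gene sequence.
print(translate("ATGGCACTCG ATCCTGCAAC CGAAGCGGCG"))  # MALDPATEAA
print(translate("GTCCAATGA"))                         # VQ
```

Passing the full gene sequence to translate and gc_content should allow the listed protein sequence and GC content to be compared against the record.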