Gene Saro_1675 details


Gene Information

Locus tag: Saro_1675
Symbol:
ID: 3916250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1757069
End bp: 1758697
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 65%
IMG OID: 640444416
Product: tetratricopeptide TPR_4
Protein accession: YP_496949
Protein GI: 87199692
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
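
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1757069
end_bp = 1758697
gene_length_bp = 1629
protein_length_aa = 542


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1629
print(expected_protein_length(gene_length_bp))  # 542
```

Under these assumptions both results agree with the listed Gene Length and Protein Length.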


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.184674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTGA TGCTGGCAAT GGCCAACGTC ACCACTTCCG CGCCCCGCGC CGTCGCACCC 
GCCTATGACC CGCGCCTGGT AAGGGCCGCC GTCGCCATGA ACGAGAACGA CCTGCCCACG
GCCGAACCCT TGCTGCGCGC CCTGCTGAAG GACGATCCGT TCGATGTCAG GGCGATCCGG
CTCTTTGCCG AACTGGCCGG GCGGATCGGG CGCTATCAGG ATGCGGAAAA CCTCCTGCGC
CGGGCGATAG AACTGGCGCC GCAGTTCACC GCCGCGCGCG CCAACCTCGC GCTCGTGCTA
TATCGCACGA ACCGCGCGCC CGAGGCGCTT GAAGAGCTCG CCAAGGTGAC CGCCGATGAT
CCCGAGAACG TCGGACATGC CAATCTTCAG GCCGCCGCCT ATGGCCGCAT CGGCGAGTTC
GACGAGGCGC TTGCCCTCTA CGAGCAGGTC CTGAAGCAGG CGGCGGCCCA GCCGCGCGTG
TGGATGAGCT ACGGCCATAT GCTCAAGACC GTGGGCCGTC AGGCCGATGG CGTCGCCGCC
TATCGCCGCG CCATCGAACT CCTGCCGACG CTGGGAGAGG CGTGGTGGAG CCTTGCCAAC
CTCAAGACCG TGCGCTTCGA CGATGCCGAT ATCGCAGCGA TGGAAGCCGC GCTGCGCGTT
CCGGACCTTG CGCCGGAAGA CCAGTGGCAC CTCGATTTCG CGCTGGGCAA GGCGTTCGAG
GATCGGGGTG AGGCGGAACG ATCGTTTCGC CATTACGCCG CCGGCAATGC CCTGCGGAAG
AAGCGCATGC CCTATCAGGC GGAAGAGATC ACCGCGCAGG TCGACCGCGC TGTCGCCGCC
TTCACGCCCG CCACGGTCGC CGGGCTTTCC GGCAAGGGGT GCGAGGCGGG CGATCCGATC
TTCGTGCTTG GAATGCCGCG CGCGGGGTCG ACCCTGGTCG AACAGATCCT GGCCAGCCAC
TCGATGGTCG AAGGTACCAG CGAACTGGCC GACATCGGCT ACCTTGCGCG GACCGTCGAG
GGCTATCCAG CCGGTCTTTC GTCGTTGCAG GGCAATGACT TGCGAGCGCT AGGGGAGCAA
TACCTCGCGC GCACCCGCAT CCAGCGGCAT ACCGACCGGC CACTGTTCGT CGACAAGATG
CCGAACAACT GGATCCATGT CCCCTTCATC CGCGCGATCC TGCCCAACGC CAAGATCGTC
GACGCCCGGC GCCATCCGCT TTCCTGTTGC TTTTCAAACT TCAAGCAGCA CTTCGCGCGC
GGGCAGGGGT TCAGCTACTC GCTCGAAGAC ATGGGCCGCT ACTACCGCGA CTACGTGCGC
GCGATGGCTC ATTTCGACAA GGTCATACCC GGGGCTGTCC ATCGCGTGAT CTACGAGCGA
ATGGTCGAGG ATACCGAGGC GGAAGTGCGT GCGCTGCTGG CATATTGCGG GCTGGCCTTC
GAGGACAACT GCCTCGCCTT TCACCGGACC GAGCGGGCCG TCCGCACGGC CAGTTCCGAG
CAGGTCCGCC AGCCCATCTT CAGGGACGGC ACAGATGCGT GGAAGGCCTT TGAACCCTGG
CTAGGTGAAC TCAAGGTCGC GTTAGGTGCC GTTCAGGACT TCTACCCCGA AGCGCCTCCG
TTCGACTGA
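
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be used to recompute a value such as the GC content. The following is a minimal sketch of that clean-up and of a GC percentage calculation; the helper names are hypothetical, and only the first line of the listing is included to keep the example short.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as listed above (six blocks of ten bases).
first_line = "ATGTTTGTGA TGCTGGCAAT GGCCAACGTC ACCACTTCCG CGCCCCGCGC CGTCGCACCC"
seq = clean_sequence(first_line)

print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this first line only
```

Passing the full 1629-base listing through `clean_sequence` and `gc_percent` would give the figure to compare with the GC content field.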
 
Protein sequence
MFVMLAMANV TTSAPRAVAP AYDPRLVRAA VAMNENDLPT AEPLLRALLK DDPFDVRAIR 
LFAELAGRIG RYQDAENLLR RAIELAPQFT AARANLALVL YRTNRAPEAL EELAKVTADD
PENVGHANLQ AAAYGRIGEF DEALALYEQV LKQAAAQPRV WMSYGHMLKT VGRQADGVAA
YRRAIELLPT LGEAWWSLAN LKTVRFDDAD IAAMEAALRV PDLAPEDQWH LDFALGKAFE
DRGEAERSFR HYAAGNALRK KRMPYQAEEI TAQVDRAVAA FTPATVAGLS GKGCEAGDPI
FVLGMPRAGS TLVEQILASH SMVEGTSELA DIGYLARTVE GYPAGLSSLQ GNDLRALGEQ
YLARTRIQRH TDRPLFVDKM PNNWIHVPFI RAILPNAKIV DARRHPLSCC FSNFKQHFAR
GQGFSYSLED MGRYYRDYVR AMAHFDKVIP GAVHRVIYER MVEDTEAEVR ALLAYCGLAF
EDNCLAFHRT ERAVRTASSE QVRQPIFRDG TDAWKAFEPW LGELKVALGA VQDFYPEAPP
FD