Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1675 |
Symbol | |
ID | 3916250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1757069 |
End bp | 1758697 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444416 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_496949 |
Protein GI | 87199692 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.184674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGTGA TGCTGGCAAT GGCCAACGTC ACCACTTCCG CGCCCCGCGC CGTCGCACCC GCCTATGACC CGCGCCTGGT AAGGGCCGCC GTCGCCATGA ACGAGAACGA CCTGCCCACG GCCGAACCCT TGCTGCGCGC CCTGCTGAAG GACGATCCGT TCGATGTCAG GGCGATCCGG CTCTTTGCCG AACTGGCCGG GCGGATCGGG CGCTATCAGG ATGCGGAAAA CCTCCTGCGC CGGGCGATAG AACTGGCGCC GCAGTTCACC GCCGCGCGCG CCAACCTCGC GCTCGTGCTA TATCGCACGA ACCGCGCGCC CGAGGCGCTT GAAGAGCTCG CCAAGGTGAC CGCCGATGAT CCCGAGAACG TCGGACATGC CAATCTTCAG GCCGCCGCCT ATGGCCGCAT CGGCGAGTTC GACGAGGCGC TTGCCCTCTA CGAGCAGGTC CTGAAGCAGG CGGCGGCCCA GCCGCGCGTG TGGATGAGCT ACGGCCATAT GCTCAAGACC GTGGGCCGTC AGGCCGATGG CGTCGCCGCC TATCGCCGCG CCATCGAACT CCTGCCGACG CTGGGAGAGG CGTGGTGGAG CCTTGCCAAC CTCAAGACCG TGCGCTTCGA CGATGCCGAT ATCGCAGCGA TGGAAGCCGC GCTGCGCGTT CCGGACCTTG CGCCGGAAGA CCAGTGGCAC CTCGATTTCG CGCTGGGCAA GGCGTTCGAG GATCGGGGTG AGGCGGAACG ATCGTTTCGC CATTACGCCG CCGGCAATGC CCTGCGGAAG AAGCGCATGC CCTATCAGGC GGAAGAGATC ACCGCGCAGG TCGACCGCGC TGTCGCCGCC TTCACGCCCG CCACGGTCGC CGGGCTTTCC GGCAAGGGGT GCGAGGCGGG CGATCCGATC TTCGTGCTTG GAATGCCGCG CGCGGGGTCG ACCCTGGTCG AACAGATCCT GGCCAGCCAC TCGATGGTCG AAGGTACCAG CGAACTGGCC GACATCGGCT ACCTTGCGCG GACCGTCGAG GGCTATCCAG CCGGTCTTTC GTCGTTGCAG GGCAATGACT TGCGAGCGCT AGGGGAGCAA TACCTCGCGC GCACCCGCAT CCAGCGGCAT ACCGACCGGC CACTGTTCGT CGACAAGATG CCGAACAACT GGATCCATGT CCCCTTCATC CGCGCGATCC TGCCCAACGC CAAGATCGTC GACGCCCGGC GCCATCCGCT TTCCTGTTGC TTTTCAAACT TCAAGCAGCA CTTCGCGCGC GGGCAGGGGT TCAGCTACTC GCTCGAAGAC ATGGGCCGCT ACTACCGCGA CTACGTGCGC GCGATGGCTC ATTTCGACAA GGTCATACCC GGGGCTGTCC ATCGCGTGAT CTACGAGCGA ATGGTCGAGG ATACCGAGGC GGAAGTGCGT GCGCTGCTGG CATATTGCGG GCTGGCCTTC GAGGACAACT GCCTCGCCTT TCACCGGACC GAGCGGGCCG TCCGCACGGC CAGTTCCGAG CAGGTCCGCC AGCCCATCTT CAGGGACGGC ACAGATGCGT GGAAGGCCTT TGAACCCTGG CTAGGTGAAC TCAAGGTCGC GTTAGGTGCC GTTCAGGACT TCTACCCCGA AGCGCCTCCG TTCGACTGA
|
Protein sequence | MFVMLAMANV TTSAPRAVAP AYDPRLVRAA VAMNENDLPT AEPLLRALLK DDPFDVRAIR LFAELAGRIG RYQDAENLLR RAIELAPQFT AARANLALVL YRTNRAPEAL EELAKVTADD PENVGHANLQ AAAYGRIGEF DEALALYEQV LKQAAAQPRV WMSYGHMLKT VGRQADGVAA YRRAIELLPT LGEAWWSLAN LKTVRFDDAD IAAMEAALRV PDLAPEDQWH LDFALGKAFE DRGEAERSFR HYAAGNALRK KRMPYQAEEI TAQVDRAVAA FTPATVAGLS GKGCEAGDPI FVLGMPRAGS TLVEQILASH SMVEGTSELA DIGYLARTVE GYPAGLSSLQ GNDLRALGEQ YLARTRIQRH TDRPLFVDKM PNNWIHVPFI RAILPNAKIV DARRHPLSCC FSNFKQHFAR GQGFSYSLED MGRYYRDYVR AMAHFDKVIP GAVHRVIYER MVEDTEAEVR ALLAYCGLAF EDNCLAFHRT ERAVRTASSE QVRQPIFRDG TDAWKAFEPW LGELKVALGA VQDFYPEAPP FD
|
| |