Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Anae109_4180 |
Symbol | |
ID | 5378379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | - |
Start bp | 4900071 |
End bp | 4902005 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640845707 |
Product | thimet oligopeptidase |
Protein accession | YP_001381342 |
Protein GI | 153007017 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCCC ACGCCTCCCG CGAGCTGTGC GGGACCCCCG AGGAGTTCGA GCACGGCTGC CGCCGCGACA TGGAGCGCGC CCGCGCGGAG GCCGGGCGGC TGAAGGCCAT GCCGGCGCCG CGCCCCGCGC CGGCCGCGCT CGCCGCCTTC GACGCCGCGT TCGGCGCGCT CTCCGACGCC GCCGCGCGCG CCAGCCTCGC CCGGAACGTC CACCCCGATC CGCGCATGCG CGACGCCGCG GAGCGCTCGG AGCAGGAGAT CGACGCGCTC TCGACCGAGC TGTCGCTCGA CCGCGGCCTC TACGACGCGC TCGCCGCGCT CGACGTGTCC GGCGAGGACG CCGCGACCCG CTACTACCTC CAGAAGAGCC TGCGAGACTT CCGCCGCGCC GGCGTCGATC GCGACGAGGA GACCCGCGCG CGGGTGCGAG CCCTGCGCGA GGAGCTCGTC CGCATCGGCC AGGAGTTCGG CCGGAACATC AAGGACGACG TGCGGCGTCT CGAGCTCGAG CCGGCCGACC TGGAGGGCCT GCCGGAGGAC TGGCGGCGGG CGCACCCCCC CGGCCCGGAG GGCAAGGTCG TCGTCACGAC CGACAACACG GACTACGTGC CGTTCATGAC CTACGCCCGC AGCGAGCCCG CGCGGGAGGC GCTCTGGCGG CTGTACCGGC TGCGCGCCCA CCCGAAGAAC CTCGACGTGC TCGCGCGCCT GCTCGCGCGC CGCGCCGAGC TGGCCGGGCT CCTCGGCTAC GAGACCTGGG CCGCGTACGT CACCGAGGAC AAGATGATCG GCAGCCAGGC GGCGGCGGCC GAGTTCATCG AGCGGATCGC GCGCGCCGCC GAGGCGCGCA TGCGGCGCGA CTTCGCGCAG CTCCTCGAGC GCAAGCGCGT CGACGTCCCC GGCGCGGAGC GGGTCGAGCC GTGGGACAGC GCGTACCTCC AGGAGCGCGT GAAGGCCGAG CAGTACGGCT TCGACTCCCA GTCGGTGCGC CCCTACCTCG ACTACGAGCG CGTGAAGGAG GGCGTCCTGG ACGTCACCGG CCGCCTCTTC GGCATCGCCT ACCGTCGCGT CGCCGCGCCG GTGTGGGACG TGGAGGTGGA GGCGTACGAC GTGGTCGAGG GCGAGCGGCC TCTCGGGCGC GTCTACCTCG ACATGCACCC CCGCGACGGC AAGTACAAGC ACTACGCCCA GTTCACCCTC GCCTCGGGGC AGGAGGGGCG CCAGCTCCCC GAGGGCGTCC TCGTCTGCAA CTTCCCGCGC CCGCAGGGCG GCGCGCCCGC GCTCATGGAG CACGGGGACG TGAAGACGTT CTTCCACGAG TTCGGCCACC TGCTCCACCA CGTGCTCGGC GGCCACACGC GCTGGGCCGG CCAGTCCGGC GTCGCCACCG AGTGGGACTT CGTCGAGGCC CCCTCGCAGA TGCTGGAGGA GTGGGTGTGG GATCCCGGAG TGCTCGCGGG GTTCGCGCGC CACGTCGAGA CCGGGGAGTC GCTCCCCGCC GACGCCGTGC GCCGCATGAA GGCGGCGGAC GAGTACGGCA AGGGGCTCAT GGTGCGGCAG CAGATGTTCT ACGCCGCGAC GAGCCTCGAG CTGCACCGCC GCGATCCGAG GGGACTCGAC ACCACCGCCG TCGTGGCCGA GCTGCAGGAG CGCTACACCC CCTTCCGCCA CGTGGACGGC ACCTACTTCC ACGAGTCCTT CGGCCACCTC GACGGCTACT CCGCCATCTA CTACACGTAC ATGTGGTCGC TCGTGATCGC GAAGGACCTG TTCGGCCCGT TCCGCGAGAA GGGCCTCATG GACCCCGAGC CCGCCCGCCG CTACCGGCGG GCCATCCTCG AGGCCGGCGG CTCGAAGCCC GCCGCCGAGC TGGTGAAGGA CTTCCTCGGC AGGCCGCACG CGTTCGACGC GTACGAGCAG TGGCTGAACG CGTAG
|
Protein sequence | MHPHASRELC GTPEEFEHGC RRDMERARAE AGRLKAMPAP RPAPAALAAF DAAFGALSDA AARASLARNV HPDPRMRDAA ERSEQEIDAL STELSLDRGL YDALAALDVS GEDAATRYYL QKSLRDFRRA GVDRDEETRA RVRALREELV RIGQEFGRNI KDDVRRLELE PADLEGLPED WRRAHPPGPE GKVVVTTDNT DYVPFMTYAR SEPAREALWR LYRLRAHPKN LDVLARLLAR RAELAGLLGY ETWAAYVTED KMIGSQAAAA EFIERIARAA EARMRRDFAQ LLERKRVDVP GAERVEPWDS AYLQERVKAE QYGFDSQSVR PYLDYERVKE GVLDVTGRLF GIAYRRVAAP VWDVEVEAYD VVEGERPLGR VYLDMHPRDG KYKHYAQFTL ASGQEGRQLP EGVLVCNFPR PQGGAPALME HGDVKTFFHE FGHLLHHVLG GHTRWAGQSG VATEWDFVEA PSQMLEEWVW DPGVLAGFAR HVETGESLPA DAVRRMKAAD EYGKGLMVRQ QMFYAATSLE LHRRDPRGLD TTAVVAELQE RYTPFRHVDG TYFHESFGHL DGYSAIYYTY MWSLVIAKDL FGPFREKGLM DPEPARRYRR AILEAGGSKP AAELVKDFLG RPHAFDAYEQ WLNA
|
| |