Gene Anae109_4180 details

Gene Information

Locus tag: Anae109_4180
Symbol:
ID: 5378379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4900071
End bp: 4902005
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 73%
IMG OID: 640845707
Product: thimet oligopeptidase
Protein accession: YP_001381342
Protein GI: 153007017
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
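
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is unspliced and, by default, ends in a stop codon
    that is counted in the gene length but not in the protein length.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4900071, 4902005)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1935 bp) and Protein Length (644 aa) fields of this record.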


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCCCC ACGCCTCCCG CGAGCTGTGC GGGACCCCCG AGGAGTTCGA GCACGGCTGC 
CGCCGCGACA TGGAGCGCGC CCGCGCGGAG GCCGGGCGGC TGAAGGCCAT GCCGGCGCCG
CGCCCCGCGC CGGCCGCGCT CGCCGCCTTC GACGCCGCGT TCGGCGCGCT CTCCGACGCC
GCCGCGCGCG CCAGCCTCGC CCGGAACGTC CACCCCGATC CGCGCATGCG CGACGCCGCG
GAGCGCTCGG AGCAGGAGAT CGACGCGCTC TCGACCGAGC TGTCGCTCGA CCGCGGCCTC
TACGACGCGC TCGCCGCGCT CGACGTGTCC GGCGAGGACG CCGCGACCCG CTACTACCTC
CAGAAGAGCC TGCGAGACTT CCGCCGCGCC GGCGTCGATC GCGACGAGGA GACCCGCGCG
CGGGTGCGAG CCCTGCGCGA GGAGCTCGTC CGCATCGGCC AGGAGTTCGG CCGGAACATC
AAGGACGACG TGCGGCGTCT CGAGCTCGAG CCGGCCGACC TGGAGGGCCT GCCGGAGGAC
TGGCGGCGGG CGCACCCCCC CGGCCCGGAG GGCAAGGTCG TCGTCACGAC CGACAACACG
GACTACGTGC CGTTCATGAC CTACGCCCGC AGCGAGCCCG CGCGGGAGGC GCTCTGGCGG
CTGTACCGGC TGCGCGCCCA CCCGAAGAAC CTCGACGTGC TCGCGCGCCT GCTCGCGCGC
CGCGCCGAGC TGGCCGGGCT CCTCGGCTAC GAGACCTGGG CCGCGTACGT CACCGAGGAC
AAGATGATCG GCAGCCAGGC GGCGGCGGCC GAGTTCATCG AGCGGATCGC GCGCGCCGCC
GAGGCGCGCA TGCGGCGCGA CTTCGCGCAG CTCCTCGAGC GCAAGCGCGT CGACGTCCCC
GGCGCGGAGC GGGTCGAGCC GTGGGACAGC GCGTACCTCC AGGAGCGCGT GAAGGCCGAG
CAGTACGGCT TCGACTCCCA GTCGGTGCGC CCCTACCTCG ACTACGAGCG CGTGAAGGAG
GGCGTCCTGG ACGTCACCGG CCGCCTCTTC GGCATCGCCT ACCGTCGCGT CGCCGCGCCG
GTGTGGGACG TGGAGGTGGA GGCGTACGAC GTGGTCGAGG GCGAGCGGCC TCTCGGGCGC
GTCTACCTCG ACATGCACCC CCGCGACGGC AAGTACAAGC ACTACGCCCA GTTCACCCTC
GCCTCGGGGC AGGAGGGGCG CCAGCTCCCC GAGGGCGTCC TCGTCTGCAA CTTCCCGCGC
CCGCAGGGCG GCGCGCCCGC GCTCATGGAG CACGGGGACG TGAAGACGTT CTTCCACGAG
TTCGGCCACC TGCTCCACCA CGTGCTCGGC GGCCACACGC GCTGGGCCGG CCAGTCCGGC
GTCGCCACCG AGTGGGACTT CGTCGAGGCC CCCTCGCAGA TGCTGGAGGA GTGGGTGTGG
GATCCCGGAG TGCTCGCGGG GTTCGCGCGC CACGTCGAGA CCGGGGAGTC GCTCCCCGCC
GACGCCGTGC GCCGCATGAA GGCGGCGGAC GAGTACGGCA AGGGGCTCAT GGTGCGGCAG
CAGATGTTCT ACGCCGCGAC GAGCCTCGAG CTGCACCGCC GCGATCCGAG GGGACTCGAC
ACCACCGCCG TCGTGGCCGA GCTGCAGGAG CGCTACACCC CCTTCCGCCA CGTGGACGGC
ACCTACTTCC ACGAGTCCTT CGGCCACCTC GACGGCTACT CCGCCATCTA CTACACGTAC
ATGTGGTCGC TCGTGATCGC GAAGGACCTG TTCGGCCCGT TCCGCGAGAA GGGCCTCATG
GACCCCGAGC CCGCCCGCCG CTACCGGCGG GCCATCCTCG AGGCCGGCGG CTCGAAGCCC
GCCGCCGAGC TGGTGAAGGA CTTCCTCGGC AGGCCGCACG CGTTCGACGC GTACGAGCAG
TGGCTGAACG CGTAG
 
Protein sequence
MHPHASRELC GTPEEFEHGC RRDMERARAE AGRLKAMPAP RPAPAALAAF DAAFGALSDA 
AARASLARNV HPDPRMRDAA ERSEQEIDAL STELSLDRGL YDALAALDVS GEDAATRYYL
QKSLRDFRRA GVDRDEETRA RVRALREELV RIGQEFGRNI KDDVRRLELE PADLEGLPED
WRRAHPPGPE GKVVVTTDNT DYVPFMTYAR SEPAREALWR LYRLRAHPKN LDVLARLLAR
RAELAGLLGY ETWAAYVTED KMIGSQAAAA EFIERIARAA EARMRRDFAQ LLERKRVDVP
GAERVEPWDS AYLQERVKAE QYGFDSQSVR PYLDYERVKE GVLDVTGRLF GIAYRRVAAP
VWDVEVEAYD VVEGERPLGR VYLDMHPRDG KYKHYAQFTL ASGQEGRQLP EGVLVCNFPR
PQGGAPALME HGDVKTFFHE FGHLLHHVLG GHTRWAGQSG VATEWDFVEA PSQMLEEWVW
DPGVLAGFAR HVETGESLPA DAVRRMKAAD EYGKGLMVRQ QMFYAATSLE LHRRDPRGLD
TTAVVAELQE RYTPFRHVDG TYFHESFGHL DGYSAIYYTY MWSLVIAKDL FGPFREKGLM
DPEPARRYRR AILEAGGSKP AAELVKDFLG RPHAFDAYEQ WLNA
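
Both sequences are displayed in blocks of 10 characters separated by spaces, so the whitespace has to be stripped before they are used. The sketch below shows one way to do that and to translate the gene sequence for comparison with the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons); the names `clean` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the
# standard code, assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGCATCCCC ACGCCTCCCG CGAGCTGTGC"))
print(translate("TGGCTGAACG CGTAG"))
```

The two fragments translate to the first ten residues (MHPHASRELC) and the last four residues (WLNA) of the protein sequence listed in this record.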