Gene Anae109_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1785 
Symbol 
ID5376088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2015911 
End bp2017779 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content71% 
IMG OID640843293 
Productpeptidyl-dipeptidase A 
Protein accessionYP_001378972 
Protein GI153004647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.44307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACA TTCTCCTCGT CACAGCACTG CTGCTCTCCA CGGCCGCGGG GGGCGCCGCC 
GCCGAGCCCT CCGACCCCGA CGCCCGCGCG ACCGTCGTCG CTCCGGCGCC CTCCTCGAAG
CCGCCCACCG CCGCGGAGGC GAAGGCGTTC GTCGACGGCG TCAACGCCGA GCTGAAGCGG
CTCTGGATCC GCTCGTCCAC CGCCGACTGG ATCAAGGCCA CCTACATCAC CGACGACACC
GAGCGGAACG CCGCCGCCCT CAACGAGGAC GTCATGGCGT ACCTCTCGCG CGCCATCGCC
GAGTCGGTCC GCTTCGACGG CGTGAAGGCG GACGCCGACA CCGCGCGCAT GCTGCACCTG
CTCAAGGTCG CCTCGTCGCT GCCCGCGCCG AGCGATGCGG CCCGCCGGCG GGAGCTCGCC
GAGATCTCCG CGAAGCTCGA GGGGATCTAC GGCAAGGGCA AGTGGTGCGG GACGCCCGCG
CCGGGCAGGG CGGCGCCGCG CTGCCGCGAC CTGCAGCAGC TCGAGGAGGT CCTCGCGAAG
AGCCGCAGCT ACCCCGAGCT GCTCGACGCC TGGACCGGCT GGCACACCAT CTCGCGCGAG
ATGCGCCCGC TGTACGAGCG GCTCGTCACC CTCGGCAACG AGGGGGCCAG GGAGATCGGC
TTCAGCGATC TCGGCGACCT CTGGCGCGCC GACTACGACA TGGCGCCCGA GGCGTTCGAG
GCCGACGTCG GCCGGCTGTG GGCGGAGGTG AAGCCGCTCT ACGACGAGCT CCACTGCTAC
GTGCGCGGCC GGCTCCAGCA GGCCTACGGG AAGGCCAAGG TCCCCGACGG AAAGCCGATC
CCGGCGCACC TGCTCGGCAA CATGTGGGCG CAGGACTGGT CGAACCTCTA CCCGCTCGTC
GAGCCGTTCA AGGGCGTGGG GAGCCTCGAC GTGGACGCGG CCCTGAAGCG TCAGAAGTAC
GACGCGGCGC GGATGGTGAA GCTCGGCGAG GCGTTCTTCA CCTCGCTCGG CCTCGAGCCG
CTCCCGCCCA GCTTCTGGGA GCGCTCCCAG CTCGTGAAGC CGCGCGACCG GGAGGTGGTG
TGCCACGCGA GCGCGTGGGA CGTCACCTTC GCCGCCGACC TGCGCATCAA GATGTGCATC
CGGCCCATCG AGGAGGACCT CGTCACCATC CACCACGAGC TGGGCCACAA CTATTACCAG
CGCGCCTACG TCCACCTGCC GCTGCTCTTC CAGGACAGCG CCAACGACGG CTTCCATGAG
GCGCTCGGCG ACGCGATCGC GCTCTCCGTG ACGCCGGGAT ATCTGAAGCA GGTCGGGCTC
GTCCCGGGCG TCCCGAAGGA CGACCGCGGC ACCATCAACT TCCAGATGAA GAAGGCGCTC
GAGAAGATCG CCTTCCTCCC GTTCGGGCTC CTCATCGACC AGTGGCGCTG GGATGTGTTC
AGCGGGAAGG TGCCGCCGGA CCGCTACAAC GCCGCGTGGT GGGAGCTCCG GCGGAAGTAC
CAGGGCGTCG ACGCCCCGGT CGCGCGGAGC GAGGCCGACT TCGACCCGGG CGCCAAGTAC
CACATCCCCT CGAACGTCCC GTACACCCGC TACTTCCTGG CGCACGTGTA CCAGTTCCAG
TTCCACCAGG CGCTGTGCGA GGCGGCCGGC TGGAAGGGGC CGCTCCACCA GTGCTCGATC
TACGGCTCCA AGGACGCCGG CAAGCGGCTC GTGGCGATGA TGGAGCTCGG CGCGTCGCGG
CCGTGGCCGG AGGCGTACGC GGCCCTCGCC GGCGCGAAGC AGGCCGACGC GTCGGCGCTG
CTCGCGTACT TCGCCCCTCT CCGCAAGTGG CTCGCGGAGC AGAACGCGGG CCGCACGTGC
GGGTGGTGA
 
Protein sequence
MRHILLVTAL LLSTAAGGAA AEPSDPDARA TVVAPAPSSK PPTAAEAKAF VDGVNAELKR 
LWIRSSTADW IKATYITDDT ERNAAALNED VMAYLSRAIA ESVRFDGVKA DADTARMLHL
LKVASSLPAP SDAARRRELA EISAKLEGIY GKGKWCGTPA PGRAAPRCRD LQQLEEVLAK
SRSYPELLDA WTGWHTISRE MRPLYERLVT LGNEGAREIG FSDLGDLWRA DYDMAPEAFE
ADVGRLWAEV KPLYDELHCY VRGRLQQAYG KAKVPDGKPI PAHLLGNMWA QDWSNLYPLV
EPFKGVGSLD VDAALKRQKY DAARMVKLGE AFFTSLGLEP LPPSFWERSQ LVKPRDREVV
CHASAWDVTF AADLRIKMCI RPIEEDLVTI HHELGHNYYQ RAYVHLPLLF QDSANDGFHE
ALGDAIALSV TPGYLKQVGL VPGVPKDDRG TINFQMKKAL EKIAFLPFGL LIDQWRWDVF
SGKVPPDRYN AAWWELRRKY QGVDAPVARS EADFDPGAKY HIPSNVPYTR YFLAHVYQFQ
FHQALCEAAG WKGPLHQCSI YGSKDAGKRL VAMMELGASR PWPEAYAALA GAKQADASAL
LAYFAPLRKW LAEQNAGRTC GW