Gene Information |
Locus tag | Anae109_3441 |
Symbol | |
ID | 5376575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | + |
Start bp | 4041773 |
End bp | 4043554 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640844966 |
Product | peptidase U34 dipeptidase |
Protein accession | YP_001380609 |
Protein GI | 153006284 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
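Note: the coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; both assumptions are consistent with the values listed but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Anae109_3441.
length = gene_length_bp(4041773, 4043554)
protein = expected_protein_length(length)
print(length, protein)  # 1782 593
```

Under these assumptions the computed values agree with the Gene Length (1782 bp) and Protein Length (593 aa) fields.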
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.534665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.658242 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTC GCTCGCTCGC CGTCCCGCTC GCCGCCCTCT TCGCGGCCAG CGCGCTCGCC GCCCGCCCCG CGCGCGCCTG CACGAACATC CTGGTCTCCC GCGGCGCCAC CGCGGACGGC TCCACGCTCG TCACCTACGC CGCCGACTCG CACGACCTGT ACGGCGAGCT GTACTACACC GCCGCCGCCC GCCACCCGGC CGGCGCCGCG CGCGAGGTCG TCGAGTGGGA CACGGGCGAC CGCCTCGGCC AGATCCGGCA GGCGGAGGTC ACCTACAACG TCGTCGGCAA CATGAACGAG CACCAGGTCG CGATCGGCGA GACCACCTTC GGCGGCCGCG ACGAGCTCGT CGACCCGAAG GGCGGCATCG ACTACGGCAG CCTCATGTAC GTCGCGCTCG AGCGGGCGCG CACGGCGCGC GAGGCGATCG AGGTCATGAC CTCGCTCGTC GCCGAGTACG GCTACCGCAG CTCGGGCGAG TCCTTCTCGA TCTCGGATCC GAAGGAGGTC TGGATCCTCG AGATGATCGG CAAGGGGCCG GGGCGGAAGG GCGCGGTGTG GGTCGCCCGG CGCGTGCCCG ACGGCCACGT CTCGGCCCAC GCGAACCACG CGCGCATCCG CCAGTTCCCG CGCAACGATC CGAAGAACAC CCTCTACGCG AAGGACGTCG TCTCGTTCGC CCGCGAGAAG GGCTGGTTCA CGGGCAAGGA CGAGGACTTC AGCTTCTCCG ACACCTACGC GCCCCTCACC GCCGGCGCCC TGCGCGCGTG CGAGGCGCGG GTGTGGAGCG TGTTCCGCCG GGTCGCCCCG TCGCTGAACC TCTCCGTCGA GCACGTGACG GGCGGCCCGA GCGCGCCGCG CCTGCCGCTG TGGGTGAAGC CCGACGCCAA GGTGTCGGTC CGTGGCGCGA TGGAGCTCAT GCGCGATCAC TTCGAGGGGA CCCCGCTCGA TCTCTCGCAG GGCGTGGGCG CCGGCCCGTT CGCGCTGCCC TACCGGTGGC GGCCGATGAC CTGGAAGGTG GACGGGGCCG AGTACCTCCA CGAGCGCGCC ATCTCCACGC AGCAGACCGG CTTCTCGTTC GTGGCGCAGG CGCGCGAGTG GCTCCCGGGG CCGATCGGCG GCGTCCTCTG GTTCGGGCTC GACGACACCT ACTCCACCGT CTACGTCCCC CAGTACTGCG GCAACCGCGC GGTCCCGCGG ACGTTCGGCG TGGGCAGCGG CAACTTCCAG GAGTTCAGCT GGGACTCGGC GTTCTGGGTC TTCAACTTCG TCTCGAACTG GGCGTACGGG CGCTACAGCG ACATGATCCA GGACGTCCAG AAGGTGCAGC GGGAGCTCGA GAGCGGGTTC CTCTCGCGGC AGGAGGACGT GGAGAAGGCC GCCCTCGCCC TCCACAAGAC GTCCCCGGGG CTCGCGCGCG ACTACCTCAC CCAGTACTCG GTCGAGCAGG GCGACCGCAC CACCGCGCGC TGGCGCAGGC TGGGAGAGAC GCTCATCGTG AAGTACCTGG ACGGGAACGT GCGCGACGAG CACGGCAAGG TGAAGCATCC CGATTACCCG GAGGGCTGGC GGCGGCGGAT CGCGCAGGAC CGGGGGGAGC AGATCAAGGT GGTGAGGTTC CCGGGGGAGA AGGAGGAAGA GGAGCCGACC TCGACCCCGA CCTCGACGGC GCCGGATCGG ACGGCCGCTC ATCCCGAGCG CAGCGACCCC GCGGCCGCGG CGGCGCGGAG TCGAGGGACG ACCCCGACCC CGAGCGGATC TCCTGAGGCC GCTCGCCCGT GA
|
Protein sequence | MIRRSLAVPL AALFAASALA ARPARACTNI LVSRGATADG STLVTYAADS HDLYGELYYT AAARHPAGAA REVVEWDTGD RLGQIRQAEV TYNVVGNMNE HQVAIGETTF GGRDELVDPK GGIDYGSLMY VALERARTAR EAIEVMTSLV AEYGYRSSGE SFSISDPKEV WILEMIGKGP GRKGAVWVAR RVPDGHVSAH ANHARIRQFP RNDPKNTLYA KDVVSFAREK GWFTGKDEDF SFSDTYAPLT AGALRACEAR VWSVFRRVAP SLNLSVEHVT GGPSAPRLPL WVKPDAKVSV RGAMELMRDH FEGTPLDLSQ GVGAGPFALP YRWRPMTWKV DGAEYLHERA ISTQQTGFSF VAQAREWLPG PIGGVLWFGL DDTYSTVYVP QYCGNRAVPR TFGVGSGNFQ EFSWDSAFWV FNFVSNWAYG RYSDMIQDVQ KVQRELESGF LSRQEDVEKA ALALHKTSPG LARDYLTQYS VEQGDRTTAR WRRLGETLIV KYLDGNVRDE HGKVKHPDYP EGWRRRIAQD RGEQIKVVRF PGEKEEEEPT STPTSTAPDR TAAHPERSDP AAAAARSRGT TPTPSGSPEA ARP
|
| |
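Note: the sequences above are displayed in blocks of ten characters separated by spaces, so they need to be joined before any downstream use. The sketch below is a minimal illustration with hypothetical helper names; it strips the spacing and computes a GC fraction over whatever string it is given. It is demonstrated only on the first two blocks of the gene sequence and does not recompute the record's GC content field.

```python
def clean_sequence(field: str) -> str:
    """Remove the display spacing from a sequence field and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (no ambiguity codes handled)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two ten-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGATCCGTC GCTCGCTCGC")
print(fragment, len(fragment))
print(gc_fraction(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` would give a value that can be compared with the GC content field.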