Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AnaeK_3888 |
Symbol | |
ID | 6785355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. K |
Kingdom | Bacteria |
Replicon accession | NC_011145 |
Strand | + |
Start bp | 4390730 |
End bp | 4392379 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642765358 |
Product | peptidase U34 dipeptidase |
Protein accession | YP_002136226 |
Protein GI | 197124275 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.309446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACC ACCGCAAGAC CGCCCCCGCC GCCGCGCTCG CCGCGGCAGC CCTCGTCCTC TCCCTGCCCG GCGCGGCCGA CGCCTGCACC AGCATCCTGG TCTCGAAGGG CGCGAGCGCG GACGGCTCCA CCTTCATCAC CTACGCGGCC GACTCGCACG ACCTCTACGG CGACCTCCCG CTCCGCCCGG CGGCGCAGCA CGCGCCCGGC GCACAGCGCG AGATCATCGA GTGGGACACC GGCAAGTTCC TGGGCCGCAT CCCGCAGCCG GCCGTCACCT ACCACGTGGT CGGCAACATC AACGAGCACC AGGTCGCCAT CGGCGAGACC ACCTTCACCG GCCGCAAGGA GCTGCAGGAT CCCGAGGGCC GGGTGGACTA CGGCTCGCTC ATGTACATCG CGCTGGAGCG CGCCCGCACC GCGCGCGAGG CGATCCAGGT GATGACCGAC CTCGTGGCCG AGTACGGCTA CGCCTCCACC GGCGAGTCCT TCTCCATCTC GGATCCGAAC GAGGCCTGGA TCCTCGAGAT GATCGGCAAG GGGCCGAAGC GGAAGGGCGC GGTCTGGGTG GCCCGCCGCA TCCCGGACGG CTACGTGTCG GCGCACGCGA ACCACGCCCG CATCCGCCAG TTCCCGCTCG ACGAGCCGAA GACCACGCTC TACGCGAAGG ACGTCATCTC GTTCGCCCGC GAGAAGGGCT GGTTCAAGGG CAAGGACGCC GAGTTCAGCT TCGCCGACAC CTACGCGCCG CTCGACTTCG GCGCGCTGCG CGCCTGCGAC GCGCGGGTGT GGAGCGTGTT CCGCCGCGTG GCGCCGGGGC AGTCGCTGCC GTCCTCCATG GTGAAGGGAC AGGACCCGAA GGCCGAGCGC GTGCCGCTGT GGGTGAAGGC CGAGAAGCCG CTCGCGGTGC GCGACGTGAT GGCGCTCATG CGCGACCACT TCGAGGGCAC CGAGCTCGAC CTGTCGAAGG GCGTGGGCGC GGGCCCGTTC TCGGTGCCGT ACCGCTGGCG GCCCATGACG TTCAAGGTGG ACGACCAGGA GTACCTGAAC GAGCGGGCCA TCTCGACGCA GCAGACCGGC TTCTCGTTCG TGGCGCAGTC GCGCGCCGCG CTGCCGGCGG CGGTGGGCGG GGTGCTCTGG TTCGGCGTGG ACGACACGTA CAGCACCGTC TACGTGCCGA TGTACTGCTC GATCCACGAG GTGCCGCGCA GCTTCGCGGT GGGCACCGCC GACTTCAAGA CGTTCAGCTG GGACTCGGCG TTCTGGGTGT TCAACTTCGT GTCGAACTGG GCCTACTCGC GCTACTCGGA CATGATCCAG GACGTGCAGC AGGTGCAGGG CGAGCTGGAG GGCGGGTTCC TCTCGCGGCA GGCGGAGCTG GAGAAGGCCG CCCTGACGCT CTACAAGGAC TCGCCCGGCC TGGCCCGCGA CTACCTCACC CGCTACTCGG TCAGCCAGGG CGACATGGTC ACGGCGCGCT GGCGCAAGCT GGGCGAGTCG CTGATGGTGA AGTACCTCGA CGGCAACGTG CGCGACGCGC AGGGCAACGT CACGCACCCG GACTACCCGG AGGCCTGGCG CCGGCGCATC GCGGCCGAGG ACGACGGCAT CCTGCGGGTG CCGAAGGAGC CGCAGAAGGT GGCGCAGTAG
|
Protein sequence | MLDHRKTAPA AALAAAALVL SLPGAADACT SILVSKGASA DGSTFITYAA DSHDLYGDLP LRPAAQHAPG AQREIIEWDT GKFLGRIPQP AVTYHVVGNI NEHQVAIGET TFTGRKELQD PEGRVDYGSL MYIALERART AREAIQVMTD LVAEYGYAST GESFSISDPN EAWILEMIGK GPKRKGAVWV ARRIPDGYVS AHANHARIRQ FPLDEPKTTL YAKDVISFAR EKGWFKGKDA EFSFADTYAP LDFGALRACD ARVWSVFRRV APGQSLPSSM VKGQDPKAER VPLWVKAEKP LAVRDVMALM RDHFEGTELD LSKGVGAGPF SVPYRWRPMT FKVDDQEYLN ERAISTQQTG FSFVAQSRAA LPAAVGGVLW FGVDDTYSTV YVPMYCSIHE VPRSFAVGTA DFKTFSWDSA FWVFNFVSNW AYSRYSDMIQ DVQQVQGELE GGFLSRQAEL EKAALTLYKD SPGLARDYLT RYSVSQGDMV TARWRKLGES LMVKYLDGNV RDAQGNVTHP DYPEAWRRRI AAEDDGILRV PKEPQKVAQ
|
| |