Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_3047 |
Symbol | |
ID | 4483110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 3814161 |
End bp | 3815129 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639723794 |
Product | dipeptidyl aminopeptidases/acylaminoacyl-peptidases-like protein |
Protein accession | YP_866944 |
Protein GI | 117926327 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0033788 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0365868 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACCCCC CCCATAATAG CCAAACCGTA CGCATGCTCA TGACAGCCGT TATGGATAAC AAGGGGCGCT TCAGCGTGCG TTACAATCCC CTGACGCTTC TAACTTCCAT TGCTCACCGC TGGCTTCTGC ACGCCTTTCG CATTCGTCAG CGCGGTTATG CGGCTCCGCC GATCAAAGAC GGCTTGCATG CCCACCAAGT ACAGTTCCCT ACCGCCAACG GTAAAAATCT CATAGGCTGG TGGTGTGATC CAGGCCAAAG AGGTACAGTG GTCGTCATGA TGCACGGTTG GGGAGCCAAC GCCTCCCACC TGTTTCCCTT GGCTCAAGCT TTTGTCGCTG CGGGCCACCC TGTGTTGCTG TTTGACGCCC GCTGCCATGG GCTGAGCGAT GATGACGGTT TTGCCTCCTT ACCCCGTTTT AGTGAGGATA TTTTGGCGGC ACTGCATTAT TTAGCCTCCC TTGGTCACAC CACTCCTCTG CTGTTGGGGC ACTCCGTAGG AGCAGGCGCG GCCCTGTTGG CGGCAACCCG TTGGAAATCG CTACAGGGCG TTGTGAGCAT CTCGGCGTTT GCCCATCCGC AGGAGATGAT GCGGCGCTGC CTGCGCGGCT GGCACATCCC CTATTGGCCC ATTGGAGGAT GGCTGCTTCG CCACGTTCAA CGCATCATCG GCCACCGATT TGATGACATT GCCCCCATCC ACACCATACG CCAACTGGAA ATTCCTCTGC TGCTTATCCA TGGCGAAGCC GACACCACTG TGCCCGTGGC CGACGCCCAG CGCCTGCATA GGGCCAACCC CCTATCCGAG CTGTTCGTGC TGCCCGAGGC CGGACATAAC CGGGTGGAGG AGCTGCTGCC CCATACAGAG CAGCTACTGG CTTGGATCGC GGCATTGAAC CCAACCGCAA TAAGCCGCGC GCAGAGCGAG CCCATCCCTA CCGTTTATTC CAAGAAAGTG TCGCTATGA
|
Protein sequence | MHPPHNSQTV RMLMTAVMDN KGRFSVRYNP LTLLTSIAHR WLLHAFRIRQ RGYAAPPIKD GLHAHQVQFP TANGKNLIGW WCDPGQRGTV VVMMHGWGAN ASHLFPLAQA FVAAGHPVLL FDARCHGLSD DDGFASLPRF SEDILAALHY LASLGHTTPL LLGHSVGAGA ALLAATRWKS LQGVVSISAF AHPQEMMRRC LRGWHIPYWP IGGWLLRHVQ RIIGHRFDDI APIHTIRQLE IPLLLIHGEA DTTVPVADAQ RLHRANPLSE LFVLPEAGHN RVEELLPHTE QLLAWIAALN PTAISRAQSE PIPTVYSKKV SL
|
| |