Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_1773 |
Symbol | |
ID | 4482599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 2191616 |
End bp | 2193085 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639722515 |
Product | protease Do |
Protein accession | YP_865687 |
Protein GI | 117925070 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000225032 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000850393 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATACACG CATGCAAATT TGGCGCGCGC GCCTTGGTCG GGGGGCTCTC CATGCTGGCG TTGGTGGCGG TCTCGCTGCC CCAAGCTTAC GCGGAGGGCG GTTTTCCCAA TCTTGTGCCG CTGGTCAAAA GGCTTAAACC CGCAGTGGTC AATATCAGCA CCACTCAAAC CGTCGAAAAT TCAAATAAAA TACCAAAAAA ACATGGCCAG AACGAGTTTG AGGGGACCCC CTTTGAGGAC CTCTTTCGAC ACTTTTTTGA CCGTCTGCCC GATCAAGACC GCTCGTTTAA AACCAACTCC CTGGGCTCGG GGTTTATTGT CGATGCCGCG GGCTATATTC TTACCAACCA TCATGTGATT GATAAGGCGA CTGAGATTAC CGTCAAGCTC TATGATGAGA CGGAATATCG GGCCGAGGTG GTTGGTAAAG ATAAAAAAAC CGATCTAGCC CTGATTCGTA TCCATACCGA CAAGCCGCTG GCTGTGGCCA AGCTTGGGGA TTCGTCCAAG GCAGAGGTGG GTTCTTGGGT GATGGCCATC GGCAACCCGT TTGGGCTAGA AGAGACGGTG ACAGTCGGGA TTATCTCCGC CAAAGGGCGG GTGATTGGGG CAGGTCCTTA TGATAATTTT ATTCAGACCG ATGCGGCCAT CAATCCTGGC AATTCGGGCG GGCCACTGTT TAATTTGGAC GGGGATGTGG TGGGCATTAA CACCGCCATT TATTCCCGTG GTGGTGGCAG TGTAGGGGTT GGTTTTGCCA TACCGGTTAA TCTGGCCAGT CATGTGATGG AGCAGTTGAA AAATAAGGGC TTTGTCGAGC GTGGTTGGCT TGGGGTACGC ATACAGACCA TCACCAAAGA GCTGGCCGAA GCGATGCATC TTAAAGATCG CGTGGGTGCG TTGGTGGCCG AGGTTATCGA AGATAGTCCA GCGGCCAAAG CGGGCATCCA TCCCGAGGAT GTGATTATCT CCTTTAATGA AAAGGAGGTC ACCAAGATGA ACAGCCTACC CGCCATTGTG GCGAATACGC CGGTGGGGAC ACGGGTACCG GTTAAGGTGA TACGCGAAGG TAAAGAGCGC ACCCTATGGG TCGGCATTGC TAAACTGGAT GATGATAAAG TGGCGGCTGA TGAAGATGGC CGTGCAGCCT CTGGCGAGAA AAAGGCGGAT AGCAGCGCCG TGAAAGAGCG TTTAGGGTTG CGGGTTAGCC AAGTGACCAC CGAGTTGATG GAGCGCATGA AGCTACCCGA CGATGCCAAA GGGGTGGTCA TTACCGCTTT GGAGGCGGAT GGCAGTGCCG TGCAAGCGGG TCTGCGCACG GGGGATGTGA TTACACAGTT TGACCGTAAG CCTATCAAAG ATGTGGATGA TTTGGTAAAG GTGCTCAAGG GCGTTAAGGC CGATAGCATG GCGTTGGTCT ATGTGCTTCG TGCTGGAGAG CCGTTGTTCG TGCCTATTCG GGTTAAGTAA
|
Protein sequence | MIHACKFGAR ALVGGLSMLA LVAVSLPQAY AEGGFPNLVP LVKRLKPAVV NISTTQTVEN SNKIPKKHGQ NEFEGTPFED LFRHFFDRLP DQDRSFKTNS LGSGFIVDAA GYILTNHHVI DKATEITVKL YDETEYRAEV VGKDKKTDLA LIRIHTDKPL AVAKLGDSSK AEVGSWVMAI GNPFGLEETV TVGIISAKGR VIGAGPYDNF IQTDAAINPG NSGGPLFNLD GDVVGINTAI YSRGGGSVGV GFAIPVNLAS HVMEQLKNKG FVERGWLGVR IQTITKELAE AMHLKDRVGA LVAEVIEDSP AAKAGIHPED VIISFNEKEV TKMNSLPAIV ANTPVGTRVP VKVIREGKER TLWVGIAKLD DDKVAADEDG RAASGEKKAD SSAVKERLGL RVSQVTTELM ERMKLPDDAK GVVITALEAD GSAVQAGLRT GDVITQFDRK PIKDVDDLVK VLKGVKADSM ALVYVLRAGE PLFVPIRVK
|
| |