Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_1533 |
Symbol | |
ID | 4482858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 1884052 |
End bp | 1886982 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639722275 |
Product | TPR repeat-containing protein |
Protein accession | YP_865448 |
Protein GI | 117924831 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAT GTAGACACAG CAGGGTAGGT GTGAGCAGAC GCGCTGCCCT GTTTTTAGCG CTCATCGTGG TGATGTCGTC CGGCCTATCT GCTGGCTTGG CAGCGCCCAA TCTGGCCGAG GATGTTCTGT TGGCTGGCCC GCAGCGTACC GCGTTGTGGC CTCATGGGGT CGCGCTTCCA ACAGACCGTG GGAGCCTGTT ACAACTGGCC AATCAAGGCT ACCGTCGAGC CCAGTTTTTT ATGGGGCTTT ATCTGGATTA TGGTATCGGC GGAGAGGCTC AACCTTTTGA GGCCTTTCAA TGGTATAGCC GCGCAGCGGG GCAGGGCAGT CGTTGGGCTT GGATTAAATT GGGTGATCTC TATTTCCGTG GCCGTGGCAC CGCACGGGAT GCCAAAAAAG CGCTGCAATG GTACCTGCAT GCCGGGGAAA ATGGCGAGCC TAGTGGCTAT CTGGCTGCCG CAATGGTACA GATTCGTGGT ACGGGCGATA GCATTGATTG GCCAACCGTT TTACAGCGTG TGCACAAGGC TGTGCAAGCG GGGGTATTGG AGGGGCACAC AGCACTCTGT CTCTTAGGGG TGCGACATGT GGTGGCGACG CAGCTACCCC CAGAGCAGAC CCGCCAACAG TGCCAAATGA GTGCGGATGG GGGGCAGCCC TTTGCCGCGA TTTTGATGGC CCAATGGCTG GCTGAGCAGC ATAACGAGAA CAATCCCCAA CTGCAAAAAC AGGTCGAACA GTGGTACGAG CGGGCTGCTG CCATGGAGGT AGGTAACTTG GCCGTTCTGC CCGAGGCACC ACTGGTGATG GCCGAGCATT TCCCCGTTGT TTGGATAGAG GCTCTAGAAC CACTGGTCAG TGCCAACCGC GCCAGCGTGC CGCGTGCGGC CTATCGTGCT GGGGTGGCGC TGCTCAAGTC GGGGCAGGGT TTTGTGGCGC GTAGTCGCGG CGTGCGCTGG TTACAGAAGG GCGCGGAGTT GGGCGATGCC AATGCTCAAT TTAGACTGGG GCTTGCCTAT GCCCAGGGTG AAGGGGTGGT GGTTAATCCA GAACGCGCCA TTTACTGGTA TACCCTTGCG TCGGAGCAGG GTGAAGTTTC GGCACAGTTT AATTTGGCCT TGCTCTACTA CCAAGGGCGT TTGGTTGAGC AGGATTTTAC CAAGGCCCGT TTTTGGTTTG AGCATGCCTC GGAACAGGGT GACGTGCAGG CGCGGGATCA TCTGGGGGAT ATTTATCGTC ATGGCCGGGG CATACCGGTC AATATAGCCG AGGCCATGAA GTGGTACCGC CACGCTGCCG AGCAGAAAAA TGTCTACGCG CTTACCTCAA TGGGGGATAT CTACCAAGCG GGTGAGGGGG TTGCCGAGGA TGCCGCAGAG GCCGCAAAAT GGTACCGCAA AGCGGCGCTG CTGGGTCATG CCCCGGCGCA GGGTAACTTG GCCGACCTTT ATCGACAGGG TAAAGGGGTA GAAAAGGATC TTAATCAGGC CGCACAGTGG TATACAAAAG CTGCTGAACA GGGGGACATG GTCTCTCAAA ACTGGCTGGG CACGCTCTAC CTGGATGGGG ACGGTGTTGA AAAAAATCCC CAGTTGGCGC AGCAATGGTA CGAAAAATCA GCGGCTCAAG GGTACGCTTT TGCCCAGAAC AATTTGGCGG TTATGTTGCG GGATGGCTTG GCGGGCAAGG CTGATTATAA GCGAGCCAGA CAGCTCTTTT TGTTAGCTGC TCGGCAAAAC AGTGGTGATG CGCAAAATAG TCTTGGCGTG CTTTATGAAA AGGGGTTAGG GGGCGAAACC GATCCCATTG AGGCTGCTGC GTGGTATCGT AAGGCCATCC AATATGGCAA TGACAGTGCC CGTTATAACC TTGGCATGCT CTATTATGCC AACCGTCAAT TTGGTAGTAT AGAAGAGGCC CTGCGGCTGT TGCAAGATGC CCAAAGTGCC GGGGTAGCCC AGGCTCAAAC CGCACTGGCT CGGATCTATC TTAGTAAAGA AAACAGCCAC TATAATCCTG AGTTGGGCGA GCGCTTTTTG CGTGAAGCCG CAGAACAGGG TGGGGCGGAT GCACAGGCTT TATTAGGGGT TTTATTAACC TTTAAAACCC CTCTTAAACA GGATTATGAG CAGGCTCTAA GGTGGTTAAA AAAGGGTGCT GAAGGGGGGA GTCCGGAGGC GCAGTTTCAC TTGGGCTATA TGTTACATTT GGGGGTAGGG CTGGCCCCCA ATGCTCACCG TGCGGTGCAT TGGTACCGCA AAGCCGCAGA ACAGGGCTTT GCTGAAGCGG CCAACAATTT GGGGACCCTC TATTTCCAGG GTAATGGGGT TGATCGAGAC GTTTTTAAAG CGGTGGAATG GTACACGCGC GGTGCCAAAC TTGGCCATGT GCCCGCCTTA CATAACTTGG GCAACCACTA CCGCCATGGG TTGGGGGTGG CCGTCGATGC CCGACTGGCA AGGCACTATT TTGAAAAAGC GCAGGCCGCC GGCTTTATGC CGTCTAAATT AGCCCTCGGT GAGATGCTGG AGAAGGGTGA AGGCGGGGTG GCCTCCCTTA AACGGGCAGA AGGGCTGTTT GGTGAGGTGG CCCGGTCGGG CAATATGGAT GGTAAATACC GGCTTGCCCG ACTCTACTTA ACCCATGGCC CAGAGGGTAA ACAAGTTTAT GCCATGCGTC TGCTCCAACA GACGGCTAAA TTGGGGCATC CCGCAGCGCA GTATGGTTTA GCGGCCCTCT ACCTAAAAGA GGCCGATCTT GAAGCCCCGT TGACGGATCG GGTTAAACAG GCTTATTCAC TGCTGTTGGC CAGCCGCCAC GGCGGTATGA GCGAAGCCGA ATCCCTGCTA GAGCGGCTGG AAAAACATTT GCCTGAAGGC GTGCGTCAAG AACTACAGCA CGCTTTTGAA AAAACGAAGG CAAAGCTGTA G
|
Protein sequence | MVKCRHSRVG VSRRAALFLA LIVVMSSGLS AGLAAPNLAE DVLLAGPQRT ALWPHGVALP TDRGSLLQLA NQGYRRAQFF MGLYLDYGIG GEAQPFEAFQ WYSRAAGQGS RWAWIKLGDL YFRGRGTARD AKKALQWYLH AGENGEPSGY LAAAMVQIRG TGDSIDWPTV LQRVHKAVQA GVLEGHTALC LLGVRHVVAT QLPPEQTRQQ CQMSADGGQP FAAILMAQWL AEQHNENNPQ LQKQVEQWYE RAAAMEVGNL AVLPEAPLVM AEHFPVVWIE ALEPLVSANR ASVPRAAYRA GVALLKSGQG FVARSRGVRW LQKGAELGDA NAQFRLGLAY AQGEGVVVNP ERAIYWYTLA SEQGEVSAQF NLALLYYQGR LVEQDFTKAR FWFEHASEQG DVQARDHLGD IYRHGRGIPV NIAEAMKWYR HAAEQKNVYA LTSMGDIYQA GEGVAEDAAE AAKWYRKAAL LGHAPAQGNL ADLYRQGKGV EKDLNQAAQW YTKAAEQGDM VSQNWLGTLY LDGDGVEKNP QLAQQWYEKS AAQGYAFAQN NLAVMLRDGL AGKADYKRAR QLFLLAARQN SGDAQNSLGV LYEKGLGGET DPIEAAAWYR KAIQYGNDSA RYNLGMLYYA NRQFGSIEEA LRLLQDAQSA GVAQAQTALA RIYLSKENSH YNPELGERFL REAAEQGGAD AQALLGVLLT FKTPLKQDYE QALRWLKKGA EGGSPEAQFH LGYMLHLGVG LAPNAHRAVH WYRKAAEQGF AEAANNLGTL YFQGNGVDRD VFKAVEWYTR GAKLGHVPAL HNLGNHYRHG LGVAVDARLA RHYFEKAQAA GFMPSKLALG EMLEKGEGGV ASLKRAEGLF GEVARSGNMD GKYRLARLYL THGPEGKQVY AMRLLQQTAK LGHPAAQYGL AALYLKEADL EAPLTDRVKQ AYSLLLASRH GGMSEAESLL ERLEKHLPEG VRQELQHAFE KTKAKL
|
| |