Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_2252 |
Symbol | |
ID | 4481596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | - |
Start bp | 2877320 |
End bp | 2880388 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639723001 |
Product | TPR repeat-containing protein |
Protein accession | YP_866159 |
Protein GI | 117925542 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0647167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCATT CCACACCGCG AACCGTGTCT CACTGGATGG AGCAATCTCT AGCTGACGTA GCGGATGCGT CGCAGGCTCA GGGGGATGCC ACAGCTACGC CTGCTATTGA GCCCATGGAC GATACGCAAC GCGCGCTCTT CCCGAATCAT GATGATCTGT TTTTGCCTAT GCATCGCCTA GACCCTGAGC ATGTTTCTGA TCTTCTGATG GCTCCAGCAC AGCGTCGTGG TTATATAGAG GTAGCCGCTT CATCTGCTGT GCAGAAGGAG GCGATGGAGG CGCCGTCGCC CTTGAATAGT CGTGCGGAGT CCACACGTCA GTTCAGCAAG GTAGAGGAAG AGCGTCATCG CCGCGTGGTG CAGGTAGGCA CACCTATCAA CCGCAGCAAT GTGGTGATTT TGCCGAAAAG GGAGATGGTT GGTGCCGTTG ATTATGCGGC GTTTTTACGT AGCCGTGAGC AGCACACATC ACAGCATGGG CCGGTTGTTG AAGAGACTCC TGATTTCAAC ATGGAGAATC AATCTCAACA GCCAGTTGGT GGCTTGCTTG AGGCGACTTA TCCCCTCTCT CGTGGTTCTG GCGTTGTCTC TGAGAATAAT GCTCAATGGG TGGAAAAGCC GGGCTTGCGG GATGGCCTAA AGGTCGAGCC CTCTCATGCA TTGCGGGAGG GTTCAGAGCT TTTGGAGGCT TCTGCGTTCA AGGCGTCACC TCTTGACAGG CTGGAACACG AAGAGGGTGC GCTTTTATTG ACAGAGACGG GGTGGTCCAA CGAATCACGG CGGGAAGTGG TTGCGCCTGC ACGAGAAGAG GCGGACGTCA CCCAAAATAG TGAGCGCTTA TTTCAGCAAA ATTCAGTGCC TTCGCATGAA CAAGTGATGA CCTTTGTGCA GATTGATAAT CTGCTCAATC AGGTCTTGGC GGATCGATTG CTTGTGCAAA GCCAAGAACC CTCAAACAGC GGTTCGAATG TCGATTTTCC TGCCATTTCT CTCCTGCACA CAGATCAAAC AGCGTTTTTG GGGCTTCCTG ATACAATCCT TGATCCACAC CGCCCTCGTG TGGATGAACA GGGCGGGCTG TTTTACTCGG ATCTACCACC TGCTGCCGAA CCGCACATGT TGGACGAAAG CGAAAACCTA GGATCTAGGC CAGGTGTTGT AGCGGCTCCA GAAGCCGAGT CCGTTCAGGA GGTCGAGTCC GTTCAGGAGG TCGAGCCCGT TCAGGAGGTC GAGCCCGTCC AGGAGGTCGA GTCCGTTCAG GAGGTCCAGC CCGTTCAGGA GGTCGAGTCC GTTCAGGAGG TCGAGTCCGT TCAGGAGGTC GAGCCCGTTC AGGAGGTCGA GCCCGTCCAG GAGGTCGAGT CCGTTCAGGA GGTCCAGCCC GTTCAGGAGG TTGAGTCCGT TCAGGAGGTC GAGCCCGTCC AGGAGGTCGA GCCCGTTCAG GAGGTCGAGT CCGTTCAGGA GGTCGAGCCC GTTCAGGAGG TCGAGTCCGT TCAGGAGGCC GAGTCCGTTC AGGAGGTCGA GCCCGTCCAG GAGGCCGAGT CCGTTCAGGA GGTCGAGTCC GCCCCAGTGG AAGAGGAGCG CGGGGTAGAG AAAACCCCCG GCGTGGTGCC AACGGCAGAA GCTACTCACG AAGCGGCGCG TGGGGCTGTA GAAAATGGTT CTGAAGGTGA TGCTTCTGGC GGGTTGCAGC ATGGGTCGGA TAGGCCAGTC CACCGCGCAC AGCGGCCTGA CGGATACGCT TCAGCGTTTT CCGCTCCTGT TGCATCCATA GCGGCCAATA AGAGTGCCGC GGGGATGGAT CCCGCGCAGC TCCACAAACC GGGGTCGGGT CTGTCAGGGC CTGAGGCCGG TGGCAAGCAA CAGGCTACCT CGGTGGTCCA GGGAAAAGAG GCAATGGCGT TGGCGGGGGG GGCAGTGCAG GGGGGCAGCA CGTTTACGCA AGCGAGCTTT GACTCTGCTT CGAGCGTTAC CGTTAGCCGA GTGGCACAAG ATATTTTGGA ACAGCTCGAC AGAGATGAAC AGCTTAGTGA GCCAACCTCA AGCCACGGGT TGGGAACGGT GCAAATACCA GCCCATGAGC TGGCCGATGT TCAATTGGAT GAGCAGAGCG GTCTAATTGT CGATGAGATC CCGGTCTTAG AGACAACCCG CATCCAACAA CGTCGCTCTT TAGCCAATCA AGCCCGGGAA CAGGTGGCTC AAAAAGCCGC TGCGAACCCA AAAGAAAAGG AGTATGGGTC TTCTTTTTCA TGGGCCAACG GTTTTTTGTT TGGACGAATG AAGTCTCATC GTATAAAACT GGCCGCCGCA GCCCTAGAAG CTCCTGTGCG TATCGAGCCG CAAATTCGCA GGCCGAAACC GCTGGTCAGT CGTCTTGGCC AACAAAGTGG TTTGCCCAAA ACCATGTGGC TGGCTTTTGG TTGGTATGCC TGTGGGGTGA TTCTGGAAAA TCTAAGTTCG GGGGCCATCC GTTGGTACCA GGGTTTAACC CGCTTGGATG GGGATGTGCA GGTCGCGTTC CTTATGGATC GTGCCCGGCA TCTGCAACGC CTGGGACGGC AGGATGAGGC CCAACCCTTG TTGGAGCATG TGCTTAATGC CCCACAGGCA CCGTGTTCGG CCTATCTGTT ACTTGCGGAT GTGTTTAAAG CACAGGGAGC CCCTGATCGC GAAGAGCAGG TGCTGTTTAA GGCGCTTAAT AAGGGTTATG ATGACTGTGA GCTCTTTCTG CGTCTGGGAC TGTTAATGCA GCGGAGCAAA CGGCGTGAGT CGGCCATTGC CTACCTGCGG CAATGTGTGG CGATCAATCC CGATCTCTAC GAAGGTCTTT GCGCTTTAGC CCAGAATCTA GGCCATGCCG GTTTACCCCA TGAGGGGTTG GCACATCTTG AGCGGGCTCG GGAGATCCGT CCATACGATC CCCATGCCTT TATGCTGACA GCGGAAATCC TTGAGCGTTT GAATGAGCAT GAAGATGCCG CGCTATTCTA CCAACATGTT GATGAACTGA ATCAGATGTT ACAAGTCAAA CCTGCCCACG AGCAGGTGGT CGAGGATGAA TTGAGTTGA
|
Protein sequence | MSHSTPRTVS HWMEQSLADV ADASQAQGDA TATPAIEPMD DTQRALFPNH DDLFLPMHRL DPEHVSDLLM APAQRRGYIE VAASSAVQKE AMEAPSPLNS RAESTRQFSK VEEERHRRVV QVGTPINRSN VVILPKREMV GAVDYAAFLR SREQHTSQHG PVVEETPDFN MENQSQQPVG GLLEATYPLS RGSGVVSENN AQWVEKPGLR DGLKVEPSHA LREGSELLEA SAFKASPLDR LEHEEGALLL TETGWSNESR REVVAPAREE ADVTQNSERL FQQNSVPSHE QVMTFVQIDN LLNQVLADRL LVQSQEPSNS GSNVDFPAIS LLHTDQTAFL GLPDTILDPH RPRVDEQGGL FYSDLPPAAE PHMLDESENL GSRPGVVAAP EAESVQEVES VQEVEPVQEV EPVQEVESVQ EVQPVQEVES VQEVESVQEV EPVQEVEPVQ EVESVQEVQP VQEVESVQEV EPVQEVEPVQ EVESVQEVEP VQEVESVQEA ESVQEVEPVQ EAESVQEVES APVEEERGVE KTPGVVPTAE ATHEAARGAV ENGSEGDASG GLQHGSDRPV HRAQRPDGYA SAFSAPVASI AANKSAAGMD PAQLHKPGSG LSGPEAGGKQ QATSVVQGKE AMALAGGAVQ GGSTFTQASF DSASSVTVSR VAQDILEQLD RDEQLSEPTS SHGLGTVQIP AHELADVQLD EQSGLIVDEI PVLETTRIQQ RRSLANQARE QVAQKAAANP KEKEYGSSFS WANGFLFGRM KSHRIKLAAA ALEAPVRIEP QIRRPKPLVS RLGQQSGLPK TMWLAFGWYA CGVILENLSS GAIRWYQGLT RLDGDVQVAF LMDRARHLQR LGRQDEAQPL LEHVLNAPQA PCSAYLLLAD VFKAQGAPDR EEQVLFKALN KGYDDCELFL RLGLLMQRSK RRESAIAYLR QCVAINPDLY EGLCALAQNL GHAGLPHEGL AHLERAREIR PYDPHAFMLT AEILERLNEH EDAALFYQHV DELNQMLQVK PAHEQVVEDE LS
|
| |