Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_1430 |
Symbol | |
ID | 4480659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | - |
Start bp | 1735595 |
End bp | 1738423 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639722173 |
Product | Sel1 domain-containing protein |
Protein accession | YP_865347 |
Protein GI | 117924730 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.356598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGTA CCCCCCTAAG CTTGGCCATC GCAGCCTGTA TCTTCAGCCC TCTGCTGTTG ACCTTGGTAT GGTTGCAGCC CCTGCATGCC GAAGAGCTTT CGGCGGAGGC ACTTTATCGA CAGGGATACC GCTATTTTTA TGGTCAAGGC GGTGTGGCTG TGGATCAAAA GCAGGCCTTT CAATATTACC AGCACGCAGG TAATTTGGGG CACCCTGCCG CCCAGTATGC CACCGGATGG ATGCTGATGA CCGGTCGGGG GATTGCCAAA AATCATGTCG AAGCGCTGCC CTGGCTAGAA AAAGCAGCTT CAAGTGGCGA TGCCAAAGCC CAATATTTTA CAGGCATGCT CAAACTACAG GGTGAAGGCA TTACCCCTGA GCCAAGCCAA GCCGTAGATT GGATTACCCA AGCCGCCAAC CAAAACTATG CCATTGCCCA GCGCCGTTTG GGTCTTCTGT ACCAGCAGGG CAAGCATGTC GCGCTCGATC CTAAACGCTC CCATGATTGG CTCGAAAAAG CGGCCGCACA GGGGGATGCT CAAGCCAAGC AGTTGCTCTA CGCCACCAAG GTTGCCCGCT ATCACCCCCC CGTAGCCGCT CCTGCTGCGG GGGGTAACGC GGCGAGTACC AGCACTCCGC CCCCTGCCGT GGCCGCCAAG AGACCCGGCA ACAAAGCCGC TGCGGCTACC CAACCCTCAA CGCCCCATAG CCCCGTGCAG GCAAGCACGC CTGCCCGCCA ACCCGTGGTC ACCGCCCCCA CCACCCGTCA ACCCGCAGCA AACTCCACAG CTGCGGTTAC TACCCCAACA GCCCCCACAC CCGCGGTTAC CACCCCGGCG GCCAGCCAAC CAGCGGCATT AAACTTTACT GAGCAACGCT ATCCAGGTCC TACCCCATCT GCGGCAACGC CCGCGCCAAG CTCAACCGCG TTGGCCCCAG TGCAATCCCG CATCAACCAA CCTACCCCCA TGGCTGCGGT GGCACCACAC GTTGCAGCGG CACCACCATC CCCCATGGCT GCGATGGCAC CATACGTTGC AGCGGCACCA CCATCCCCCA TGGCTGCGAT GGCACCATAC GTTGCAGCGG CACCACCATC CCCCATGGCG ACGCTTGCCA CAGACGAGAT ACCCGTCCAG GCAGATCTTG CGATGGCACC GGACTCAGCC CGTTTACCAG AGCCTATCAA CCTAAAACAA CTTACCGTGG CACAGACCGT CTCGTCACCG CCCCCTCTCC TGAGTAACCT TCCAGCCAGT TGGCTGCAAC GTAAGCAGAT GTTGCGCTAT AAATTTAGGC AGTGGGTACC CGAAGAGAGC AACTACCCGG CCTCCTTTCA CTATTTTTTG GCGGTTAAGC AGCATTTAAC ACGCGCTAAA GCGGAACCCC AACAGCTTAA TGATGCCGTT GCCCTAACCC TGCTCTACAC CGAGGAGGGC GGCCCGCAAC CGGCCGACTG GTTGTTTACC CTACCCCCGC AAATGTTTGA TGCCCAACAG CAGGAGCGCG TGGTCGAAGA GCTACAGATG CGCGCCGACG AGGGTAGCCC TGCGGCCAAC TATTTTTTAG GCATGCGCCA ACTGCATCAT GATGGGGATC TCAAGCAGGG TGTGGCACGC ATTGAACAGA GCGCTACCCA CGGTTTTCCC CTGGCACAAC ACCGCATGTT CCTGCTCCAC CTTCAGGGTC CAGACGCCTA TCGCGATCCC GCTAAGGCCC TAGCTTGGGG CGAAAAAGCG GCGGCACACC AGCAGCTAGA GACCCAAGCC CGCTTAGGGC TAGAACAGCT TACCGGCAAC TGGTTGCACA AAGATCAACA CGCGGGGCTC ACGCATTTGC GTCAGGTGGC CCAGGCCGAT GATGGGGCCG CGCTACGCGC TCTACAAGAG GCCCTTAATC CCAAGGGGGT CAACTATCCA GATGCCCCGG CTCTGCTCAC CCACACCCTG CAATGGTTAC GCAACTATGG CGCCCATCAA GGGCGCGACT ATGACCTGGT CATGCAGCTC TTTTTGCGCC AGGAACCGCC CCTCAACCTC ATTCAACGAC TAGAGCGCCT GCCTGCCCCC GCCCCAGTGG CGCAAGGGCA AGCCCTTGAG CCCTTGGTTG AGGAAGATTT TGATCAACTA AGCCAGCACA TTGATCGGCT GTGGCGTCGC CATTATGGGG ATCAACAGGC CCTGCAAGCC CAGCAGACGG CATTACCTGC CCGTGCACTC TCGCCCAAAT TGCTCGAAAC CTTGGAACAG GCGGCCTTTA ACAACATCCC CCAAGCGCAA CAGCGTTTGG CGCATCTCTA TTTGACAGGG ATTTGGGTAA AGCGTGACCT GGATAAAGCG CGTTACTGGC TGTTTCGCCA TGCCCGTAGT GGGAATGAAC CCGATCAAGC CCGTAGTCAG GGCCTGCTGG CCCTGTTGAT GGATGCCCAA CTGGGCCAAC GCAGCGTCAC CGCCCAGGGG TGGATGAATC GTGCGCAAAA GTGGAATTTT CTCGCTCTGC TCGATGGTGT GCTGTTAAGC AAGCCCGATC TTGCCATTCT ACCAAGCACC CCCTATAAAT CCTCTAGCCG CACCTTAGAT GGTCTCTACC GCAAGAGCGC CCTGGGGCTT TATCAACTCA AAAAGGGCGG TAACCCTGCC CCGCTCTTAC AGGGCATACA CCAAGCAGCC AAGCAGGGGC ATGCTCAGGC ACAGTTTGTA TTAGGGCTGC TCTATCTGGA TGGTATCGGG CTTGAGGCCT CCCCCAGCAA GGCCACCCAC TGGCACCAAC TGGCCCTGGC CCAGGGCTAC CAGCGTAGCC CTCTACAGGT TGCCGATGCC ATGCGCTGA
|
Protein sequence | MMRTPLSLAI AACIFSPLLL TLVWLQPLHA EELSAEALYR QGYRYFYGQG GVAVDQKQAF QYYQHAGNLG HPAAQYATGW MLMTGRGIAK NHVEALPWLE KAASSGDAKA QYFTGMLKLQ GEGITPEPSQ AVDWITQAAN QNYAIAQRRL GLLYQQGKHV ALDPKRSHDW LEKAAAQGDA QAKQLLYATK VARYHPPVAA PAAGGNAAST STPPPAVAAK RPGNKAAAAT QPSTPHSPVQ ASTPARQPVV TAPTTRQPAA NSTAAVTTPT APTPAVTTPA ASQPAALNFT EQRYPGPTPS AATPAPSSTA LAPVQSRINQ PTPMAAVAPH VAAAPPSPMA AMAPYVAAAP PSPMAAMAPY VAAAPPSPMA TLATDEIPVQ ADLAMAPDSA RLPEPINLKQ LTVAQTVSSP PPLLSNLPAS WLQRKQMLRY KFRQWVPEES NYPASFHYFL AVKQHLTRAK AEPQQLNDAV ALTLLYTEEG GPQPADWLFT LPPQMFDAQQ QERVVEELQM RADEGSPAAN YFLGMRQLHH DGDLKQGVAR IEQSATHGFP LAQHRMFLLH LQGPDAYRDP AKALAWGEKA AAHQQLETQA RLGLEQLTGN WLHKDQHAGL THLRQVAQAD DGAALRALQE ALNPKGVNYP DAPALLTHTL QWLRNYGAHQ GRDYDLVMQL FLRQEPPLNL IQRLERLPAP APVAQGQALE PLVEEDFDQL SQHIDRLWRR HYGDQQALQA QQTALPARAL SPKLLETLEQ AAFNNIPQAQ QRLAHLYLTG IWVKRDLDKA RYWLFRHARS GNEPDQARSQ GLLALLMDAQ LGQRSVTAQG WMNRAQKWNF LALLDGVLLS KPDLAILPST PYKSSSRTLD GLYRKSALGL YQLKKGGNPA PLLQGIHQAA KQGHAQAQFV LGLLYLDGIG LEASPSKATH WHQLALAQGY QRSPLQVADA MR
|
| |