Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_2985 |
Symbol | |
ID | 4482713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | - |
Start bp | 3712669 |
End bp | 3715902 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639723732 |
Product | sulfotransferase |
Protein accession | YP_866882 |
Protein GI | 117926265 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGAA AAAAAAAGCA GAAAAAATTC CCCTCCAGCC AAGCTGCGGC CATTTCCGCT ACCGAGGCCC CACAGCAACC CACGACTGGA GATCGCCACA CCCCTCCCGC GCAGCGCTAC AGTGCGGCCA TGGCTTTGCA CCGTCGTGGT GCTTTAAACG AGGCCGAGGC TGCTTATAAA ACAGAGTTGC TGCAAGATCC TGATGACGCA GAGGGGCATC ATCTGCTGGG GCTGCTGTTG TATCAACAGC AGCGTCATGC CGAGGCACGC CCCCATTTGG CGCGGGTCAG TGCCCTTAGG CCACAGTCGG CCTACCACCA CTACCATTGT GGATTGGTCG ATATGGCGCT ACAGGATTTT GCCGCAGCCA CGACCTGTTT TCAAAAGGCT TTGCAGTATA AACCTGACTA TACGGATGCT CTCCTGAATC TAGGGGTGGT TTACCAGCAG CAGGGCAAGC CACAGGCTGC GGAACAGGCG TGGCTGCAAG CGATTCAGGT AGATCCAAGT TCAATGGGGG CCTACATCAA CCTGGGGCTG TTTTATCAAG AACAGCAGCG CTTGCCGGAT GCCAAACAGG TGTTGATGCA GGGGTTAAAG CACGCTCCGA ATGCGCTGCC CTTGCAGGAA AAACTGGCGC AGCTCTTGAC CACCCTAGGG GAGTACCCGG CGGCGTTGCC ACTGCTTGAG GCGGTGGCTC AGGCCAAGCC AACGGATTGG GGGGCGCAGC GGGCGCTTGG TTTGGCGCAA ATTCAGGTGG GGGCTTTTGC CGCAGCGGTG CCTCAACTGG AGCGGGTGGC CCAGGCAGAC GAAGGTAGGA TCGCAACCCT GCATGGGCTG GCCACCGCCC TGCAAGGCTG TGGACGGGTG GATGAAGCGG TGCAGGTGTG GTGGCGCATA CTCGATAACT TTCCTACCCA TAAAGATACC CTGTTTCAAC TGAGTTCAAC CCTTTTTCAT CAAGGTGAAA AAGAGGTCGC AGAACCCCTC TTTGCGCGTA TGGCCGCCTT GCATCCCCAT TCCCCCGAAG CCTTTGCCAA CTGGGGCCAT TGCCTTTCGG AGATGGGGCG GTTTGCTGAG GCAGAGCGGG CCTGCCGACA CGCCCTGCGC TTAAACCCAG CCACCTTAGA GGCGGGTATT GTGCTGGGTG GGCGGGTGCT GCGCCGATTG CACCGGCTGG AGGAGGCGGT GGCAGTGTGC CAAACCCATT TGGCCTACCA TCCTAACAAT CCGCGGGTGG CCAATAATCT GGCGATGATA TTGCTCAATG TGGGAAAGAT CGCCGAGGCT AAACAGGTCT GTTTGGAGGT GCTGGCCCAC GACCCCAACT ACTCGGACAC CCACATGAAC TTGAGTGTGA TCGAACTGGT GGAGGGGGAT CTACGGGCCG GGTTTAAACG TTATCAGCAC CGCTACCATA CCAAGCTGTT TGTGCAGGGC ATGCAGGCTC TGGCGGGGAA CACCCTCTGG GAGGGGCAGG ATCTCGCGGG TAAATCCCTG GTGGTGTTGC CCGAGCAGGG TTTGGGGGAT CAGTTGCAGA TGGTGCGCTA TCTGCCCCTC TTCAAAACGC GTGGGCTGCG CCGTTTGGAG GTGGTCTGTT CAAAGCCGCT GCTCGCTCTA TTCCAGCAGG TTGCAGGGGT GGATGCATGG CATACCAGTG CTGAGGCGCT CTCTATGCAG GCTTTTGATT ATTACTGCAT GGATATGAAT CTGCCCCACG GATTTGGGAC AGAGTTGGCC ACCATACCGG CGACTGTGCC CTATTTGCAG GCCGAGGCTG CCATTCAGGC GGCCTGGAAA AGCCGTCTGG ATGCCGCGTG TGACCAGCCG CTGCGGGTGG GGCTGGTGTG GGCCGGCAAC CCCAATCATG AGCGGGATCA GGCACGCTCC ATACCGTTAG CGCGCTTGGC CCCGCTGTTG GCGATTGAGG GGATCTCCTG GGTTTCCTTA CAACTGAATG CCAGCAGCGC GGATCGTAAC AGTCCGCTCT GGTCGAAACT GGTGCACCTG GAGTCCCATG TGAGCGATTT TGCCCAGACC GCAGGGTTGC TGGCTAACCT GGATCTGCTT ATCGCGGTGG ATACGTCAGT GGTGCATTTG GCGGGGGCCA TGGGCTGCCC GGTGTGGAGC TTAATCGCCT ATAGCCCTGA TTGGCGGTGG TTGTTGGAGC GCCAGGACTC GCCATGGTAC CCAACCATGA CGCTGTTTCG ACAATCGCGC TTAGGGGACT GGCCTGGGGT TGTGCGCGGG GTGGTAGAGG CGTTGGGTGA ACGGTTTAAG CTGCCCTTGC CTGTGCGGCC AGCGGCGGTG GAGCCCCTCG CCCAAGCGGA GCCTAACTTT AAGTCTCGCT GCAGCGCGGT GTTAACGGAT AAGCAGCAGC TATTTTTCTG TTTTGGCCCC CCCAAAAATG GGACCACCCT GCTGCAATAT CTGTTGGATC AGCATCCAGA GATCTCCTGC CCGGCGGAGC ATGATTTTAG CAAGCTAAAA ACGTCGCTGA ATACCGGCTG TGTGGAGTAT GCCGCCCATG CGGCACGGAT TCATAGCCGT ATTGGTGGGT TAAGCGTGGG GGGGGTGAAT CCTTTATTGT CGTCACGCGG CTTTCGGTTT ATGGTGGAGA CGATTATCCA AGATAGTGCC CGGGGACGAT CCATTATAGG GGCCAATGAT AACCAGATCA TTGAGGATAT GGAGGGGTAT TATCTGGATT TTGAGCGGCC CAAGATGATT GCGATTTTTC GCAATCCCAT CGATATGGCC CTCTCCTCCT GGAACCACAA CCATCTTCTG GCTGAGCAGG AGAAAAACGA TGCTCATTTG GAGGTTATGC GCCCCTATGG GGGTCTGGCC GGGTGGGCTG ATCGCATGGT AACGGAGTTT ATCCACCATG TGCAACAGGC GCTGGCCTTT CATGGACGCT ATGGTGATCT GCACGTTATG CGGTATGAGT CGCTGGTAAA ACAAAAGCGC CAGGTGAGCG CGGGGCTGTT TGATTATCTG GGGGCTAGTT GCAGCGATGC TCTGTTGGAG CGTATCGAGC AGCTGACTTC GTTACAAGCC ATGCGCGAAC AGGCGCGTCG GCGTGATTTT TTCCGGTCGG GTTCGGTGAA TATGGGGGCT GGTGCCTTGG ATGATGGGGT ACGTCAGCAT CTGCTGCAAC GGGCCCAACC TTGGTTGCAG CAGTTGGATG CTTTGATTGA AGGTCAACGG GCGACGGGAC AAGGGCCTGC ATAA
|
Protein sequence | MARKKKQKKF PSSQAAAISA TEAPQQPTTG DRHTPPAQRY SAAMALHRRG ALNEAEAAYK TELLQDPDDA EGHHLLGLLL YQQQRHAEAR PHLARVSALR PQSAYHHYHC GLVDMALQDF AAATTCFQKA LQYKPDYTDA LLNLGVVYQQ QGKPQAAEQA WLQAIQVDPS SMGAYINLGL FYQEQQRLPD AKQVLMQGLK HAPNALPLQE KLAQLLTTLG EYPAALPLLE AVAQAKPTDW GAQRALGLAQ IQVGAFAAAV PQLERVAQAD EGRIATLHGL ATALQGCGRV DEAVQVWWRI LDNFPTHKDT LFQLSSTLFH QGEKEVAEPL FARMAALHPH SPEAFANWGH CLSEMGRFAE AERACRHALR LNPATLEAGI VLGGRVLRRL HRLEEAVAVC QTHLAYHPNN PRVANNLAMI LLNVGKIAEA KQVCLEVLAH DPNYSDTHMN LSVIELVEGD LRAGFKRYQH RYHTKLFVQG MQALAGNTLW EGQDLAGKSL VVLPEQGLGD QLQMVRYLPL FKTRGLRRLE VVCSKPLLAL FQQVAGVDAW HTSAEALSMQ AFDYYCMDMN LPHGFGTELA TIPATVPYLQ AEAAIQAAWK SRLDAACDQP LRVGLVWAGN PNHERDQARS IPLARLAPLL AIEGISWVSL QLNASSADRN SPLWSKLVHL ESHVSDFAQT AGLLANLDLL IAVDTSVVHL AGAMGCPVWS LIAYSPDWRW LLERQDSPWY PTMTLFRQSR LGDWPGVVRG VVEALGERFK LPLPVRPAAV EPLAQAEPNF KSRCSAVLTD KQQLFFCFGP PKNGTTLLQY LLDQHPEISC PAEHDFSKLK TSLNTGCVEY AAHAARIHSR IGGLSVGGVN PLLSSRGFRF MVETIIQDSA RGRSIIGAND NQIIEDMEGY YLDFERPKMI AIFRNPIDMA LSSWNHNHLL AEQEKNDAHL EVMRPYGGLA GWADRMVTEF IHHVQQALAF HGRYGDLHVM RYESLVKQKR QVSAGLFDYL GASCSDALLE RIEQLTSLQA MREQARRRDF FRSGSVNMGA GALDDGVRQH LLQRAQPWLQ QLDALIEGQR ATGQGPA
|
| |