Gene Mmc1_2985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_2985 
Symbol 
ID4482713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp3712669 
End bp3715902 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content58% 
IMG OID639723732 
Productsulfotransferase 
Protein accessionYP_866882 
Protein GI117926265 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGAA AAAAAAAGCA GAAAAAATTC CCCTCCAGCC AAGCTGCGGC CATTTCCGCT 
ACCGAGGCCC CACAGCAACC CACGACTGGA GATCGCCACA CCCCTCCCGC GCAGCGCTAC
AGTGCGGCCA TGGCTTTGCA CCGTCGTGGT GCTTTAAACG AGGCCGAGGC TGCTTATAAA
ACAGAGTTGC TGCAAGATCC TGATGACGCA GAGGGGCATC ATCTGCTGGG GCTGCTGTTG
TATCAACAGC AGCGTCATGC CGAGGCACGC CCCCATTTGG CGCGGGTCAG TGCCCTTAGG
CCACAGTCGG CCTACCACCA CTACCATTGT GGATTGGTCG ATATGGCGCT ACAGGATTTT
GCCGCAGCCA CGACCTGTTT TCAAAAGGCT TTGCAGTATA AACCTGACTA TACGGATGCT
CTCCTGAATC TAGGGGTGGT TTACCAGCAG CAGGGCAAGC CACAGGCTGC GGAACAGGCG
TGGCTGCAAG CGATTCAGGT AGATCCAAGT TCAATGGGGG CCTACATCAA CCTGGGGCTG
TTTTATCAAG AACAGCAGCG CTTGCCGGAT GCCAAACAGG TGTTGATGCA GGGGTTAAAG
CACGCTCCGA ATGCGCTGCC CTTGCAGGAA AAACTGGCGC AGCTCTTGAC CACCCTAGGG
GAGTACCCGG CGGCGTTGCC ACTGCTTGAG GCGGTGGCTC AGGCCAAGCC AACGGATTGG
GGGGCGCAGC GGGCGCTTGG TTTGGCGCAA ATTCAGGTGG GGGCTTTTGC CGCAGCGGTG
CCTCAACTGG AGCGGGTGGC CCAGGCAGAC GAAGGTAGGA TCGCAACCCT GCATGGGCTG
GCCACCGCCC TGCAAGGCTG TGGACGGGTG GATGAAGCGG TGCAGGTGTG GTGGCGCATA
CTCGATAACT TTCCTACCCA TAAAGATACC CTGTTTCAAC TGAGTTCAAC CCTTTTTCAT
CAAGGTGAAA AAGAGGTCGC AGAACCCCTC TTTGCGCGTA TGGCCGCCTT GCATCCCCAT
TCCCCCGAAG CCTTTGCCAA CTGGGGCCAT TGCCTTTCGG AGATGGGGCG GTTTGCTGAG
GCAGAGCGGG CCTGCCGACA CGCCCTGCGC TTAAACCCAG CCACCTTAGA GGCGGGTATT
GTGCTGGGTG GGCGGGTGCT GCGCCGATTG CACCGGCTGG AGGAGGCGGT GGCAGTGTGC
CAAACCCATT TGGCCTACCA TCCTAACAAT CCGCGGGTGG CCAATAATCT GGCGATGATA
TTGCTCAATG TGGGAAAGAT CGCCGAGGCT AAACAGGTCT GTTTGGAGGT GCTGGCCCAC
GACCCCAACT ACTCGGACAC CCACATGAAC TTGAGTGTGA TCGAACTGGT GGAGGGGGAT
CTACGGGCCG GGTTTAAACG TTATCAGCAC CGCTACCATA CCAAGCTGTT TGTGCAGGGC
ATGCAGGCTC TGGCGGGGAA CACCCTCTGG GAGGGGCAGG ATCTCGCGGG TAAATCCCTG
GTGGTGTTGC CCGAGCAGGG TTTGGGGGAT CAGTTGCAGA TGGTGCGCTA TCTGCCCCTC
TTCAAAACGC GTGGGCTGCG CCGTTTGGAG GTGGTCTGTT CAAAGCCGCT GCTCGCTCTA
TTCCAGCAGG TTGCAGGGGT GGATGCATGG CATACCAGTG CTGAGGCGCT CTCTATGCAG
GCTTTTGATT ATTACTGCAT GGATATGAAT CTGCCCCACG GATTTGGGAC AGAGTTGGCC
ACCATACCGG CGACTGTGCC CTATTTGCAG GCCGAGGCTG CCATTCAGGC GGCCTGGAAA
AGCCGTCTGG ATGCCGCGTG TGACCAGCCG CTGCGGGTGG GGCTGGTGTG GGCCGGCAAC
CCCAATCATG AGCGGGATCA GGCACGCTCC ATACCGTTAG CGCGCTTGGC CCCGCTGTTG
GCGATTGAGG GGATCTCCTG GGTTTCCTTA CAACTGAATG CCAGCAGCGC GGATCGTAAC
AGTCCGCTCT GGTCGAAACT GGTGCACCTG GAGTCCCATG TGAGCGATTT TGCCCAGACC
GCAGGGTTGC TGGCTAACCT GGATCTGCTT ATCGCGGTGG ATACGTCAGT GGTGCATTTG
GCGGGGGCCA TGGGCTGCCC GGTGTGGAGC TTAATCGCCT ATAGCCCTGA TTGGCGGTGG
TTGTTGGAGC GCCAGGACTC GCCATGGTAC CCAACCATGA CGCTGTTTCG ACAATCGCGC
TTAGGGGACT GGCCTGGGGT TGTGCGCGGG GTGGTAGAGG CGTTGGGTGA ACGGTTTAAG
CTGCCCTTGC CTGTGCGGCC AGCGGCGGTG GAGCCCCTCG CCCAAGCGGA GCCTAACTTT
AAGTCTCGCT GCAGCGCGGT GTTAACGGAT AAGCAGCAGC TATTTTTCTG TTTTGGCCCC
CCCAAAAATG GGACCACCCT GCTGCAATAT CTGTTGGATC AGCATCCAGA GATCTCCTGC
CCGGCGGAGC ATGATTTTAG CAAGCTAAAA ACGTCGCTGA ATACCGGCTG TGTGGAGTAT
GCCGCCCATG CGGCACGGAT TCATAGCCGT ATTGGTGGGT TAAGCGTGGG GGGGGTGAAT
CCTTTATTGT CGTCACGCGG CTTTCGGTTT ATGGTGGAGA CGATTATCCA AGATAGTGCC
CGGGGACGAT CCATTATAGG GGCCAATGAT AACCAGATCA TTGAGGATAT GGAGGGGTAT
TATCTGGATT TTGAGCGGCC CAAGATGATT GCGATTTTTC GCAATCCCAT CGATATGGCC
CTCTCCTCCT GGAACCACAA CCATCTTCTG GCTGAGCAGG AGAAAAACGA TGCTCATTTG
GAGGTTATGC GCCCCTATGG GGGTCTGGCC GGGTGGGCTG ATCGCATGGT AACGGAGTTT
ATCCACCATG TGCAACAGGC GCTGGCCTTT CATGGACGCT ATGGTGATCT GCACGTTATG
CGGTATGAGT CGCTGGTAAA ACAAAAGCGC CAGGTGAGCG CGGGGCTGTT TGATTATCTG
GGGGCTAGTT GCAGCGATGC TCTGTTGGAG CGTATCGAGC AGCTGACTTC GTTACAAGCC
ATGCGCGAAC AGGCGCGTCG GCGTGATTTT TTCCGGTCGG GTTCGGTGAA TATGGGGGCT
GGTGCCTTGG ATGATGGGGT ACGTCAGCAT CTGCTGCAAC GGGCCCAACC TTGGTTGCAG
CAGTTGGATG CTTTGATTGA AGGTCAACGG GCGACGGGAC AAGGGCCTGC ATAA
 
Protein sequence
MARKKKQKKF PSSQAAAISA TEAPQQPTTG DRHTPPAQRY SAAMALHRRG ALNEAEAAYK 
TELLQDPDDA EGHHLLGLLL YQQQRHAEAR PHLARVSALR PQSAYHHYHC GLVDMALQDF
AAATTCFQKA LQYKPDYTDA LLNLGVVYQQ QGKPQAAEQA WLQAIQVDPS SMGAYINLGL
FYQEQQRLPD AKQVLMQGLK HAPNALPLQE KLAQLLTTLG EYPAALPLLE AVAQAKPTDW
GAQRALGLAQ IQVGAFAAAV PQLERVAQAD EGRIATLHGL ATALQGCGRV DEAVQVWWRI
LDNFPTHKDT LFQLSSTLFH QGEKEVAEPL FARMAALHPH SPEAFANWGH CLSEMGRFAE
AERACRHALR LNPATLEAGI VLGGRVLRRL HRLEEAVAVC QTHLAYHPNN PRVANNLAMI
LLNVGKIAEA KQVCLEVLAH DPNYSDTHMN LSVIELVEGD LRAGFKRYQH RYHTKLFVQG
MQALAGNTLW EGQDLAGKSL VVLPEQGLGD QLQMVRYLPL FKTRGLRRLE VVCSKPLLAL
FQQVAGVDAW HTSAEALSMQ AFDYYCMDMN LPHGFGTELA TIPATVPYLQ AEAAIQAAWK
SRLDAACDQP LRVGLVWAGN PNHERDQARS IPLARLAPLL AIEGISWVSL QLNASSADRN
SPLWSKLVHL ESHVSDFAQT AGLLANLDLL IAVDTSVVHL AGAMGCPVWS LIAYSPDWRW
LLERQDSPWY PTMTLFRQSR LGDWPGVVRG VVEALGERFK LPLPVRPAAV EPLAQAEPNF
KSRCSAVLTD KQQLFFCFGP PKNGTTLLQY LLDQHPEISC PAEHDFSKLK TSLNTGCVEY
AAHAARIHSR IGGLSVGGVN PLLSSRGFRF MVETIIQDSA RGRSIIGAND NQIIEDMEGY
YLDFERPKMI AIFRNPIDMA LSSWNHNHLL AEQEKNDAHL EVMRPYGGLA GWADRMVTEF
IHHVQQALAF HGRYGDLHVM RYESLVKQKR QVSAGLFDYL GASCSDALLE RIEQLTSLQA
MREQARRRDF FRSGSVNMGA GALDDGVRQH LLQRAQPWLQ QLDALIEGQR ATGQGPA