Gene Mmc1_1533 details

Gene Information

Locus tag: Mmc1_1533
Symbol:
ID: 4482858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 1884052
End bp: 1886982
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 56%
IMG OID: 639722275
Product: TPR repeat-containing protein
Protein accession: YP_865448
Protein GI: 117924831
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
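
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both reproduce its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue (assumed to be counted).
    return codons - 1 if includes_stop else codons


length = gene_length(1884052, 1886982)
print(length)                  # 2931
print(protein_length(length))  # 976
```

Both values match the Gene Length and Protein Length fields of the record.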


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAAAT GTAGACACAG CAGGGTAGGT GTGAGCAGAC GCGCTGCCCT GTTTTTAGCG 
CTCATCGTGG TGATGTCGTC CGGCCTATCT GCTGGCTTGG CAGCGCCCAA TCTGGCCGAG
GATGTTCTGT TGGCTGGCCC GCAGCGTACC GCGTTGTGGC CTCATGGGGT CGCGCTTCCA
ACAGACCGTG GGAGCCTGTT ACAACTGGCC AATCAAGGCT ACCGTCGAGC CCAGTTTTTT
ATGGGGCTTT ATCTGGATTA TGGTATCGGC GGAGAGGCTC AACCTTTTGA GGCCTTTCAA
TGGTATAGCC GCGCAGCGGG GCAGGGCAGT CGTTGGGCTT GGATTAAATT GGGTGATCTC
TATTTCCGTG GCCGTGGCAC CGCACGGGAT GCCAAAAAAG CGCTGCAATG GTACCTGCAT
GCCGGGGAAA ATGGCGAGCC TAGTGGCTAT CTGGCTGCCG CAATGGTACA GATTCGTGGT
ACGGGCGATA GCATTGATTG GCCAACCGTT TTACAGCGTG TGCACAAGGC TGTGCAAGCG
GGGGTATTGG AGGGGCACAC AGCACTCTGT CTCTTAGGGG TGCGACATGT GGTGGCGACG
CAGCTACCCC CAGAGCAGAC CCGCCAACAG TGCCAAATGA GTGCGGATGG GGGGCAGCCC
TTTGCCGCGA TTTTGATGGC CCAATGGCTG GCTGAGCAGC ATAACGAGAA CAATCCCCAA
CTGCAAAAAC AGGTCGAACA GTGGTACGAG CGGGCTGCTG CCATGGAGGT AGGTAACTTG
GCCGTTCTGC CCGAGGCACC ACTGGTGATG GCCGAGCATT TCCCCGTTGT TTGGATAGAG
GCTCTAGAAC CACTGGTCAG TGCCAACCGC GCCAGCGTGC CGCGTGCGGC CTATCGTGCT
GGGGTGGCGC TGCTCAAGTC GGGGCAGGGT TTTGTGGCGC GTAGTCGCGG CGTGCGCTGG
TTACAGAAGG GCGCGGAGTT GGGCGATGCC AATGCTCAAT TTAGACTGGG GCTTGCCTAT
GCCCAGGGTG AAGGGGTGGT GGTTAATCCA GAACGCGCCA TTTACTGGTA TACCCTTGCG
TCGGAGCAGG GTGAAGTTTC GGCACAGTTT AATTTGGCCT TGCTCTACTA CCAAGGGCGT
TTGGTTGAGC AGGATTTTAC CAAGGCCCGT TTTTGGTTTG AGCATGCCTC GGAACAGGGT
GACGTGCAGG CGCGGGATCA TCTGGGGGAT ATTTATCGTC ATGGCCGGGG CATACCGGTC
AATATAGCCG AGGCCATGAA GTGGTACCGC CACGCTGCCG AGCAGAAAAA TGTCTACGCG
CTTACCTCAA TGGGGGATAT CTACCAAGCG GGTGAGGGGG TTGCCGAGGA TGCCGCAGAG
GCCGCAAAAT GGTACCGCAA AGCGGCGCTG CTGGGTCATG CCCCGGCGCA GGGTAACTTG
GCCGACCTTT ATCGACAGGG TAAAGGGGTA GAAAAGGATC TTAATCAGGC CGCACAGTGG
TATACAAAAG CTGCTGAACA GGGGGACATG GTCTCTCAAA ACTGGCTGGG CACGCTCTAC
CTGGATGGGG ACGGTGTTGA AAAAAATCCC CAGTTGGCGC AGCAATGGTA CGAAAAATCA
GCGGCTCAAG GGTACGCTTT TGCCCAGAAC AATTTGGCGG TTATGTTGCG GGATGGCTTG
GCGGGCAAGG CTGATTATAA GCGAGCCAGA CAGCTCTTTT TGTTAGCTGC TCGGCAAAAC
AGTGGTGATG CGCAAAATAG TCTTGGCGTG CTTTATGAAA AGGGGTTAGG GGGCGAAACC
GATCCCATTG AGGCTGCTGC GTGGTATCGT AAGGCCATCC AATATGGCAA TGACAGTGCC
CGTTATAACC TTGGCATGCT CTATTATGCC AACCGTCAAT TTGGTAGTAT AGAAGAGGCC
CTGCGGCTGT TGCAAGATGC CCAAAGTGCC GGGGTAGCCC AGGCTCAAAC CGCACTGGCT
CGGATCTATC TTAGTAAAGA AAACAGCCAC TATAATCCTG AGTTGGGCGA GCGCTTTTTG
CGTGAAGCCG CAGAACAGGG TGGGGCGGAT GCACAGGCTT TATTAGGGGT TTTATTAACC
TTTAAAACCC CTCTTAAACA GGATTATGAG CAGGCTCTAA GGTGGTTAAA AAAGGGTGCT
GAAGGGGGGA GTCCGGAGGC GCAGTTTCAC TTGGGCTATA TGTTACATTT GGGGGTAGGG
CTGGCCCCCA ATGCTCACCG TGCGGTGCAT TGGTACCGCA AAGCCGCAGA ACAGGGCTTT
GCTGAAGCGG CCAACAATTT GGGGACCCTC TATTTCCAGG GTAATGGGGT TGATCGAGAC
GTTTTTAAAG CGGTGGAATG GTACACGCGC GGTGCCAAAC TTGGCCATGT GCCCGCCTTA
CATAACTTGG GCAACCACTA CCGCCATGGG TTGGGGGTGG CCGTCGATGC CCGACTGGCA
AGGCACTATT TTGAAAAAGC GCAGGCCGCC GGCTTTATGC CGTCTAAATT AGCCCTCGGT
GAGATGCTGG AGAAGGGTGA AGGCGGGGTG GCCTCCCTTA AACGGGCAGA AGGGCTGTTT
GGTGAGGTGG CCCGGTCGGG CAATATGGAT GGTAAATACC GGCTTGCCCG ACTCTACTTA
ACCCATGGCC CAGAGGGTAA ACAAGTTTAT GCCATGCGTC TGCTCCAACA GACGGCTAAA
TTGGGGCATC CCGCAGCGCA GTATGGTTTA GCGGCCCTCT ACCTAAAAGA GGCCGATCTT
GAAGCCCCGT TGACGGATCG GGTTAAACAG GCTTATTCAC TGCTGTTGGC CAGCCGCCAC
GGCGGTATGA GCGAAGCCGA ATCCCTGCTA GAGCGGCTGG AAAAACATTT GCCTGAAGGC
GTGCGTCAAG AACTACAGCA CGCTTTTGAA AAAACGAAGG CAAAGCTGTA G
 
Protein sequence
MVKCRHSRVG VSRRAALFLA LIVVMSSGLS AGLAAPNLAE DVLLAGPQRT ALWPHGVALP 
TDRGSLLQLA NQGYRRAQFF MGLYLDYGIG GEAQPFEAFQ WYSRAAGQGS RWAWIKLGDL
YFRGRGTARD AKKALQWYLH AGENGEPSGY LAAAMVQIRG TGDSIDWPTV LQRVHKAVQA
GVLEGHTALC LLGVRHVVAT QLPPEQTRQQ CQMSADGGQP FAAILMAQWL AEQHNENNPQ
LQKQVEQWYE RAAAMEVGNL AVLPEAPLVM AEHFPVVWIE ALEPLVSANR ASVPRAAYRA
GVALLKSGQG FVARSRGVRW LQKGAELGDA NAQFRLGLAY AQGEGVVVNP ERAIYWYTLA
SEQGEVSAQF NLALLYYQGR LVEQDFTKAR FWFEHASEQG DVQARDHLGD IYRHGRGIPV
NIAEAMKWYR HAAEQKNVYA LTSMGDIYQA GEGVAEDAAE AAKWYRKAAL LGHAPAQGNL
ADLYRQGKGV EKDLNQAAQW YTKAAEQGDM VSQNWLGTLY LDGDGVEKNP QLAQQWYEKS
AAQGYAFAQN NLAVMLRDGL AGKADYKRAR QLFLLAARQN SGDAQNSLGV LYEKGLGGET
DPIEAAAWYR KAIQYGNDSA RYNLGMLYYA NRQFGSIEEA LRLLQDAQSA GVAQAQTALA
RIYLSKENSH YNPELGERFL REAAEQGGAD AQALLGVLLT FKTPLKQDYE QALRWLKKGA
EGGSPEAQFH LGYMLHLGVG LAPNAHRAVH WYRKAAEQGF AEAANNLGTL YFQGNGVDRD
VFKAVEWYTR GAKLGHVPAL HNLGNHYRHG LGVAVDARLA RHYFEKAQAA GFMPSKLALG
EMLEKGEGGV ASLKRAEGLF GEVARSGNMD GKYRLARLYL THGPEGKQVY AMRLLQQTAK
LGHPAAQYGL AALYLKEADL EAPLTDRVKQ AYSLLLASRH GGMSEAESLL ERLEKHLPEG
VRQELQHAFE KTKAKL