Gene Mmc1_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_0203 
Symbol 
ID4483449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp223407 
End bp224483 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content62% 
IMG OID639720948 
ProductSel1 domain-containing protein 
Protein accessionYP_864136 
Protein GI117923519 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCGCT GGATGTTGAT CCTCATGCTG TGCATACCAC GTGGTCTGTA TGGGGCCGCT 
TTACCGTGGC AGGAAGTGCA GAGTGCGGCG GTGGGTGGCC ATGTAGAGGC GCAACGTATG
CTGGGCCAGT TCTACCTGGA AGGTGAGGGT GGGGTGGAGA AAGATCCCAA ACGGGCTGGG
TATTGGTTGG AAAAGGCGGC GCGTGGTGGG GATGGGCTGG CCCAAAGCCT GTATGGTTAT
CTGCTCAGCC AGGGATTGGG GCGGGCGGTG GATGAGGAGG GGGCGGTTTA TTGGTACCGG
TTGGCCGCCG CCCAGGGTGA GCCTAAGGCG ATGATTGCCT TGGCGCTAAA ATATCGTGCC
GGCTTAGGGG TTAAGCGTGA TGCCCGTCAG GCCGTGCAGC TCTTTCGCCA AGCGGCGGAG
CTGGGTGACG GGCGGGCTCA ATACTATCTG GGCGACCATC TGGCGCGGGG GGAGGGCATC
CCCAGGGATG GTGCGCAGGC GGCCCAATGG TATGAGCGGG CAGCCCGGAG TGGATCGTTG
CTGGCTGCGC TGGCGCTGGG CCAGATGTTG GAGCAGGGTA AAGGGGTGCA GGCCGATGGG
GCCATGGCCC GGCATTGGTA TGAGCAGGCC GCCCAAGGGG GGCATGCCGA GGCCCAATTT
CGCTTAGCAC TCATGTGGGA AGAGGGGCGC GGTGGTGTAC GCGATGTCGC CGTGGCGGTG
GATTGGTACC GCAAGGCAGC GGCCCAGGGA GACACCCGAG GGGCGGTAAA CTTGGGTTAT
CTGTTGGCCC ATGGGGTCGG GGCCCCCCGG GATGAGCAGC AGGCGGTGGC TCTCTATACC
CAAGCGGCCC AAGGGGGAAG CGCCACGGCC ATGTATAATC TGGGGGTGCG TTACAGCATG
GGAAGTGGGG TAAAACAGGA TCTTATCGCG GCCTATCAAT GGTTTCATCT GGCATGGCAG
CAGCACTATA AGGGGGCTGA TGCCGCGCGG GAACAGGTCG CCATGCAGTT AAAAGGGGCA
CAAATTGCCC AAGCGCGGGC GCAAGCGGCC CAGTGGTTAA GCGATAAGGG TCGATAA
 
Protein sequence
MYRWMLILML CIPRGLYGAA LPWQEVQSAA VGGHVEAQRM LGQFYLEGEG GVEKDPKRAG 
YWLEKAARGG DGLAQSLYGY LLSQGLGRAV DEEGAVYWYR LAAAQGEPKA MIALALKYRA
GLGVKRDARQ AVQLFRQAAE LGDGRAQYYL GDHLARGEGI PRDGAQAAQW YERAARSGSL
LAALALGQML EQGKGVQADG AMARHWYEQA AQGGHAEAQF RLALMWEEGR GGVRDVAVAV
DWYRKAAAQG DTRGAVNLGY LLAHGVGAPR DEQQAVALYT QAAQGGSATA MYNLGVRYSM
GSGVKQDLIA AYQWFHLAWQ QHYKGADAAR EQVAMQLKGA QIAQARAQAA QWLSDKGR