Gene Mmc1_2003 details

Gene Information

Locus tag: Mmc1_2003
Symbol:
ID: 4483324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 2480318
End bp: 2482813
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 52%
IMG OID: 639722746
Product: Sel1 domain-containing protein
Protein accession: YP_865910
Protein GI: 117925293
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
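
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2480318, 2482813)
print(length)                     # 2496
print(protein_length_aa(length))  # 831
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.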


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0515794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0202924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCC ATTGGGCAAA AGGGTTGGGT TTTGGCATAT TGCTTGTGTT GGCGGGTGGG 
CTATTTGCGT GGAATTTTTT CTTATCCCAT CCACCAAGGG CCAAAACAGC TGCCCAACGA
GCTGTGGCTG AGATTTTTGC GCCGATTTAC CGTGAACCGA GGTGGGATCA CGTACGGACC
ATCGGATTGC AGCCAAAATT TACGGGTTGT GAGACGGTGG ACGAACACCA TTATAAGACG
GCATTGGTGC CGATGGCTGG TCCTAAACTC TCGCTTCGCG AGCGGGGCGG GGATTTGCAG
GCCCTTTTTG CGGATGAAGC CATGATTCGC CGTCTGTTTA ATCATGAAGT AACATCCGAA
AAGTCGGCGC ATACCCTAAT AAAAATAGAG CATGTACAAG CGCTTTTGCT TGGACAATGT
GTGGAGCAGA AGCTTTTTTT AAAATTGGTC ACAACGGCCC ATGGAGATAT TGTGGCAACC
GTTGAGATAG CCCTGGGAGA TGCAGCCGAA GCCGAGCAGC GTGCCGCCGA GCAGGCCAGA
GTTAAAGAGG AGGAGGAGCG CCTAACCCGT ATACGGGTCA AAGCGCAAGA GGCCGAGCAG
CGTGCCGCCG AGCAGGCCAG AGTTAAAGAG GAGGAAGAGC GCCTAACCCG TATACGGGTC
AAAGCGCAAG AGGCCGAGCA GCGTGCCGAG CAGGCCAGAG TTAAAGAGGA GGAGGAGCGC
CTAGCCCGTA TACGGGTCAA AGCGCAAGAG GCCGAGCAGC GTGCCGCCGA GCAGGCCAGA
GTTAAAGAGG AGGAAGAGCG CCTAATCCGT ATTCGGGTCA AAGCGCAAGA GGCCGAGCAG
CGTGCCGAGC AGGCGCGTGA ACATGCGAAG GATGCCGCTA AGATAGCCCT GGGGGATGCC
GCCAAGCTGG CCAATCCCTC TGTTCAATTC AATCTTGGCG TTATGTATGA CAAAGGACAG
GGCGTCACGA AAGACGCCAA AGAGGCGGTG AAGTGGTTTA GGAAATCTGC TGAACAGGGG
TATGCCCAAG CTCAACACAA TCTTGGCGTT ATGTATGACA AAGGAGAGGG CGTCACGAAA
GACGCCAAAG AGGCGGTGAA GTGGTTTAGG AAATCTGCTG AACAGGGGCA TGCCCAAGCT
CAACACAATC TTGGCGTTAT GTATGACAAA GGACAGGGCG TCACGAAAGA CGCCAAAGAG
GCGGTGAAGT GGTTTAGGAA ATCTGCTGAA CAGGGGCATG CCCAAGCTCA ACACAATCTT
GGCGTTATGT ATAACAATGG AGAGGGCGTC ACGAAAGACG CCAAAGAGGC GGTGAAGTGG
TATCGGAAAG CGGCTGAACA GGGGCATGCC CGAGCTCAAA ACAATCTTGG CGTTATGTAT
AACAATGGAG AGGGCGTCAC GAAAGACGCC AAAGAGGCGG TGAAGTGGTA TCGGAAAGCG
GCTGAACAGG GGCAAGCTGA AGCTCAAAAC GATCTTGGCG TTATGTATGA CAAAGGAGAG
GGCGTCACGA AAGACGCCAA AGAGGCGGTG AAGTGGTATC GGAAAGCGGC TGAACAGGGG
CATGCCCGAG CTCAAAACAA TCTTGGCGTT ATGTATAACA ATGGAGAGGG CGTCACGAAA
GACGCCAAAG AGGCGGTGAA GTGGTATCGG AAAGCGGCTG AACAGGGGCA AGCTGAAGCT
CAACACAATC TTGGCTTTAT GTATGACAAA GGAGAGGGCG TCACGAAAGA CGCCAAAGAG
GCGGTGAAGT GGTATCGGAA AGCGGCTGAA CAGGGGCAAG CTAAAGCTCA ACACAATCTT
GGCGTTATGT ATAACAATGG AGAGGGCGTC ACGAAAGACG CCAAAGAGGC GGTGAAGTGG
TTTAGGAAAT CTGCTGAACA GGGGGAAGCT AAAGCTCAAC ACAATCTTGG CGTTATGTAT
AACAATGGAG AGGGCGTCAC GAAAGACGCC AAAGAGGCGG TGAAGTGGTT TAGGAAATCT
GCTGAACAGG GGGAAGCTGA AGCTCAAAAC AATCTTGGCT TTATGTATGA CAATGGAGAG
GGCGTCACGA AAGACGCCAA AGAGGCGGTG AAGTGGCTTA GGAAAGCGGC TGAACAGGGG
CACGCTAACG CTCAAGCCTT TCTTGGACAA AGCTATGATG TAGGATATGG AGTCACGAAA
GACGCCAAAG AGGCGGTGAA GTGGTATAGG AAATCTGCTG AACAGGGGCA AGCTGAAGCT
CAAAACAATC TTGGCGTTAT GTATGACAAA GGACAGGGCG TCACGAAAGA CGCCAAAGAG
GCGGTGAAGT GGTATCGGAA AGCTGCTGAA CAGGGAGATG CTCGAGCTCA ATTCAATCTT
GGAGATAAGT ATGACAAAGG AGAGGGCGTC ACGAAAGACG CCAAAGAGGC GGTGAAGTGG
TATCGGAAAG CGGCTGAACA GGGCTTAACG GAAGCAAGTA CCAGGCTGAA GCGGCTACGC
TTTAACGTTG TTGGACACGA AAATGAGGGA AAATGA
 
Protein sequence
MKIHWAKGLG FGILLVLAGG LFAWNFFLSH PPRAKTAAQR AVAEIFAPIY REPRWDHVRT 
IGLQPKFTGC ETVDEHHYKT ALVPMAGPKL SLRERGGDLQ ALFADEAMIR RLFNHEVTSE
KSAHTLIKIE HVQALLLGQC VEQKLFLKLV TTAHGDIVAT VEIALGDAAE AEQRAAEQAR
VKEEEERLTR IRVKAQEAEQ RAAEQARVKE EEERLTRIRV KAQEAEQRAE QARVKEEEER
LARIRVKAQE AEQRAAEQAR VKEEEERLIR IRVKAQEAEQ RAEQAREHAK DAAKIALGDA
AKLANPSVQF NLGVMYDKGQ GVTKDAKEAV KWFRKSAEQG YAQAQHNLGV MYDKGEGVTK
DAKEAVKWFR KSAEQGHAQA QHNLGVMYDK GQGVTKDAKE AVKWFRKSAE QGHAQAQHNL
GVMYNNGEGV TKDAKEAVKW YRKAAEQGHA RAQNNLGVMY NNGEGVTKDA KEAVKWYRKA
AEQGQAEAQN DLGVMYDKGE GVTKDAKEAV KWYRKAAEQG HARAQNNLGV MYNNGEGVTK
DAKEAVKWYR KAAEQGQAEA QHNLGFMYDK GEGVTKDAKE AVKWYRKAAE QGQAKAQHNL
GVMYNNGEGV TKDAKEAVKW FRKSAEQGEA KAQHNLGVMY NNGEGVTKDA KEAVKWFRKS
AEQGEAEAQN NLGFMYDNGE GVTKDAKEAV KWLRKAAEQG HANAQAFLGQ SYDVGYGVTK
DAKEAVKWYR KSAEQGQAEA QNNLGVMYDK GQGVTKDAKE AVKWYRKAAE QGDARAQFNL
GDKYDKGEGV TKDAKEAVKW YRKAAEQGLT EASTRLKRLR FNVVGHENEG K
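
Both sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the sample is only the first two blocks of the gene sequence, so its GC fraction is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGAAAATCC ATTGGGCAAA")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare against the listed GC content.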