Gene Mmc1_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_2050 
Symbol 
ID4481784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp2531961 
End bp2533817 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content58% 
IMG OID639722793 
Productphage terminase GpA 
Protein accessionYP_865957 
Protein GI117925340 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0184089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTGT CCGCAGCGCA GATTTACGAA GATGCATTCC TTCAGGGGCT GATGCCCGAT 
CCGCGTCTGA CGGTCTCTGA ATGGGCTGAT GAACACCGCA TGCTCTCTTC GGTGGCCTCG
GGGGAACCCG GCCCTTGGCG CACCGATCGC ACCCCCTATT TGCAGGAGGT GATGGACTCT
CTTTCACCCA GTTCCCCCAT TGAGCGGGTG GTGGCGATGT TTGGTAGTCA GCTGGGCAAA
ACCGAGTGTG GCCTGAACTG GGTGGGGTAT GTGATCCACC ATGCACCGGG TCCCATGCTG
ATGGTTCAGC CCACGGTGGA GATGGCTAAG CGCTACTCCA AACAGCGGGT GGGACCGCTC
ATTGAATCAA GTTCTGAGAT CCGGGAGCGG GTAAAACCGG CTCGATCTCG GGACAGTGGC
AACACGGTGC TTTCCAAGGA GTTCCCTGGC GGCATCCTTT TGATGACCGG AGCCAACAGC
GCGGTGGGGC TCTCCTCCGC GCCGATCCGC TATCTCTTCA TGGATGAGGT GGACCGTTTT
CCAGGTGACG CTGATGGCGA AGGGGATCCG GTGGCGCTGG CGATTCAGCG CACGGCGAAC
TTCTCCAATC GCAAGGTGCT GCTTACTAGC ACACCGACAA TCAAGGGCTT CAGTCGCATC
GAAGCAGCTT ATGCAGAAAG CGACCAGCGT CAGTTCTGGG TACCCTGCCC AGAGTGTGGT
GAATTCCAGG TGCTGACATG GGCCCAGGTG AAATGGCCTC ATGGTGAGCG AAAAGCGGCG
TATTACCTCT GTCCACACTG TGAATCACAG CTGGCTGACC ATCAAAAGGG CTGGATGCTG
GAAAACGGCG TCTGGCGAGC GGCGGCTGCT GGGGATGGGA AAACCGCAGG CTTTCACCTC
TCCTCCCTCT ATAGCCCCCA TGGCTGGACC TCATGGGGGG ACATTGCGGT GGAGCATGGT
CTGGTGCACA AGGATCCATC TCGATTAAAG ACCTGGGTCA ACACTAAAAT GGGGCAGTGT
TGGGAGGAGC AAGGTGACCG TATCGATGGC GAAGGGCTCA TGGAGCGCCG AGAGGTCTGG
GGCGAACTGC TCCCTGCTGA TGTGGCTGTA CTCACAGCTG GCGTTGATGT TCAGGATGAT
CGTCTTGAAG TTGAAATCGT AGGCTGGGGC CGGGATGAAG AGTCCTGGTC CATTGACTAT
CGGGTATTGT GGGGTGATCC CTCATCCCCA GCTGTATGGG AAGATCTGGA CAACCTGCTC
CGTCACCCCC TTGGTCACAG TCGGCAGTTG CCAGATATGA CCATCCGGGC CGCCGCTATC
GATACCGGTG GTCACCATAC CCTGAAGTCC TATGCCTTCT GCCGATCCAG GCAGGGCCGA
AGGATCTGGG CCATCAAGGG GCGTGGTGGT CAGGGAGTGC CCATCTGGCC CCGTAAGCCC
TCCACTAAAA ACAAGGGGAA GGTGCCGCTT TTCATTTTGG GTGTGGATGC CTGTAAAGAG
GCGATCCTCT CCCGTCTACG CATTGAAGAG CCAGGACCGG GCTTTCTGCA CTTTCCCATG
CAACGGGATG GGGACTACTT CAAACAGCTC ACCGCTGAAT CGGTGGTGAC CCGTTATCAC
AAGGGCCGCC CCATTCGGGA GTGGAATAAG CGGGATTCCG ATCGCAACGA GGCCCTGGAT
TGCCGTGTCT ATGCCATGGC GGCACTACAG GGGCTGATTG CCATGGGGTT TCGGCTCAAC
ATGGCGGTTG AGAAGATCGC TGAACACCCT CTTAAAGATG CGATGCCTGA GCCCGGCAAA
GTGCAGCAAA AGAAGATCAA ACCTAAACGC CGGGTTATTC AATCCAGTTG GGTCTAA
 
Protein sequence
MALSAAQIYE DAFLQGLMPD PRLTVSEWAD EHRMLSSVAS GEPGPWRTDR TPYLQEVMDS 
LSPSSPIERV VAMFGSQLGK TECGLNWVGY VIHHAPGPML MVQPTVEMAK RYSKQRVGPL
IESSSEIRER VKPARSRDSG NTVLSKEFPG GILLMTGANS AVGLSSAPIR YLFMDEVDRF
PGDADGEGDP VALAIQRTAN FSNRKVLLTS TPTIKGFSRI EAAYAESDQR QFWVPCPECG
EFQVLTWAQV KWPHGERKAA YYLCPHCESQ LADHQKGWML ENGVWRAAAA GDGKTAGFHL
SSLYSPHGWT SWGDIAVEHG LVHKDPSRLK TWVNTKMGQC WEEQGDRIDG EGLMERREVW
GELLPADVAV LTAGVDVQDD RLEVEIVGWG RDEESWSIDY RVLWGDPSSP AVWEDLDNLL
RHPLGHSRQL PDMTIRAAAI DTGGHHTLKS YAFCRSRQGR RIWAIKGRGG QGVPIWPRKP
STKNKGKVPL FILGVDACKE AILSRLRIEE PGPGFLHFPM QRDGDYFKQL TAESVVTRYH
KGRPIREWNK RDSDRNEALD CRVYAMAALQ GLIAMGFRLN MAVEKIAEHP LKDAMPEPGK
VQQKKIKPKR RVIQSSWV