Gene Mmc1_3020 details


Gene Information

Locus tag: Mmc1_3020
Symbol:
ID: 4483531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 3778648
End bp: 3779580
Gene Length: 933 bp
Protein Length: 310 aa
Translation table: 11
GC content: 60%
IMG OID: 639723767
Product: PDZ/DHR/GLGF domain-containing protein
Protein accession: YP_866917
Protein GI: 117926300
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
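
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp):
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3778648, 3779580)
print(length)                     # 933
print(protein_length_aa(length))  # 310
```

Both values agree with the Gene Length and Protein Length fields of this record.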


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC CGCTGCGTTA TCTGACGTTG CTGGCTGCTG GCCTATTGAG CAGCACCCCC 
GCTTTGGCCG CTGGCTGGCT AGGGGCCAGC CTCAACACGC CACGCGGGGT CCAAGTAGGC
GAGATTATTA AGGGCAGCCC GGCCGATCAT GCTGGTTTGG AGCCCGAGGA TATTATTTTG
GAATTGAATG GGCAGGCCAT TCACGGCCCC GGCCAATTTG CCCGCCGCAT CGCCACCACC
AAACCGGGTA CCCGCATCAC CCTAAAAGTG ATGCGTAAAG GCAAGCTCAC CGAGCTAAAA
ACCACCCTGG AAGATAGCAA AGACCATGCA AGCGTAACCT CCTACATGGG GGGGCCTATG
GGGAGCATGT TAGAGACCCC CAATCAAATG ATGCGGAGCA TGCCTTTTCC ACCCATGCGC
GGCAATCCTG ATGCTTATGG CGCTCCGGGC GGGGCGTATG GGCCCCCTCC CGGCGGGCGT
AACCAAGGCG GCTTTGGCGG CGAAGATTTT GGCAATCGGC CACCCCCTCC TGGTCAGGGC
GGTTTTGCCC AAGCTGGCGA GGCGAACGAC CTTGCTTTCA CCCCTCCCCC GCCACCCCCT
AACGATCAAC GTCAGGGCGG GTTCGGCCCA CCGCCTCACC TGCGTGGCAT GCCCATGGAT
ACCAACAAAG CGTGGTTAGG TATCGCGGTA CAGAACAGCG CCGCTGGGGT TGTGCTGGAA
GGGGTAGCCC CCACCAGCCC CGCCGCCAAG GCAGGCTTAC AGGCCGGGGA TCGCATTGAA
AAAATCAATG ACACCCCCAT CACCGGGGAC CAACAACTGG TGCAGAGCTT AGCCCCCTTA
CAGCCCGGGC AAAAGCTGAC GATACACTAT GTTCGTGACG GCAAATCAAA CGCCGTGGTG
GTGGATCTGG TTAGCCGCCC AGCCCATCCG TAA
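
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not taken from any library) clean such a block and compute its GC percentage. The exact method the database used for the GC content field is not stated in this record, so a simple (G + C) / length ratio is assumed here.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence to
# compare against the reported GC content.
first_line = "ATGAAAAAAC CGCTGCGTTA TCTGACGTTG CTGGCTGCTG GCCTATTGAG CAGCACCCCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```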
 
Protein sequence
MKKPLRYLTL LAAGLLSSTP ALAAGWLGAS LNTPRGVQVG EIIKGSPADH AGLEPEDIIL 
ELNGQAIHGP GQFARRIATT KPGTRITLKV MRKGKLTELK TTLEDSKDHA SVTSYMGGPM
GSMLETPNQM MRSMPFPPMR GNPDAYGAPG GAYGPPPGGR NQGGFGGEDF GNRPPPPGQG
GFAQAGEAND LAFTPPPPPP NDQRQGGFGP PPHLRGMPMD TNKAWLGIAV QNSAAGVVLE
GVAPTSPAAK AGLQAGDRIE KINDTPITGD QQLVQSLAPL QPGQKLTIHY VRDGKSNAVV
VDLVSRPAHP
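
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's pipeline: it builds the codon table from the amino acid string of the standard genetic code, which translation table 11 shares for all codons. Table 11 differs in which codons may act as start codons, and that case is not handled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAAAC CGCTGCGTTA TCTGACGTTG"))  # MKKPLRYLTL
```

The output matches the first ten residues of the protein sequence listed in this record.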