Gene Information |
Locus tag | MCA1041 |
Symbol | |
ID | 3103676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1092330 |
End bp | 1094567 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637170225 |
Product | DsbD family thiol:disulfide interchange protein |
Protein accession | YP_113516 |
Protein GI | 53804808 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
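The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag MCA1041.
length_bp = gene_length(1092330, 1094567)
print(length_bp)                  # 2238
print(protein_length(length_bp))  # 745
```

Under these assumptions both results agree with the Gene Length (2238 bp) and Protein Length (745 aa) fields.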
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGTGACG CCGTGAAACG GATTGGCTTG CTGTTGATTT TGGGATTGGT CGGCATTCCG GCACTCGCCG TCGATGGCCG CGACCTCCTG CCGCCGGAGC GGGCGTTCCC GGTAGCCGCC CGGCAGTCCG GCGCCGGCGT GGTTTCACTG GTGTGGGACA TAGTGGACGG CTATTACCTT TACCGGAGCA AATTCAAGTT CGCATCGCGC ACCGCCGGTG TCCGTCTGGG CGAGCCGGTC TTTCCGGAAG GCCACAAGAA GCACGATGCA TTCTTCGGCG AAGTCGAAAC TTACCGGGGA CACATCGAGG TGCGGCTGCC TCTCGTTGCG GAATCCGCAT TGCCGGAATC GCTGGAACTG GAAGTGACGG TGCAGGGGTG TGCCGATGCC GGCGTCTGTT TTCCGCCGTA TCAGCGGCGG GTCGAGGTGA AGACCAGTGC GATCAGCGGT GGCGGCGCGT TCACGCGTCT GGCCGGTGCG CTTCGAAACG TCGGAACCGG CGTGGCGACC GGGGACCTGC TGCCGCCGGA CCAGGCGTTC CGCTTTTTCG CCGAGGCGCC CACGGGGGAG ATCCTCCGGC TCGGTTGGCA GATCGCCCCC GGTTATTACC TCTACCGGGA AAAATTCCGC ATCAGCCTGC GCGATTCCGG GGGCGTCGCC CTGGGAGACT ATACGTTCCC GCGCGGGGAG CCGAAGGTCG ACGAAGAGTT CGGTGCGGTG GAGGTCTTCC ACGGCGAGGT CGCGGTTGAC GTCCCCTTGG TCCGGACTGA CGCCGGTCCG CGCACCGTCA CCGTGGAAGC CGGCTTCCAA GGCTGCGCCG AGCGCGGCGT GTGTTATCCG CCGATGACCA GGACGATCGA TCTCGTTCTG CCAGCCGCTG CGGAAGGAAA AAGCGTCTCG GCGATGGCCG GAGCCACCCT CATCACCGAG CAGGACCGCA TCGCCGACGG ACTCTGGCGC GGTTCCCTCT GGCTCAACCT GTTGAGCTTC CTTGGTCTCG GCATCCTGCT CGCCTTCACC CCCTGCATCT TCCCGATGAT CCCGATTCTT TCCGGCGTCA TCGTGGGTCA TGGCCATGCG ATCACCACGC GGCGGGCGTT CTCGCTCTCG CTGGCCTATG TGCTCGCCCA TGCCTTGGCT TACACCTGCT TCGGTGTGTT GGCCGCCCTG TTCGGTGCGA ATCTCCAGGT CGCACTGCAG AACCCCTGGG CGATCGGGGT GTTCAGTGGT CTGTTCGTGG TGTTGGCGCT ATCCATGTTC GGCTTCTACC AGCTTCAGCT CCCGACCGTC CTGCAGTCGC GGCTCGCGGC GCTCAGTTCG CGGCAGCAGG CTGGCAGCCT GGTCGGCGCG GCCGTGATGG GACTGCTGTC GGCCGGCCTC GTGGGGCCTT GCGTGGCGGC GCCGCTGGCG GGTGCACTGA TCTACATCGG CCGCAGTGGC GATGTGCTGT TGGGCGGGCT CGCCCTGTTT TCCCTGGGGC TGGGTATGGG GTTGCCGCTC CTGGCGGTCG GGACCTCGGC GGGCAAACTG CTGCCCAAGG CCGGCATGTG GATGAATACC GTCAAGTCGG TGTTCGGTGT AGCCATGCTG GCGGTGGCGG TATCGATGCT GGAGCGGATC GTCCCTCTCT CCCTCGCCAT GCTGATGTGG GCGCTGCTGC TCATCGTGCC GGCCGTGTAC ATGGGAGCGC TCGATGCGCT GCCCGCTGGG GCGTCGGGCT GGCGCAAGCT ATGGAAAGGA CTGGGAGTGG CGATGGTCGC CTATGGCGTG TTCATGCTGC TGGGGGTGGC AGCCGGTAGT 
CGCGATCCGT TGCAGCCCTT GCGCGGTGTG GCCGGCGCCT CCGCTACCGT CGCCGATGCC GGTCCGCCGC CTTTCGTGAG GGTGGCGAGC CTGGCGGATC TGGAGGCTCG TCTGGCTGAA GCCGCTGCCG CAGGACGCTG GGTGATGCTC GATTTCTACG CCGACTGGTG CGTATCGTGC GAGGAGATGG AGCGTCATAC CTTCAGCGAT CCCGTGGTCA AGGCGAAGCT GAAAGAGATG GTGCTGCTCA AGGTCGACGT CACCGAAAAC GACGATGAGG ACCAGGCGCT GCTGCAGCGC TTCGGTCTGA TCGGTCCGCC AGCCACCCTG TTTTTCGGTC CCGACCGGAG CGAGCGGAAA CCTTTCCGGC TGGTGGGGTT CACCGAGTCG GAAGAGTTCG TCCATCACCT CGAGCGGCTG TTCGACGGTG TCCGATAG
|
Protein sequence | MRDAVKRIGL LLILGLVGIP ALAVDGRDLL PPERAFPVAA RQSGAGVVSL VWDIVDGYYL YRSKFKFASR TAGVRLGEPV FPEGHKKHDA FFGEVETYRG HIEVRLPLVA ESALPESLEL EVTVQGCADA GVCFPPYQRR VEVKTSAISG GGAFTRLAGA LRNVGTGVAT GDLLPPDQAF RFFAEAPTGE ILRLGWQIAP GYYLYREKFR ISLRDSGGVA LGDYTFPRGE PKVDEEFGAV EVFHGEVAVD VPLVRTDAGP RTVTVEAGFQ GCAERGVCYP PMTRTIDLVL PAAAEGKSVS AMAGATLITE QDRIADGLWR GSLWLNLLSF LGLGILLAFT PCIFPMIPIL SGVIVGHGHA ITTRRAFSLS LAYVLAHALA YTCFGVLAAL FGANLQVALQ NPWAIGVFSG LFVVLALSMF GFYQLQLPTV LQSRLAALSS RQQAGSLVGA AVMGLLSAGL VGPCVAAPLA GALIYIGRSG DVLLGGLALF SLGLGMGLPL LAVGTSAGKL LPKAGMWMNT VKSVFGVAML AVAVSMLERI VPLSLAMLMW ALLLIVPAVY MGALDALPAG ASGWRKLWKG LGVAMVAYGV FMLLGVAAGS RDPLQPLRGV AGASATVADA GPPPFVRVAS LADLEARLAE AAAAGRWVML DFYADWCVSC EEMERHTFSD PVVKAKLKEM VLLKVDVTEN DDEDQALLQR FGLIGPPATL FFGPDRSERK PFRLVGFTES EEFVHHLERL FDGVR
|
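As a rough illustration of how the gene and protein sequences above relate, the following sketch translates the first 30 bases of the gene sequence and computes their GC fraction. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names are hypothetical, and the 10-base spacing is stripped before processing.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGCGTGACG CCGTGAAACG GATTGGCTTG"
print(translate(prefix))  # MRDAVKRIGL
```

The translated prefix matches the first ten residues of the protein sequence. Calling `gc_fraction` on the complete gene sequence would be the way to compare against the reported GC content; the short prefix alone is not representative of the whole gene.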