Gene MCA1041 details


Gene Information

Locus tag: MCA1041
Symbol:
ID: 3103676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1092330
End bp: 1094567
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 65%
IMG OID: 637170225
Product: DsbD family thiol:disulfide interchange protein
Protein accession: YP_113516
Protein GI: 53804808
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID:
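
The coordinate and length fields above agree with each other under two common conventions, which are assumed here rather than stated by the record: coordinates are 1-based and inclusive, and the gene length of a CDS includes the stop codon. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1092330, 1094567)
print(length)                  # 2238
print(protein_length(length))  # 745
```

Both results match the Gene Length and Protein Length fields of this record.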


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGACG CCGTGAAACG GATTGGCTTG CTGTTGATTT TGGGATTGGT CGGCATTCCG 
GCACTCGCCG TCGATGGCCG CGACCTCCTG CCGCCGGAGC GGGCGTTCCC GGTAGCCGCC
CGGCAGTCCG GCGCCGGCGT GGTTTCACTG GTGTGGGACA TAGTGGACGG CTATTACCTT
TACCGGAGCA AATTCAAGTT CGCATCGCGC ACCGCCGGTG TCCGTCTGGG CGAGCCGGTC
TTTCCGGAAG GCCACAAGAA GCACGATGCA TTCTTCGGCG AAGTCGAAAC TTACCGGGGA
CACATCGAGG TGCGGCTGCC TCTCGTTGCG GAATCCGCAT TGCCGGAATC GCTGGAACTG
GAAGTGACGG TGCAGGGGTG TGCCGATGCC GGCGTCTGTT TTCCGCCGTA TCAGCGGCGG
GTCGAGGTGA AGACCAGTGC GATCAGCGGT GGCGGCGCGT TCACGCGTCT GGCCGGTGCG
CTTCGAAACG TCGGAACCGG CGTGGCGACC GGGGACCTGC TGCCGCCGGA CCAGGCGTTC
CGCTTTTTCG CCGAGGCGCC CACGGGGGAG ATCCTCCGGC TCGGTTGGCA GATCGCCCCC
GGTTATTACC TCTACCGGGA AAAATTCCGC ATCAGCCTGC GCGATTCCGG GGGCGTCGCC
CTGGGAGACT ATACGTTCCC GCGCGGGGAG CCGAAGGTCG ACGAAGAGTT CGGTGCGGTG
GAGGTCTTCC ACGGCGAGGT CGCGGTTGAC GTCCCCTTGG TCCGGACTGA CGCCGGTCCG
CGCACCGTCA CCGTGGAAGC CGGCTTCCAA GGCTGCGCCG AGCGCGGCGT GTGTTATCCG
CCGATGACCA GGACGATCGA TCTCGTTCTG CCAGCCGCTG CGGAAGGAAA AAGCGTCTCG
GCGATGGCCG GAGCCACCCT CATCACCGAG CAGGACCGCA TCGCCGACGG ACTCTGGCGC
GGTTCCCTCT GGCTCAACCT GTTGAGCTTC CTTGGTCTCG GCATCCTGCT CGCCTTCACC
CCCTGCATCT TCCCGATGAT CCCGATTCTT TCCGGCGTCA TCGTGGGTCA TGGCCATGCG
ATCACCACGC GGCGGGCGTT CTCGCTCTCG CTGGCCTATG TGCTCGCCCA TGCCTTGGCT
TACACCTGCT TCGGTGTGTT GGCCGCCCTG TTCGGTGCGA ATCTCCAGGT CGCACTGCAG
AACCCCTGGG CGATCGGGGT GTTCAGTGGT CTGTTCGTGG TGTTGGCGCT ATCCATGTTC
GGCTTCTACC AGCTTCAGCT CCCGACCGTC CTGCAGTCGC GGCTCGCGGC GCTCAGTTCG
CGGCAGCAGG CTGGCAGCCT GGTCGGCGCG GCCGTGATGG GACTGCTGTC GGCCGGCCTC
GTGGGGCCTT GCGTGGCGGC GCCGCTGGCG GGTGCACTGA TCTACATCGG CCGCAGTGGC
GATGTGCTGT TGGGCGGGCT CGCCCTGTTT TCCCTGGGGC TGGGTATGGG GTTGCCGCTC
CTGGCGGTCG GGACCTCGGC GGGCAAACTG CTGCCCAAGG CCGGCATGTG GATGAATACC
GTCAAGTCGG TGTTCGGTGT AGCCATGCTG GCGGTGGCGG TATCGATGCT GGAGCGGATC
GTCCCTCTCT CCCTCGCCAT GCTGATGTGG GCGCTGCTGC TCATCGTGCC GGCCGTGTAC
ATGGGAGCGC TCGATGCGCT GCCCGCTGGG GCGTCGGGCT GGCGCAAGCT ATGGAAAGGA
CTGGGAGTGG CGATGGTCGC CTATGGCGTG TTCATGCTGC TGGGGGTGGC AGCCGGTAGT
CGCGATCCGT TGCAGCCCTT GCGCGGTGTG GCCGGCGCCT CCGCTACCGT CGCCGATGCC
GGTCCGCCGC CTTTCGTGAG GGTGGCGAGC CTGGCGGATC TGGAGGCTCG TCTGGCTGAA
GCCGCTGCCG CAGGACGCTG GGTGATGCTC GATTTCTACG CCGACTGGTG CGTATCGTGC
GAGGAGATGG AGCGTCATAC CTTCAGCGAT CCCGTGGTCA AGGCGAAGCT GAAAGAGATG
GTGCTGCTCA AGGTCGACGT CACCGAAAAC GACGATGAGG ACCAGGCGCT GCTGCAGCGC
TTCGGTCTGA TCGGTCCGCC AGCCACCCTG TTTTTCGGTC CCGACCGGAG CGAGCGGAAA
CCTTTCCGGC TGGTGGGGTT CACCGAGTCG GAAGAGTTCG TCCATCACCT CGAGCGGCTG
TTCGACGGTG TCCGATAG
 
Protein sequence
MRDAVKRIGL LLILGLVGIP ALAVDGRDLL PPERAFPVAA RQSGAGVVSL VWDIVDGYYL 
YRSKFKFASR TAGVRLGEPV FPEGHKKHDA FFGEVETYRG HIEVRLPLVA ESALPESLEL
EVTVQGCADA GVCFPPYQRR VEVKTSAISG GGAFTRLAGA LRNVGTGVAT GDLLPPDQAF
RFFAEAPTGE ILRLGWQIAP GYYLYREKFR ISLRDSGGVA LGDYTFPRGE PKVDEEFGAV
EVFHGEVAVD VPLVRTDAGP RTVTVEAGFQ GCAERGVCYP PMTRTIDLVL PAAAEGKSVS
AMAGATLITE QDRIADGLWR GSLWLNLLSF LGLGILLAFT PCIFPMIPIL SGVIVGHGHA
ITTRRAFSLS LAYVLAHALA YTCFGVLAAL FGANLQVALQ NPWAIGVFSG LFVVLALSMF
GFYQLQLPTV LQSRLAALSS RQQAGSLVGA AVMGLLSAGL VGPCVAAPLA GALIYIGRSG
DVLLGGLALF SLGLGMGLPL LAVGTSAGKL LPKAGMWMNT VKSVFGVAML AVAVSMLERI
VPLSLAMLMW ALLLIVPAVY MGALDALPAG ASGWRKLWKG LGVAMVAYGV FMLLGVAAGS
RDPLQPLRGV AGASATVADA GPPPFVRVAS LADLEARLAE AAAAGRWVML DFYADWCVSC
EEMERHTFSD PVVKAKLKEM VLLKVDVTEN DDEDQALLQR FGLIGPPATL FFGPDRSERK
PFRLVGFTES EEFVHHLERL FDGVR
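
The sequences above are printed in space-separated blocks of ten characters. The following Python sketch, which assumes plain-text input and uses hypothetical helper names, shows one way such a block might be normalized before comparing it with the Gene Length, Protein Length, and GC content fields:

```python
def normalize(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCGTGACG CCGTGAAACG GATTGGCTTG CTGTTGATTT TGGGATTGGT CGGCATTCCG"
dna = normalize(first_line)
print(len(dna))  # 60
```

Passing the full gene sequence through `normalize` gives a string whose length and `gc_percent` value can be compared against the corresponding fields in Gene Information.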