Gene MCA1016 details

Gene Information

Locus tag: MCA1016
Symbol:
ID: 3103867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1067515
End bp: 1068624
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 61%
IMG OID: 637170201
Product: dioxygenase, iron-sulfur subunit, putative
Protein accession: YP_113492
Protein GI: 53804857
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
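
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1067515
end_bp = 1068624
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields are mutually consistent; a mismatch in a similar record would suggest a different coordinate convention or a partial CDS.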


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.621392
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGAT CGATCAGAAA CCAAGATGTC CCCGAACTGC CCCGTCGGCG GCAGGTCCGC 
ACCGTCGGCA TGAGCGGCAA TTACTGGTAT GTGGTCGAGA TCGACGGCAG GCTCAAGCCC
CGGCAAGTCA AACGGGTGCG TTTCTGGGGA CAGGACATCG CGTTGTTCCG CGACGCTGCT
GGCGAACTGC ATGCCGTGGA AGACCGTTGC CCGCATCGGC AACTCCCGCT GTCCCAGGGC
TTCGTCGAGG GGGGAAACCT GGTCTGTACC TATCATGGAT GGAAATTCGA TGGCTGCGGC
CGGTGCACCG AAATCCACCA TGAGCTTGGC AAAGGCCGTA CCAGGTTACC TAGAATCCGC
ATCAGGACCT ATCCCGTCAA GGCGCAATGG GGGCTCATCT GGCTGTTTCC GGGCGATCCC
GCCCTGGCGG ACGGAACCCC GCTGCCGACG ATCCCCCAGC TCGAAGGGGG ACGGCCCTGG
CCGTTCTTCC CGATCGACGT GACGATCAAA GCGCACTTCT CGATGATCGT GGAGAACGTT
TGTGATTTCA ACCACGAATA CCTGCACCGG CACAAACGCC CCTTCCTGCA GCCGATCCTG
CGCGAGTGGA AGCAGGACGC CGACAGTGTT CGGGTCTACT ACGACACCCG TTTCGACGGG
AGCCCCGTCG CCAAGCTCTT CATGGAAGGC GGGGCGCGTG ATCTCAACGA GATCGAGATC
TGGTACCAGT ACCCTTATCA GGGTTCCGAC ATCGGCGGCA AGTACATCCA CTGGCTGTTC
ATGCTGCCGG AGGACGAGCG CACCACCCGC TGTTTCTTCG TCTTCCTGTT CGGGCCGATC
CATGTCCCGA TCGTGAACTG GAAGATGCCC GAATTCCTGC GCAAGCCCAT CCTCTGGTTC
ACCAACAAGT GGTACATCGA GCCCCTGTTG GGCGAGGACA AATGGGCGCT GGAATTGGAG
CAGGACGGTT TCGAGCGCCA TCCCGATGCG CCGCAGATCG AGCTCAATCC GGCCATCAGC
TCGTTCCAGA GGTTGTCGCT GGAGAAGTGG AAAGCTTACC AGCAGTCCAT GGAGAGAGCC
GGGCCAAAGC CGGCGGCAGA CCCGGCATGA
 
Protein sequence
MSRSIRNQDV PELPRRRQVR TVGMSGNYWY VVEIDGRLKP RQVKRVRFWG QDIALFRDAA 
GELHAVEDRC PHRQLPLSQG FVEGGNLVCT YHGWKFDGCG RCTEIHHELG KGRTRLPRIR
IRTYPVKAQW GLIWLFPGDP ALADGTPLPT IPQLEGGRPW PFFPIDVTIK AHFSMIVENV
CDFNHEYLHR HKRPFLQPIL REWKQDADSV RVYYDTRFDG SPVAKLFMEG GARDLNEIEI
WYQYPYQGSD IGGKYIHWLF MLPEDERTTR CFFVFLFGPI HVPIVNWKMP EFLRKPILWF
TNKWYIEPLL GEDKWALELE QDGFERHPDA PQIELNPAIS SFQRLSLEKW KAYQQSMERA
GPKPAADPA
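
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names `normalize` and `gc_fraction` are hypothetical and not part of any database tool. The sample string is the last line of the gene sequence; pasting the full gene sequence in its place would allow a comparison with the reported gene length and GC content.

```python
def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence in this record, used as a short sample.
sample = "GGGCCAAAGC CGGCGGCAGA CCCGGCATGA"
seq = normalize(sample)

print("length:", len(seq))
print("final codon:", seq[-3:])
print("GC fraction:", round(gc_fraction(seq), 2))
```

The same `normalize` helper applies to the protein sequence, where the resulting length can be compared with the Protein Length field.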