Gene M446_5800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5800 
Symbol 
ID6130989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6374695 
End bp6375759 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content73% 
IMG OID641645907 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001772521 
Protein GI170743866 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00198597 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGCCCG GCCCCGACCC GCAGGACAGC ATGACAGGGG ATCCCATCCG CCGCCCGCTC 
GACGACGTCT GGTACTGCGT CGGCGAGAGC CGCCGCTTCC GCGAGGGGCG GCTCTGTGCC
GTCACGCTCG GCGAGGAGGC GATCGTGGTC GGGCGGGCGG CGGGCGGCGG CCTCTTCGCC
CTGCGCGACC GCTGCCCGCA CCGGGGCATG GCGCTCTCGG CCGGGCGCCT GGTCGAGGGC
CGCCTCGTCT GCCCGTTCCA CGGCTGGGAG TTCCGGCCGG ACGGGCAATG CGCGGCGATC
CCCGCCCTCG CGGCCCGGGA CGAATCCGAC TTCCGCACCG TGCGGCTGCC GCGCTTCTCC
GTGAGGGAGG CGTCCGGCCT CGTCTGGATC GCGGCGGGCG AGCCGCGGCC GGACGCGCCG
CCGATTCCGG AGGTGGAGTT CCCCTACGCG GGCATGCTGG TCGAGACGCT GGAGGTCGAG
GCGAGCTTCG ACCTCGTGGC GTTGAGCTTC GTCGATCCGG CCCATGTCGC CTACGTGCAC
GATCTGTGGT GGTGGCGGAC CTCCAAGACC CTGCGGGAGA AGGAGAAGCA CTTCGCCCCC
TCCCCGTTCG GCTTCACCAT GACGAGCCAC AGGGCGAAGA GCGCCTCGGC GGTCTACCGC
CTGCTCGGGG CGGTGCCGGA GGTCGAGATC GAGTTCCGCC TGCCGGGCGT GCGGCTGGAG
CGGATCAGGG CCGGGGAGAA GCGCATCGCG AACTACACCT TCGCGACGCC GCTCGGCCCG
GGCCGCACGG CGCTCGTCAA CGCGATGTAC TGGAACCTGC CGCTGCTGAA CCTCCTGCGG
CCGCTGGCGC GGCCGCTGAT GCGGCAATTC CTCGGCCAGG ACCGCGACGT GCTGCAGGTC
GCCCAGAAGG GCCTCGACCG GAAGCCCGCG ATGGTGCTGA TCGGGGAGAG CGACGTGCAG
TCGCAATGGT ATTTCAGCCT CAAGCGCGAG TTCCTGCGCG CCCGTGAGGC GGGCGCGGCC
TTCGCCAACC CGCTCGCGCC GCGGCTCCTG CGCTGGCGCA GCTGA
 
Protein sequence
MGPGPDPQDS MTGDPIRRPL DDVWYCVGES RRFREGRLCA VTLGEEAIVV GRAAGGGLFA 
LRDRCPHRGM ALSAGRLVEG RLVCPFHGWE FRPDGQCAAI PALAARDESD FRTVRLPRFS
VREASGLVWI AAGEPRPDAP PIPEVEFPYA GMLVETLEVE ASFDLVALSF VDPAHVAYVH
DLWWWRTSKT LREKEKHFAP SPFGFTMTSH RAKSASAVYR LLGAVPEVEI EFRLPGVRLE
RIRAGEKRIA NYTFATPLGP GRTALVNAMY WNLPLLNLLR PLARPLMRQF LGQDRDVLQV
AQKGLDRKPA MVLIGESDVQ SQWYFSLKRE FLRAREAGAA FANPLAPRLL RWRS