Gene Mext_4521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4521 
Symbol 
ID5834720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5050994 
End bp5052100 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content72% 
IMG OID641370315 
Producthypothetical protein 
Protein accessionYP_001641960 
Protein GI163853917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.535871 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGGG TCGTCCTGCC CGTGGGGCTG GCGCTCGGCG CGGGTGCGCT CGGCGCGCTC 
ACGCTGACAG AAGCCGGCAT CGGCCTGCGC GTGAAGGCCG GCCCCGTCCT CTCGGACCTG
CATGATCGGT TTTCCGGGGT TGCGACGCCC GCGGCGCATC AGACGCCGAC CGCACCGAGC
CCCCCGCCCT CGCGAGTCGC GGTGGAAGGC GGTCAGGCCG TCGTGCGGCT GACCGATGCG
GAGCAGGCGC GGATCGGCGT CGCGACAGCC CGCCATAAGC GGATGCCCCA CCGCATCGAG
GTCCAGGCCT TCGGCTCGGT CCTCGATCTC GCGCGGGTCA CGGAGCTCAC CAACAGCTAC
GCCAGCGCCA GGGCGCAGTT GCAGACCGCC GAAGCCAAGG CGGAAGTCTC GCGCGCCGCC
TATACCCGGG CGCGCAGCCT CGGCCAATAC GCGACACAGG TGCAGCTGGA GACGGCCGAG
GGCACCTTCC GCACCGACGA GGCGGCGCTC GCTGCGGCGC AGTCGCAGGT CCGGACGCTT
GCGGCCACCG CGCAGCAGGA ATGGGGCACG GTGATCGGGC GGGCCATCAT CGAGCGTTCG
CCCGCCATCA CCCGGCTGAT CGAGCGCACC GACTTCCTGG TGCAGGTCAC GCTGCCGCCC
GGCGAGACGC TGCGGGCGCC GCCCGGCACG GCCCATGCCG AGGTGCCGCC GCAGAGCGAG
CGCGTCGCCT TGCGTTACGT CTCGCCCGCG ACCCGGACCG ATCAGCGCAT CCAGGGCGTC
AGCTACTTCT ACACCGTGGC CGGCAATAGC GGGCTCCTGC CGGGCATGAG CACGCTCGCC
TTCCTGACCT CGGAGCGCGA GACGACGGGC ATCGCCGTGC CGGAAAGCGC CGTGGTGCAC
TGGCAGGGCG GCGCCTGGAT CTACCGGAGC GTCGGCGACG ACGCCTTTGC GCGCCATCCC
CTCCGGGCCG ACGCGCCGAT CTCGGCCGAC GCCTACGTCG TGGACGATCT CGGCGCGGAG
GCGGAGATCG TCGTGACCGG GCCGCAGGCC GTCCTCTCCG AGGAGCTGAA GGGGCAGATC
CAGTCCTCGG ATGCGGACGA CGATTGA
 
Protein sequence
MRRVVLPVGL ALGAGALGAL TLTEAGIGLR VKAGPVLSDL HDRFSGVATP AAHQTPTAPS 
PPPSRVAVEG GQAVVRLTDA EQARIGVATA RHKRMPHRIE VQAFGSVLDL ARVTELTNSY
ASARAQLQTA EAKAEVSRAA YTRARSLGQY ATQVQLETAE GTFRTDEAAL AAAQSQVRTL
AATAQQEWGT VIGRAIIERS PAITRLIERT DFLVQVTLPP GETLRAPPGT AHAEVPPQSE
RVALRYVSPA TRTDQRIQGV SYFYTVAGNS GLLPGMSTLA FLTSERETTG IAVPESAVVH
WQGGAWIYRS VGDDAFARHP LRADAPISAD AYVVDDLGAE AEIVVTGPQA VLSEELKGQI
QSSDADDD