Gene Mext_4472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4472 
Symbol 
ID5833637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4990002 
End bp4991102 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content72% 
IMG OID641370265 
Producthypothetical protein 
Protein accessionYP_001641911 
Protein GI163853868 
COG category[R] General function prediction only 
COG ID[COG4111] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.859238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGGGC CAATCCGGCA AGACCGGAAT GCGGGCCGGA ACCGCTTCCA CCGCCGAGGG 
GATGCCCACG AAGAGGGGCT GAAGGCCGAC GAACGCATGA GCACGGCCGA ACTGGGTGTG
ACCGCCGCCG CGGCGGACAA GGGGACGCGC GCCGCCAGCG CTTCGTCGGT GGGGCTCGTG
GCCGTCATCG TCGCGGCGAC GGACGGCGAG CCGCGCGCGC TCACCGTGCA GGTCGAGGGA
CAAGCCGAGG GTCGCGAGAG CGCCCTGCCC GCCGGGCCTT TGGTGCCCGA GCACGCCACC
CTGGAGCGGG GCCTTCGCGC CTGGGTCGAG CAGCAGACGC ATCAGCGCCT CGGTTATGTC
GAGCAGCTCT ACACTTTCGG CGACCGCGAC CGGGAGGGCG GCCAGCACGA CGTGCACCTG
CTGTCGGTGG CCTATCTCGC CCTCGTGCGC GAGCTGCGCC CGGCGGGCCT TGCGGAAGCC
GCATGGCGCA ACTGGTACCG CTACCTGCCT TGGGAGGATT TCCGCGAGGG CCGGCCCCCG
GCGCTCGCCG AGATCGAGCC GCGCCTGATG GCCTGGGTCG CCGCCGCCTC CGATCCGAAG
CTCCGGCGCA TGCGCGAGGA CCGGGTCGGG CTGAGTTTCG GGATCGGCGG CGCCTGGAAC
GAGGAGCGGG TTCTGGAGCG CTACGAATTG CTGTTCGAAG CCGGGTTGAT CCCCGAAGCC
AACGGCCAGA ACGGCGCCGC CGTGCCCGAC GACCTCGCGA TCACCGGCCA GCCGATGGCC
CATGACCATC GCCGGGTGCT CGCCACGGCG ATCGGCCGCC TGCGCGGCAA GATCAAGTAT
CGCCCGGTGG TGTTCGAGTT GATGCCGCCG GCCTTCACCC TGCTTCAGCT TCAACGCACG
GTCGAGGCGC TCTCGGGCAT CCGGCTGCAC AAGCAGAACT TCCGCCGCCT CGTGGCGCAA
CAGGGCCTCG TCGAGGAGAC CGAGGCGCTC ACCAGCGGCA ATGCCGGGCG CCCGGCCCGG
CTGGTGCGCT TCCGCCGGGA AGTTCTCCTG GAGCGCCCCG CCCCCGGCGT TCGGCTCACC
CCGACGCGGC GAACGGTGTG A
 
Protein sequence
MPGPIRQDRN AGRNRFHRRG DAHEEGLKAD ERMSTAELGV TAAAADKGTR AASASSVGLV 
AVIVAATDGE PRALTVQVEG QAEGRESALP AGPLVPEHAT LERGLRAWVE QQTHQRLGYV
EQLYTFGDRD REGGQHDVHL LSVAYLALVR ELRPAGLAEA AWRNWYRYLP WEDFREGRPP
ALAEIEPRLM AWVAAASDPK LRRMREDRVG LSFGIGGAWN EERVLERYEL LFEAGLIPEA
NGQNGAAVPD DLAITGQPMA HDHRRVLATA IGRLRGKIKY RPVVFELMPP AFTLLQLQRT
VEALSGIRLH KQNFRRLVAQ QGLVEETEAL TSGNAGRPAR LVRFRREVLL ERPAPGVRLT
PTRRTV