Gene Mext_1590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1590 
Symbol 
ID5835819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1772416 
End bp1773516 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content71% 
IMG OID641367388 
Producthypothetical protein 
Protein accessionYP_001639060 
Protein GI163851017 
COG category[S] Function unknown 
COG ID[COG4093] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.625227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0229246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGA CACCCGATCC CATTCCCGGC GGATCGCCCC CCCGGCGGCG TATCGGCCTG 
TTCCTCCCCT ACATCCTGCT CGCCATCCTC GTCGTCGCCT GGACCGCCGC ATGGTTCTTC
ATCCGCGGCA AGGCCGAGAG CGAGATGGAT GCGTGGCTCG CCCGTGAAGC GCAGGCCGGC
CGGCAATGGA CCTGCGCCGA CCGCTCGATC ACCGGCTATC CCTTCCGCCT CGAACTGCGC
TGCGGCTCGG TCCGCTTCGC CCGCTCCGAC GGCAATTTCA CCCTCGGGCC GACCACCGCC
GTGGTGCAGG TCTACGATCC GCGCCACGCC GTTCTCGAAG TCGCCGGCCC CTTCCGTGTC
GAGCAGGGTG ATCTGACCGC GGACGTGACC TGGACGTCTC TCGAAGCGAG CTTCCACGCC
GCCTCGAACG GCTTCAGCCG CGCCTCCGTC GTCGTCGATG GTCCCAAGGG CACGGTGCAA
TCCCCCGATC CGGGCCCGGT GGACTTCGCC GCCCAGCACC TCGAACTCCA CGCCCGGCCC
ACCCCCGGCC GCTTCGACAG CGACGGCGCC GTCGACATCA GCCTGCGCCT CGCCAAGGCC
GCCGTGCCGC AGCTCGACGC CTTGAGCGGC AGCGGCGACC CGGCCGATGT CGATCTCGAC
ACGACCATCG AGCGAGCCAC GGTGCTGCGC ACCGGCACGG TGGCGCGGGA ACTGGAGAAA
TGGCGTCAGG CCGATGGCCG CCTCGACGTG ACCCGCCTGT CGATCGCCAA GGGCGAGCGC
CGCCTTCAGG CCAAGGGCGA AGTCGGTCTC GACGAGGCGC ATCGCCCCGA GGGACGCTTC
GAGATCCGCG CGCTCGGGCT CGAAGCCCTG GTCGGGCAGG TGATGGGCCA GCGCTACGGC
TCGGACAAGG GCGCGTTGAT CGGCAACCTC GTCGGCCAGT TCCTGGGTGG CCTGCGCAAG
CGCGAGAGCG CCGCCGGCGA GGTGCAGGCG GCCGACAGCC CCAACGGCCT CAAGCCCCTG
CCGACGCTGC GCCTGGGCGA CGGGCGCCTG ATGCTTGGCC CGCTCGCCGT GCCGAACGTG
GTGTTACCGG CCCTGTACTG A
 
Protein sequence
MAQTPDPIPG GSPPRRRIGL FLPYILLAIL VVAWTAAWFF IRGKAESEMD AWLAREAQAG 
RQWTCADRSI TGYPFRLELR CGSVRFARSD GNFTLGPTTA VVQVYDPRHA VLEVAGPFRV
EQGDLTADVT WTSLEASFHA ASNGFSRASV VVDGPKGTVQ SPDPGPVDFA AQHLELHARP
TPGRFDSDGA VDISLRLAKA AVPQLDALSG SGDPADVDLD TTIERATVLR TGTVARELEK
WRQADGRLDV TRLSIAKGER RLQAKGEVGL DEAHRPEGRF EIRALGLEAL VGQVMGQRYG
SDKGALIGNL VGQFLGGLRK RESAAGEVQA ADSPNGLKPL PTLRLGDGRL MLGPLAVPNV
VLPALY