Gene Mext_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4149 
Symbol 
ID5832504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4615910 
End bp4616800 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content68% 
IMG OID641369939 
Productextracellular solute-binding protein 
Protein accessionYP_001641589 
Protein GI163853546 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.514926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCG TGAACGGACG GCGCCGTACA GCCGCATCGG TCGTCGCTCT GACCGCCTTC 
GCCGCCCTGT GCGCACCCGC CCAGGCGCAG GACACGAAAG CCACCTCGAA AGCCGCCGAG
GCCGCCAAAC CCGATGCCGG GACCTTGCGC GTCTGCGCCG CCGAGCAGCC GCCGCTCTCG
ATGAAGGACG GCTCGGGGCT CGAGAACCGC ATCGCGACGA CGGTGGCCGA GGCCATGGGC
CGCAAAGCCC AGTTCGTCTG GCTCGGAAAG CCCGCGATCT ACCTCGTGCG CGACGGGCTG
GAGAAGAAGA CCTGCGACGT GGTGATCGGG CTCGATGCCG ACGACGCCCG CGTGCTGACC
AGCAAGCCCT ATTACCGCTC GGGCTACGTC TTCCTCACCC GCGCCGACAA GGATCTCGAC
GTCAAGTCCT GGTCCGATCC GCGCCTGAAG GACGTCAGCC ACATGGTGGT CGGCTTCGGC
ACGCCCGGCG AGGCGATGCT CAAGGATATC GGCCGCTACG AGGAGGACAT GGCCTACCTC
TACTCGCTGG TGAACTTTCG CGCGCCGCGG AATCAATACA CCCAGATCGA TCCGGCCCGG
ATGGTGAGCG AGGTCGCCAC CGGCAAGGCC GAGGTCGGCG TGGCCTTCGG GCCCGACGTC
GCCCGCTACG TGCGCGATTC CTCGACCAAG CTGCGCATGA CCCCCGTGCC CGACGACACG
CAGGCCAGCG ACGGCCGGAA GATGCCGCAG AGCTTCGACC AGGCGATGGG CGTGCGCAAG
GACGACACCG CCCTGAAGGC GGAGATCGAC GCCGCCCTGG AGAAGGCCAA GCCGAAGATC
GAGGCGATCC TGAAGGAAGA AGGCGTGCCC GTGCTGCCCG TCTCCAACTG A
 
Protein sequence
MSLVNGRRRT AASVVALTAF AALCAPAQAQ DTKATSKAAE AAKPDAGTLR VCAAEQPPLS 
MKDGSGLENR IATTVAEAMG RKAQFVWLGK PAIYLVRDGL EKKTCDVVIG LDADDARVLT
SKPYYRSGYV FLTRADKDLD VKSWSDPRLK DVSHMVVGFG TPGEAMLKDI GRYEEDMAYL
YSLVNFRAPR NQYTQIDPAR MVSEVATGKA EVGVAFGPDV ARYVRDSSTK LRMTPVPDDT
QASDGRKMPQ SFDQAMGVRK DDTALKAEID AALEKAKPKI EAILKEEGVP VLPVSN