Gene Mext_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1806 
Symbol 
ID5832170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2027993 
End bp2028988 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID641367605 
Productputative ABC transporter periplasmic solute-binding protein 
Protein accessionYP_001639276 
Protein GI163851233 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.965356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCAC GGCGCGGAAT GCTCGCGACG GCGGCTCTGT CGCTGGCCGC GACGCTGCCG 
ATGGCGCGCC GCGCGCAGGC GGCGGGCGAG ACGTTCCGGC TCGGCGTTCT GCCTTTCGGC
ACCGCCTCCT GGGAGGCTGC CGTTATCAAG GCGCGGGGCT TTGATACGGC CAATGGCTTC
ACCCTCGATA TCGTCAAGCT GGCCGGCAAC GATGCCGCCC GTATCGCCTT CCTCGGCGGT
CAGGTCGATG CCATCGTCGG CGACCTGATC TTCGCCGCCC GCCTCGGCAA CGAGGGGCGG
GGCGTGCGCT TCTCGCCCTA TTCCACCACC GAAGGGGCGC TGATGGTGCC CGCCGGAAGC
CCGATCACGG ATTTGAAGGG GCTCGCGGGC AAGCGGCTCG GGGTGGCGGG CGGCGCGCTC
GACAAGAACT GGATCCTGTT GAGGGCGCAG GCGCGCGAGA CGGCCGGGCT CGAGCTCGAG
AACGTCGCGC AGATCGCCTA CGGCGCGCCA CCGCTGCTGG CGCAGAAGCT GGAGACCGGC
GAGCTCGACG CGGCTCTGCT CTACTGGCAG TTCTGCGCCC GCCTCGAAGC CAAAGGCTTC
AAGCGGCTGA TTTCGGCCGA CGACGTCATG CGGGCCTTCG GCGCCAAGGG CGCGGTCTCG
CTGATCGGCT ATCTCTACGA GGGCCACACC GTGGCCGACC GGGGCGAGGT GGTGCGCGGC
TTCGCCCGCG CCTCGGCCGC TGCCAAGGAC GCGCTGGCGA ACGAGCCGGC CCTGTGGGAG
ACGGTCCGTC CGCTGATGGC GGCGGAGGAC GACGCCACCT TCGCCACGCT CAAGCGCGAT
TTCCTCGCCG GAATCCCGCG CCGGCCGATC GCCGCCGAGC GCGCCGACGG CGAGCGCATC
TACGCGGCGC TGGACCGGCT CGCAGGCGCG CAGCTCCTCG GCGTGGGCAA GAGCCTGCCG
CCGGACCTCT ATCTCGACGC CTCGGGCAAC GGCTGA
 
Protein sequence
MLSRRGMLAT AALSLAATLP MARRAQAAGE TFRLGVLPFG TASWEAAVIK ARGFDTANGF 
TLDIVKLAGN DAARIAFLGG QVDAIVGDLI FAARLGNEGR GVRFSPYSTT EGALMVPAGS
PITDLKGLAG KRLGVAGGAL DKNWILLRAQ ARETAGLELE NVAQIAYGAP PLLAQKLETG
ELDAALLYWQ FCARLEAKGF KRLISADDVM RAFGAKGAVS LIGYLYEGHT VADRGEVVRG
FARASAAAKD ALANEPALWE TVRPLMAAED DATFATLKRD FLAGIPRRPI AAERADGERI
YAALDRLAGA QLLGVGKSLP PDLYLDASGN G