Gene Mpe_A3374 details


Gene Information

Locus tag: Mpe_A3374
Symbol:
ID: 4786361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3585052
End bp: 3586299
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 64%
IMG OID: 640091950
Product: ABC transporter substrate-binding protein
Protein accession: YP_001022562
Protein GI: 124268558
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
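
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the 1248 bp include the stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3585052
END_BP = 3586299


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1248
print(protein_length(length_bp))  # 415
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.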


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.76232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAAA TCCGTCGCCG CCAACTGCTC CAGGGCACCG CCGGCATCCT CGCCACCGGC 
ATCTTCCCCG CCATCCACGC GCAGGAGAAG ATCACGCTGC GCTACCTGGG CACCGCGGTG
AACCAGGACA AGGCCATCGC CGAGAAGTTC AAGGCCGACA CCGGCATCGC CATCCAGTAC
ATCGCCGTCA ACACCGACGA GGTGACCAAG CGCGCGGTGA CGTCGCCGAA CAGCTTCGAC
CTGATCGACA CCGAGTACTT CTCGCTGAAG AAGATCGTGC CGACCGGCAA CCTCAAGGGC
ATCGACACCA AGCGCATCAA GAACGCCGAC AAGATCACCA CGCTGTTCAC CACCGGCGTG
GTGGGCGGCA AGGCGGTCGG CGACCAGGGA ACGGCGCCGA AGAAGGTGAT CTACCTGGAG
GGTGAGAAGA GCAAGGTGTT CGCCAAGGCG CCGACCCAGT TCATGAGCCT GATCCCCACG
GTCTACAACG CCGACACGCT GGGCATCCGG CCCGACCTGA TCAAGCGCCC GATCGACAGC
TGGGCCGAGT TGCTGAACCC GGAGTTCAAG GGCAAGGCCG CGATCCTCAA CATCCCGTCG
ATCGGCATCA TGGACGCGGC GATGGTCGTG GAGGCCAAGG GGCTCTACAA GTACCCCGAC
AAGGGCAACA TGACCCGGAA GGAAATCGAC CTCACGATCA AGACCCTGAT CGAGGCGAAG
AAGGCCGGCC AGTTCCGCTC GCTGTGGAAG GACTTCAACG AGAGCGTCAA CCTGATGGCC
TCCGGCGAGG TGGTGATCCA GTCGATGTGG TCGCCGGCCG TCACCGCGGT GCGCACCAAG
GGCATCGACT GCGTGTTCCA GCCGCTGAAG GAGGGCTACC GCGCCTGGGC GGCCGGCTTC
GGCGTACCGG CCACTTCGAG CGGCAAGAAG CTCGACGCCA CCTACGAGTT CATCAACTGG
TTCCTCGATG GCTGGGCCGG CGCCTACCTG AACCGCCAGG GCTACTACAG CGCCGTGCTG
GAGACCGCCA AGGCCAAGAT GGAAGCCTAC GAGTGGGCCT ACTGGATGGA AGGCAAGGCG
GCCGAGAAGG ACATCAAGTC GCCCAACGGC GATCTGCTGG CCAAGGCCGG CAGCAAGCGC
GACGGCGGCT CCTACGAGCA GCGCATGGGG GGCATCGCGT GCTGGAACGC GCTGATGGAC
GAGAACACCT ACATGGTCCA GAAGTGGAAC GAGTTTGTCG CGGCGTGA
 
Protein sequence
MSEIRRRQLL QGTAGILATG IFPAIHAQEK ITLRYLGTAV NQDKAIAEKF KADTGIAIQY 
IAVNTDEVTK RAVTSPNSFD LIDTEYFSLK KIVPTGNLKG IDTKRIKNAD KITTLFTTGV
VGGKAVGDQG TAPKKVIYLE GEKSKVFAKA PTQFMSLIPT VYNADTLGIR PDLIKRPIDS
WAELLNPEFK GKAAILNIPS IGIMDAAMVV EAKGLYKYPD KGNMTRKEID LTIKTLIEAK
KAGQFRSLWK DFNESVNLMA SGEVVIQSMW SPAVTAVRTK GIDCVFQPLK EGYRAWAAGF
GVPATSSGKK LDATYEFINW FLDGWAGAYL NRQGYYSAVL ETAKAKMEAY EWAYWMEGKA
AEKDIKSPNG DLLAKAGSKR DGGSYEQRMG GIACWNALMD ENTYMVQKWN EFVAA
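
The relationship between the two sequences above can be spot-checked by translating a few codons. The following is a minimal sketch, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the helper names `clean` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. The amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGTCCGAAA TCCGTCGCCG CCAACTGCTC"))  # MSEIRRRQLL
print(translate("GAGTTTGTCG CGGCGTGA"))               # EFVAA
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 415 residues.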