Gene Mpe_A1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1903 
Symbol 
ID4786724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2038967 
End bp2040097 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content63% 
IMG OID640090473 
Productpolyamine ABC transporter system, substrate-binding protein 
Protein accessionYP_001021096 
Protein GI124267092 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0765034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.376204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGT TGTCGTCGCG CCGTGCGGTC GGTGCCCTGG GCCTGGTCGC CAGCGTCGCT 
GCCGCGTGCG GCCTGCTGAG CAGCCCGGTG CATGCTCAGG AGGAGAAGGT CCTCAATGTC
TACAACTGGT CCGACTACAT TGCCGAGGAC ACGCTGGCCA ACTTCGAGAA GGAAACCGGC
ATCAAGGTTC GCTACGACAA CTTCGACAAC AACGAGATCG TCCATGCCAA GCTGGTGGCC
GGCAAGACCG GCTACGACGT GGTGGTGCCC TCCTCCTACT GGGCCAAGCT GCAGGCCGAC
GGCGGCCTCC TGCAGAAGCT GGACAAGGCG CAGCTGCCGA ATCTGAAGCA CCTCGACCCG
GCGCTGCAGG AGCAGCTCGC CAAGCTCGAC CCGGGCAACC AGTACCAGGT CAACTGGCTG
TGGGGCTACA CCACCGTCGG CATCAACGTC GACAAGGTCA AGGCCGCCCT GGGCAACCTG
CCGATGCCCG ACAACGCCTG GGACCTGGTC TTCAAGCCGG AGTACATCTC CAAGCTCAAG
AGCTGCGGCG TGAGCATGCT GGACTCCGCC ACCGAGGTGG TCCCGGCGGC CCTGCACTAC
CTGGGCAAGC CCGCGTACAG CAAGAACCAG GCTGACTACG CCGGCGTCGC ACCGCTGCTC
AAGAGCGTGC GGCCCTACGT GACGCTGTTC AGCTCGTCCG GCTACATCAA CGACATGGCC
AACGGCTCGA TCTGTCTTGC ACTGGGTTGG TCGGGCGACA TCAACATCGC GAGGCAGCGT
GCCATCGACG GCAAGACCGG CCAGAAGATC GAGGCCCTGA TCCCGAAGAC CGGCGGCGTG
CTGTTCTTCG ATGTGATGGT GATCCCGGCC GATGCCCCGC ACCCCGGCAA TGCCCACAAG
TTCATCAACT ACATCCTGCG GCCCGAGGTG GCGGCCAGCC TGACCAACAA GGTGTTCTAC
GCCAACCCGA ACAAGGAATC GAAGAAGTTC GTCAAGCCCG AGATCGCCGG CAATGCCACC
GTGTTCCTGA ACGAGGCCGA CCTGAAGAAG ATGGCCGCGC CCGACAGCAT CGGCAGCGAC
ATCCGGCGGA CCATGACGAG GCTGTACACG TCGTTCAAGA CCGGTATCTG A
 
Protein sequence
MKLLSSRRAV GALGLVASVA AACGLLSSPV HAQEEKVLNV YNWSDYIAED TLANFEKETG 
IKVRYDNFDN NEIVHAKLVA GKTGYDVVVP SSYWAKLQAD GGLLQKLDKA QLPNLKHLDP
ALQEQLAKLD PGNQYQVNWL WGYTTVGINV DKVKAALGNL PMPDNAWDLV FKPEYISKLK
SCGVSMLDSA TEVVPAALHY LGKPAYSKNQ ADYAGVAPLL KSVRPYVTLF SSSGYINDMA
NGSICLALGW SGDINIARQR AIDGKTGQKI EALIPKTGGV LFFDVMVIPA DAPHPGNAHK
FINYILRPEV AASLTNKVFY ANPNKESKKF VKPEIAGNAT VFLNEADLKK MAAPDSIGSD
IRRTMTRLYT SFKTGI