Gene Mpe_A1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1940 
Symbol 
ID4786701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2076133 
End bp2077260 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content63% 
IMG OID640090510 
Productputative secreted substrate binding protein 
Protein accessionYP_001021133 
Protein GI124267129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.15659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0532415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTCA AACTCAAAGC CATCGCCCTT GCAACGGCCC TGCTCGCGAC CGGCGCCGTC 
TCGGCCCAGG AAGTGATCAA GATCGGCCAC GTCGCCCCCA TCTCCGGTGC CCAGGCCCAC
TACGGCAAGG ACAACGAGAA CGGCGCCCGG ATGGCGATCG AGGAACTCAA CACCCAGAAC
ATCACGATCG GTGGCAAGAA GGTCAAGTTC GAACTGGTTG CGGAAGACGA CGCTGCCGAC
CCGAAGCAGG GCACGGCCGC CGCCACCAAG CTGTGCGATG CCAAGGTCAA CGGTGTGGTC
GGTCACCTGA ACTCCGGCAC CACCATTCCC GCCTCGAAGA TCTACAACGA CTGCGGCATT
CCTGAGATCA CCCCGTCGGC CACGAACCCC AAGTACACGC AGCAGGGCTT CAAGACCGCT
TTCCGCATCC TGGCCAACGA CAACGCGCTC GGCGCCGGCC TGGCTTTGCA CGCCGCCAAC
AACCTGAAGC TCAAGAAGGT CGCGATCATC GATGACCGCA CTGCCTACGG GCAGGGTGTG
GCCGAGGTGT TCAAGAAGAC TGCCCAGGCC AAGGGCATCC AGATCGTCGA TGAGCAGTAC
ACCACCGACA AGGCCACCGA TTTCATGGCG ATCCTGACCT CGATCAAGTC GAAGGGTCCG
GATGGCGTGT TCTACGGCGG CATGGACCCG CAAGCCGGCC CGATGCTGCG CCAGATGGAG
CAACTCGGCC TGTCGAACGT CAAGTTCTTC GGCGGCGACG GCGTGTGCAC CGCCAAGCTC
GCCGACCTGT CGGCCGGCGC CAAGACGCTG GGCAACGTGG TCTGCGCCGA AGGCGGCTCC
TCGCTCGAGA AGATGCCCGG CGGTACCGCC TGGAAGGCCA AGTACGACGC GAAGTATCCC
GGCCAGTTCC AGGTCTACTC GCCCTACGTC TACGACGCGG TATTCGTGCT GGTCGACGCC
ATGAAGCGCG CCAACTCGGC CGACCCCAAG GTCTACGGCC CGAAGCTGTT CGAAACCAAC
TACACCGGCG TGACCGCGAA GGTGGCCTTC GAGAGCGATG GTGAACTGAA GAACCCGGCG
ATGACCCTGT ACGTCTACAA GGACGGCAAG AAGGTCCCGC TGAACTGA
 
Protein sequence
MQLKLKAIAL ATALLATGAV SAQEVIKIGH VAPISGAQAH YGKDNENGAR MAIEELNTQN 
ITIGGKKVKF ELVAEDDAAD PKQGTAAATK LCDAKVNGVV GHLNSGTTIP ASKIYNDCGI
PEITPSATNP KYTQQGFKTA FRILANDNAL GAGLALHAAN NLKLKKVAII DDRTAYGQGV
AEVFKKTAQA KGIQIVDEQY TTDKATDFMA ILTSIKSKGP DGVFYGGMDP QAGPMLRQME
QLGLSNVKFF GGDGVCTAKL ADLSAGAKTL GNVVCAEGGS SLEKMPGGTA WKAKYDAKYP
GQFQVYSPYV YDAVFVLVDA MKRANSADPK VYGPKLFETN YTGVTAKVAF ESDGELKNPA
MTLYVYKDGK KVPLN