Gene Mpe_A3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3587 
Symbol 
ID4786172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3796759 
End bp3798015 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content65% 
IMG OID640092169 
Productbranched-chain amino acid ABC transporter, periplasmic amino acid-binding protein 
Protein accessionYP_001022775 
Protein GI124268771 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAG ATCGTGATGA CGCCCCGTTC TCCGTCGATC GCCGTCGCCT GCTGCAGGGC 
CTCGGCGCGC TGCCGCTGGC CGGCATGCCG GCCTGGGCGC TGGCCCAGCA GTTCCCGACC
GCCAAGGTCA ACACCACCAA GCTCGCCGTC ACCGACACCG AGGTGACGGT GGGGCAGCTG
CACTCCAAGA CCGGCACGAT GGCCATCTCG GAGACCGGCT CGGTGCAGGC CGAGCAGCTG
GCCATCGACC AGATCAACGC GATGGGCGGC ATCCTCGGCC GCAAGATCAA GGTGATCTCC
GAGGACGGCG CCTCCGACTG GCCGAACTTC GCCGAGAAGA GCAAGAAGCT GCTCGTCAAC
GACCGCGTCG CCACCGTGTT CGGCTGCTGG ACCAGCGCCT CGCGCAAGGC GGTGCTGCCG
GTGTTCGAGA AAGAGAACGG CCTGCTGTAC TACCCGACCT TCTACGAAGG CCTGGAGCAG
AGCAAGAACG TCATCTACAC CGGCCAGGAA GCCACCCAGC AGATCATCTG GGGCCTGGAC
TGGGGCGCGA AGGAGAAGAA GGCCAAGACC TTCTTCCTGG TCGGCTCCGA CTACATCTGG
CCGCGCACCT CGATGAAGAT CGCGCGCAAG CACATCGAGA ACTTCCAGAA GGGCACGGTC
AAGGGCGAGG AGTACTACCC GCTGGGCCAC ACCAACTTCA ACTCGCTGAT CAACAAGGTC
AAGGTCGCCA AGCCCGACTG CATCTTCGCG GCGGTGGTAG GCGGCTCCAA CGTGGCCTTC
TACAAGCAGC TCAAGGCCGC CGGCATCACC GGCGACAAGC AGTTCCTGCT GACGCTGTCG
GTGACCGAAG ACGAGATGAC CGGCGTGGGC GGCGAGAACT TCGCCGGCTT CTACTCGTCG
ATGAAGTACT TCCAGTCGCT GACCAACGAC AACAACAAGA AGTTCGTCGA GGCCTTCAAG
GCCAAGTACG GCAAGGACGC CGTGATCGGC GACGTGACGC AGGCCGGGTA CCTGGGCCCG
TGGCTGTGGA AGGCGGCGGT CGAGAAGGCC GGCAGCTTCG ACGTCGACAA GGTGGTCGCG
GCCTCGCCCG GCATCGAACT GAAGACCGCG CCCGAGGGCT ACGTGAAGCT CGACGCCAAC
CACCACCTGT GGAGCAAGGC GCGCATCGGC CAGGGCATGC CCGACGCGAC CTTCAAGGTG
GTGGCGGAGT CGCCCGAGCT GATCAAGCCG GACCCGTTCC CCAAGGGATA TCAATAA
 
Protein sequence
MSQDRDDAPF SVDRRRLLQG LGALPLAGMP AWALAQQFPT AKVNTTKLAV TDTEVTVGQL 
HSKTGTMAIS ETGSVQAEQL AIDQINAMGG ILGRKIKVIS EDGASDWPNF AEKSKKLLVN
DRVATVFGCW TSASRKAVLP VFEKENGLLY YPTFYEGLEQ SKNVIYTGQE ATQQIIWGLD
WGAKEKKAKT FFLVGSDYIW PRTSMKIARK HIENFQKGTV KGEEYYPLGH TNFNSLINKV
KVAKPDCIFA AVVGGSNVAF YKQLKAAGIT GDKQFLLTLS VTEDEMTGVG GENFAGFYSS
MKYFQSLTND NNKKFVEAFK AKYGKDAVIG DVTQAGYLGP WLWKAAVEKA GSFDVDKVVA
ASPGIELKTA PEGYVKLDAN HHLWSKARIG QGMPDATFKV VAESPELIKP DPFPKGYQ