Gene Mpe_A0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0803 
Symbol 
ID4784487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp840987 
End bp842516 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content70% 
IMG OID640089364 
Productputative sugar ATP binding ABC transporter protein 
Protein accessionYP_001020000 
Protein GI124265996 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGTC TCGAACTCCT CGGCATCAGC AAGCAGTACC CGGCCGTCAA GGCCAACGAC 
GGCGTGAGCC TGCGCGTCGC GCCGGGCCAG ATCCATGCGG TGCTGGGGGA GAACGGTGCG
GGGAAGTCGA CGCTGATGAA GATCATCTAC GGCGCGGTGC GGCCCGACGC CGGCGAGATG
GTCTGGGACG GGCGGCAAGT GCAGATCGCG AGCCCGGCGC AGGCGCGCCA GCTCGGCATC
AGCATGGTCT ATCAGCACTT CAGCCTGTTC GACACCCTGA CAGCGGCCGA GAACGTGTGG
CTCGGCCTCG ACAAGAGCCT GTCACTTGCC GAGGTGACGG TGCGCATCGT GCAGGTCGCC
AAGGTCTACG GACTCGAGGT CGACCCGCAG CGGCCGGTGC ATTCGCTGTC GGTCGGCGAG
CGCCAGCGCG TCGAGATCGT GCGCGCGCTG CTCACCGATC CCAAGTTGCT GATCCTTGAC
GAGCCGACCT CGGTGCTGAC GCCACAGGCC GTCGAGACGC TGTTCGTCAC GCTGCGCCAG
CTGGCCGAGC GCGGCTGCTC CATCCTCTAC ATCAGCCACA AGCTCGACGA GATCCGCGCG
CTGTGTCACC ACTGCACGGT GCTGCGCGGC GGCAAGGTCA CCGGCGAGGT CGATCCGCGC
ACGGTCGGCA ATGCCGACCT GTCGCGACTG ATGATCGGCG CCGAGCCGCC GAAGCTGCAG
CACATCCAGG CCCGACTCGG CGACGTGGCG CTGCAGGTGA GCGGCCTGAC GCTCGCGAAG
CAGTCGCCGT TCGGCACCGA CCTGAAGGAC ATCGGTTTCG AGGTGAAGGC CGGCGAGATC
GTCGGCGTGG CTGGCGTCTC GGGCAACGGC CAGCAGGAAC TGATGGCTGC GCTGTCGGGC
GAGGATCCGC GCGCGCCGCT CGGCGCGGTG AAATTGTTCG GGCACGACAT CGGCCGGCAG
CCGCCGCGCG CCCGCCGCGC GCTGGGGCTG CACTTCGTGC CCGAGGAGCG GCTGGGGCGC
GGCGCGGTGC CGACGCTGTC GCTCGCGGCG AACACGCTGC TGACGCGCAC CGAGGCGGTG
GGCCGCGGCG GCTGGCTGCA CATGGCCCGG GTGCATCGGC TCGCCGAGTC GCTGATCCGT
ACCTACAACG TCAAGGCCGG AGGGCCGAAG GCGGCCGCGA AGAGCCTGTC GGGCGGCAAC
CTGCAGAAGT TCATCGTCGG CCGCGAGATC GACGCGCGGC CGAAGCTGCT GATCGTCTCG
CAGCCGACCT GGGGTGTCGA CGTCGGTGCG GCGGCACAGA TCCGTGGCGC GCTGCTGAAG
CTGCGCGACG AGGGCTGCGC GGTGCTGGTG GTCAGCGAGG AGCTCGACGA GCTGTTCGAG
ATCAGCGACC GCCTGCTGGT CATCGCGCAG GGCCGCCTGA GCCCGAGCGT TGCGACGATG
CGGACCACGG TCGAGCTGAT CGGTGAGTGG ATGAGCGGGC TGTGGCCGAA GGCCGATACC
GCCAACGGCG AGGCGCTGCA CCATGCTTAA
 
Protein sequence
MLRLELLGIS KQYPAVKAND GVSLRVAPGQ IHAVLGENGA GKSTLMKIIY GAVRPDAGEM 
VWDGRQVQIA SPAQARQLGI SMVYQHFSLF DTLTAAENVW LGLDKSLSLA EVTVRIVQVA
KVYGLEVDPQ RPVHSLSVGE RQRVEIVRAL LTDPKLLILD EPTSVLTPQA VETLFVTLRQ
LAERGCSILY ISHKLDEIRA LCHHCTVLRG GKVTGEVDPR TVGNADLSRL MIGAEPPKLQ
HIQARLGDVA LQVSGLTLAK QSPFGTDLKD IGFEVKAGEI VGVAGVSGNG QQELMAALSG
EDPRAPLGAV KLFGHDIGRQ PPRARRALGL HFVPEERLGR GAVPTLSLAA NTLLTRTEAV
GRGGWLHMAR VHRLAESLIR TYNVKAGGPK AAAKSLSGGN LQKFIVGREI DARPKLLIVS
QPTWGVDVGA AAQIRGALLK LRDEGCAVLV VSEELDELFE ISDRLLVIAQ GRLSPSVATM
RTTVELIGEW MSGLWPKADT ANGEALHHA