Gene Mpe_A0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0439 
Symbol 
ID4785429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp477931 
End bp479796 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content68% 
IMG OID640088997 
Productputative composite ATP-binding transmembrane ABC transporter protein 
Protein accessionYP_001019636 
Protein GI124265632 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5265] ABC-type transport system involved in Fe-S cluster assembly, permease and ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.130019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCG CTGAAACCCT GAGTCCGCCG CCGGCCGCTG CCGGCCCGGC TGCACCTGCT 
GCGGGCCCGG CCGCTGCGGC GCGCTCCGAC TGGGCGACGC TCGCCCGGCT GCTGCCCTAC
CTGTGGGTCT ACAAGTTCCG CGTGATCGCG GCGCTGGCCT GCCTGATCAC GGCCAAGGTC
GCCAACGTCG GCGTCCCGTT GCTGCTGAAG CAGTTGGTGG ATGCGCTGTC CATCCCGCTG
GGCGATCCGC GTGCCGCGCT GGTGGTGCCG GCGGGACTGC TGCTGGCCTA CGGCGCGCTG
CGGCTGTCGA CCTCGCTGTT CACCGAGCTG CGCGAGCTGG TGTTCGCCAA GGCGACCGAG
GGCGCCTCGC GCAGCATCTC GCTGCAGGTG TTCCAGCACC TGCACGCGCT GAGCCTGCGT
TTCCACCTCG AGCGCCAGAC CGGCGGCATG ACGCGCGACA TCGAGCGAGG CACGCGTGGC
GTGCAGTCGC TGATCTCGTA TTCCCTCTAC AGCATCCTGC CCACGCTGGT CGAGGTCAGC
CTCGTGCTCG GGCTGCTGGC GGTCAAGTTC GACGCGATGT TCGCGTGGAT CACGCTGGCC
GCGCTGGTGG TCTACATCGG GTTCACGGTG CTCGTGACCG AGTGGCGCAC GAAGTTCCGC
AAGACGATGA ACGAGCTGGA CTCGAGCTCG CATTCCAAGG CGATCGACTC GCTGCTGAAC
TACGAAACCG TCAAGTACTT CAACAACGAG GACTTCGAGG CGAAGCGCTA CGACGAGAGC
CTCGATCGGC TGCGCCGGGC GAAGCTCAAG TCGCAGTCGA CGCTGTCGCT GCTCAACACC
GGCCAGCAGC TGCTGATCGC CATCGCGCTG ATCGCGATGC TGTGGCGCGC CACGGAGGGC
GTGGTGTCCG GACGCATGAC GCTGGGCGAC CTGGTGATGA TCAACGCCTT CATGATCCAG
CTCTACATCC CGCTCAATTT CCTGGGCGTG ATCTACCGCG AGATCAAGCA GGCGCTGACC
GACCTCGACA AGATGTTCGA CCTGCTGGAG CGCGAGCGCG AGGTGCGCGA CGCACCCGAC
GCTCAGGACC TGCACGCCGA GGCGCCGCCG GTCGTGCGCT TCGAGAACGT CGGATTCGCC
TACGAAGCGG GGCGGCCCAT CCTGCACGGC CTGAGCTTCG AGATCCCGGC CGGCCACACG
ATCGCGGTGG TCGGGCCGTC GGGTGCGGGC AAGAGCACGC TGGCGCGGCT GCTGTACCGC
TTCTACGACG TGAGCGAGGG CCGCATCACG ATCGCCGGCC ACGACATCCG CCAGCTCACC
CAGGCCAGCC TGCGCCGCGC CATCGGCATC GTGCCGCAGG ACACCGTGCT GTTCAACGAC
ACCGTGGCCT ACAACATCGC CTACGGCCGA CCCGGGGCGA CCCAGGCCGA GATCGAGGCG
GCGGCCCAGG CGGCTCGCAT CCACGGTTTC ATCGCATCGA CCCCGAAAGG CTACGGCACG
ATGGTCGGCG AGCGTGGCCT GAAGCTGAGC GGCGGCGAGA AGCAGCGCGT GGCGATCGCG
CGCACGCTGC TGAAGAATCC GCCGATCGTG ATCTTCGACG AGGCCACCTC GGCGCTCGAC
TCGGCCAACG AGCGGGCCAT CCAGGCCGAG CTGCAGACCG CGGCGCGCAA CAAGACCGCA
CTGGTCATCG CGCACCGGCT GTCGACCGTG GTCGACGCCG ATCAGATCCT GGTGATGGAG
GCGGGCCGCA TCGTCGAGCG CGGCACGCAT GCGGAGCTGC TGGCCCGAGA AGGGCGCTAT
GCACAGATGT GGGCGCTGCA GCAGCAAGGC GGCGCCGAGG CGGCGGCCGA CGTCGCCGCG
TCCTGA
 
Protein sequence
MRRAETLSPP PAAAGPAAPA AGPAAAARSD WATLARLLPY LWVYKFRVIA ALACLITAKV 
ANVGVPLLLK QLVDALSIPL GDPRAALVVP AGLLLAYGAL RLSTSLFTEL RELVFAKATE
GASRSISLQV FQHLHALSLR FHLERQTGGM TRDIERGTRG VQSLISYSLY SILPTLVEVS
LVLGLLAVKF DAMFAWITLA ALVVYIGFTV LVTEWRTKFR KTMNELDSSS HSKAIDSLLN
YETVKYFNNE DFEAKRYDES LDRLRRAKLK SQSTLSLLNT GQQLLIAIAL IAMLWRATEG
VVSGRMTLGD LVMINAFMIQ LYIPLNFLGV IYREIKQALT DLDKMFDLLE REREVRDAPD
AQDLHAEAPP VVRFENVGFA YEAGRPILHG LSFEIPAGHT IAVVGPSGAG KSTLARLLYR
FYDVSEGRIT IAGHDIRQLT QASLRRAIGI VPQDTVLFND TVAYNIAYGR PGATQAEIEA
AAQAARIHGF IASTPKGYGT MVGERGLKLS GGEKQRVAIA RTLLKNPPIV IFDEATSALD
SANERAIQAE LQTAARNKTA LVIAHRLSTV VDADQILVME AGRIVERGTH AELLAREGRY
AQMWALQQQG GAEAAADVAA S