Gene Mpe_A3600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3600 
Symbol 
ID4786126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3810948 
End bp3812405 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content68% 
IMG OID640092182 
Producthypothetical protein 
Protein accessionYP_001022788 
Protein GI124268784 
COG category[C] Energy production and conversion 
COG ID[COG2421] Predicted acetamidase/formamidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.229476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.563302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAGC ACATCTGCGG TCCCGGCTGC AACCACGGCC CTGCCACGCC CGAGGCCGAG 
GCAGAAGTCC GCGAGGAGTT CAAGGTCGCG CGCCGCAGCT TCCTGCGCGA CGCGCTGGCG
GTCGGCGGCG CCACCGTGTC GGCGGCCTCT CTGAACGTGG CGATGACGCC CAGCGCCTTC
GCGCAGAGCG CCGCGAAGCC GGGCTCGGGG CTCACCTCGC ACTACTACAT CCCGGCCTCG
GCCGGCACCG TGCTGTGGGG CTACTTCAGC AAATCGGCCA AGCCGGTGGT GGAAGTGGAG
TCGGGCGATT TCGTGACCAT CGAGACCCTG ACCCACCACG CCAACGACGA CGCCGAGCGC
ATGATCAAGG GCGACCCCGG CGCCGAGAGC GTGTTCCACT GGGACGCCAA GAAGAAGGGC
GTGAACCGCC GTGGCGCCGG CGCGATGGAC GCCAAGGTGG GCGCCGGTGG CGGCGAAGGC
GTGCACATCT GCACCGGGCC GGTGCGCATC AAGGGCGCGG AGCCGGGCGA CATCCTGGAA
GTGCGCATCG TCGACGTGGC CACGCGGCCC AGCGCCAACC CGGCCTACAA GGGCCGCGCC
TTCGGCAGCA ACGCCGCCGC GTGGTGGGGC TTCCACTACG GCGACACGAT CACCGAGCCG
AAGAAGCGCG AGGTGATCAC CATCTACGAG GTCGACGCCA CCGGCGAGCG CAACTGGGCG
CGCGCGGTCT ACAACTTCAA GTGGACGCCG CAGACCGACC CCTTCGGCGT GGTGCACCCG
ACGATCGACT ACCCCGGCGT GCCGGTCGAC CACCGCACCA TCACCAAGAA CGAGAATGTG
CTGAAGAACA TCCGCATCCC GGTGCGGCCG CACTTCGGGA CCATGGGCGT GGCGCCGGTC
GAGGCCGAGA TGGTGAACTC CATCCCGCCC AACTACACCG GCGGCAACAT CGACAACTGG
CGCATCGGCA AGGGCGCCAC CGTCTACTAC CCGGTGGCCG TGCCCGGCGC CATGTTCTCG
GTGGGCGACC CGCACGCGTC GCAGGGCGAC TCCGAGCTCT GCGGCACCGC CATCGAGTGC
TCGCTGACCG GCACCTTCCA GCTCATCCTG CACAAGAAGG CCAGCCTGCC CGGCACGCCG
CTGGCCGAGC TGAAGTACCC GCTGCTCGAG ACGCAGGACG AGTGGCTGCT GCACGGCTTC
AGCTACGCCA ACTACCTGGC CGAGCTGGGC CCCAATGCGC AGAACGACAT CTACAGCAAG
TCATCGGTCG ACAAGGCGCT GCGCGACGCG TATCACAAGA TGCGCCATTT CCTGATGACC
ACGCAGGGCC TGGGCGAGGA CGAGGCGATC TCGCTGATGT CGATCGCGGT CGACTTCGGC
ATCACCCAGG TAGTGGACGG CAACTGGGGC GTGCACGCGG TGGTGAAGAA GAGCATCTTC
CCCGCGCGCG GCGGCTGA
 
Protein sequence
MYEHICGPGC NHGPATPEAE AEVREEFKVA RRSFLRDALA VGGATVSAAS LNVAMTPSAF 
AQSAAKPGSG LTSHYYIPAS AGTVLWGYFS KSAKPVVEVE SGDFVTIETL THHANDDAER
MIKGDPGAES VFHWDAKKKG VNRRGAGAMD AKVGAGGGEG VHICTGPVRI KGAEPGDILE
VRIVDVATRP SANPAYKGRA FGSNAAAWWG FHYGDTITEP KKREVITIYE VDATGERNWA
RAVYNFKWTP QTDPFGVVHP TIDYPGVPVD HRTITKNENV LKNIRIPVRP HFGTMGVAPV
EAEMVNSIPP NYTGGNIDNW RIGKGATVYY PVAVPGAMFS VGDPHASQGD SELCGTAIEC
SLTGTFQLIL HKKASLPGTP LAELKYPLLE TQDEWLLHGF SYANYLAELG PNAQNDIYSK
SSVDKALRDA YHKMRHFLMT TQGLGEDEAI SLMSIAVDFG ITQVVDGNWG VHAVVKKSIF
PARGG