Gene Mpe_A3302 details

Gene Information

Locus tag: Mpe_A3302
Symbol:
ID: 4786461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3509706
End bp: 3511025
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 63%
IMG OID: 640091875
Product: hypothetical protein
Protein accession: YP_001022490
Protein GI: 124268486
COG category:
COG ID:
TIGRFAM ID:
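
The start, end, and length fields above are consistent with one another, and the short sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive at both ends, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3509706
end_bp = 3511025

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1320 439
```

Both results match the Gene Length (1320 bp) and Protein Length (439 aa) fields.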


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.181488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACGA AGATTCAACA AGTCGGCCTG GCCTTGCTCA CCGCAGGCCT GGCCTTCGCC 
AGCGTCCCCG CCGGCGCTGC CACCGAGGCC GACGTCGAGG CCTCGTTCAA CCCTTACAAG
AACGGCTTCC CGAGCTTCTC CGGGATGGCG CCCGGCCTCG TCATCAACAA GGCCAACGTC
GAGCAGTTCA AGGACGCGCT GGACCCCGGC CTCTACGAGA TCGTCAAGAA CGGCTGGGAC
GAGATCAAGG TCGGCCCGAC CACGCAGTTC ACGATGGACA AGTTCTACGT CGAGGCGACG
AAGAAGAACC TGAACACGAC CAAGCTGGGC GCCCAGAGCG GCGAGATCAC CGGCTTCGTG
GCCGGGCGGC CCTTCCCGGA GGAGCCCAAG GTCGATGACC CGCGCGCGGG CGAGAAGATC
GCCTGGAACT ACAAGTACGG CGTCAACTGG GGTGACAACG CGGCGATCTA CCCGTTCTAC
TGGAAGTACC GCAACATGAC GAGCGGCCAG GTCGAGCGCA CGCTGAAGTT CAACTTCCAC
TTCCTGAACT TCAAGCACCG CGTGCAGCAC GCGCCGGTGC CCGAGGTGAC GCCCAACCCC
TCCGATCTGT TCCGCGGCAT CTACCTCACG GTGCTCGAGC CGCAGGACCT GAAGAACACG
CAGCTCCTGA TCCAGCGCTA CGAGAACGAC CTGAAGTTCG ACGACGCCTA CCTCTACCTG
GGCTTCCAGC GCCGCGTGCG CCGGCTCGCC ACCGGGCAGA CGACGGACGC CTTCCTCGGC
GCGGACCTGA TGATCGAGGA CTTCGAGGGC TACAACGGCC GCATCTCGGA CATGAAGTGG
ACGTTCAAGG GCACGAAGAA CATCCTGATG CCCTACTACA ACCATAACGA GCTGCAGCTG
ACCGACGAGT TCAAGGACCC GAGCGGCTAC AAGTTCGTGG ACTTCGGCGG CCAGGGTGGC
TGCTTCCCGA AGATCACCTG GCAGCTGCGC AAGGTCTATG TGCTGGAGGC CGCACCGGTG
AACCCGGCGC ACCCGATCAG CAAGCGCGTC TTCCACGTCG ATGCGCAGGT CTACAACATC
AACCGCACGC TGATCTACGA CCGCAAGGGC GAGCTGTGGA AGACCTTCAC GATCGGCAAG
TCGCACCCCG ACCACCACCT GCCGGTCAAC AAGGGCACCG GCATCGCGAT CGACGACTCG
TTCTCGATGG TCGACGTGCA GGCCAAGCAC TGCACCACCG GCCAGTTCAA GGGCCAGGTC
GATCCCAAGC TCAACCCCCC AAGCCTGTTC CAGGTGCAGA ACCTGCGCGG AGGAGACTGA
 
Protein sequence
MTTKIQQVGL ALLTAGLAFA SVPAGAATEA DVEASFNPYK NGFPSFSGMA PGLVINKANV 
EQFKDALDPG LYEIVKNGWD EIKVGPTTQF TMDKFYVEAT KKNLNTTKLG AQSGEITGFV
AGRPFPEEPK VDDPRAGEKI AWNYKYGVNW GDNAAIYPFY WKYRNMTSGQ VERTLKFNFH
FLNFKHRVQH APVPEVTPNP SDLFRGIYLT VLEPQDLKNT QLLIQRYEND LKFDDAYLYL
GFQRRVRRLA TGQTTDAFLG ADLMIEDFEG YNGRISDMKW TFKGTKNILM PYYNHNELQL
TDEFKDPSGY KFVDFGGQGG CFPKITWQLR KVYVLEAAPV NPAHPISKRV FHVDAQVYNI
NRTLIYDRKG ELWKTFTIGK SHPDHHLPVN KGTGIAIDDS FSMVDVQAKH CTTGQFKGQV
DPKLNPPSLF QVQNLRGGD
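
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own tooling, of how the blocked gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons).

```python
from itertools import product

# Standard amino-acid assignments listed in TCAG codon order.
# NCBI translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACAACGA AGATTCAACA AGTCGGCCTG GCCTTGCTCA CCGCAGGCCT GGCCTTCGCC"
print(translate(first_line))  # MTTKIQQVGLALLTAGLAFA
```

The printed residues agree with the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.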