Gene Mpe_A1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1224 
Symbol 
ID4785124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1319578 
End bp1321482 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content72% 
IMG OID640089789 
Producthypothetical protein 
Protein accessionYP_001020421 
Protein GI124266417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCGA GGTCCTTCCT GTCGCGCGCA ACGAAGCGGC TCCTTGCCAC GGCCGCATTC 
GCGGCGCTGT TGCCGGCCGG CGGCGCGCTG GCGGCCGAGC CCCAGCCGCA GCGGGTGATC
AAGGACCCGC ACTACGGCGA CACGCTGTTC CACTTCTTCC AGGACCACTA CTTCACGTCG
ATCACCACGC TGATGACCTC GCAGCACTTC GACCGCGTGC CCGCGCACGC CGACGATGCC
GAAGTGCTGC GCGGCGGGCT GCTGCTGTCC TACGGCCTGC ACCGCGAGGC CGGTGAGATC
TTCGCGCAGC TGATCGAGAA GGGTGCGTCG CCGGCGGTGC GCGATCGTGC CTGGTACTAC
CTGGCCAAGA TCCGCTACCA GCGCGGCTTC CTCGCCGAGG CCGAGCAGGC CATCGACCGG
GTCGAGAACC ACCTGCCACC GGCGCTGGAA GACGACCGCG GCCTGCTCAA GGCCAACCTG
CTGATGGCGC GTGGCGACCA CGCCGGCGCC GCCGCCGTGC TGAACGGCAT GGCCAAGCGC
CCCGGCGCCG GCCAGTACGC GCGCTTCAAT CTCGGCGTAG CGCTGGTGCG CAGCGGCGAT
ACCGCAGGCG GCAGCGCGCT GCTCGACGAG ATCGGCCGTG CGCCGCAGCC GGACGAGGAG
TTCCGGACGC TGCGCGACAA GGCCAACGTG GCGCTCGGCT TCGCTGCGCT GCAGGACGAG
CGGGCCGAGG CGGCGCGCGG CTACCTGGAG CGCGTGCGGC TGGAGAGCCT GCACGCCAAC
AAGGCTCTGC TCGGCTTCGG CTGGGCGGCC GCGGCGCTCA AGCAGCCGGC CAAGGCCCTG
GTGCCGTGGA CCGAACTGGC GCAGCGCGAC GGCAGTGACG CGGCGGTGCT CGAGGGCCGC
ATTGCCTTGC CCTATGCGTA TGCCGAGCTG GGGGCTCTGG GTCAGGCGCT GGCGGGCTAC
AACGAGGCGA TCGCCGCCTA CGACCGCGAG GCGGCGCACC TGAACGAATC CATCGCGGCG
ATCCGCGCCG GCAAGCTGGT CGAGGGCCTG ATCGACCGCA ACCCCGGCGA CGAGATGGGC
TGGTTCTGGA CGCTGCGGGA GCTGCCCGAA CTGCCGCACG CCGGGCATCT GGCGCAGGTG
CTGGCGGAAC ACGAATTCCA GGAGGCCTTC AAGAACTACC GCGACCTGCG TTTCCTGTCC
AACAACCTGC AGCACTGGGC CGACAACCTC GGCGTGTTCG GCGACATGCT CGCCAACCGG
CGCCAGGCGT TCGCGCAGCG GTTGACGCAG GTCCAGGCGG GCGCCAAGGC GAACGAGAGC
GGGCTCGACG CGGTTCAGCA GCGTCGCGAC GCGCTGGCCG GCGACCTGGC GCGCGCCGAA
TCGCAGGCCG ACGGTGTGGC CTTCGCCGAT GCGCGGCAGC GCGAGTTGCT GACCCGCATC
GACGACGTGC GTGCTGCGCT GAAGGCGCAG GCCGGCGACC CGCAGTTCGC GACGGCGCCC
GACCGGGTCC GGCTCGCCGC CGGGGCGCTG AGCTGGCAGC TGGCGCAGGA CTACCCGGCG
CGCGTGTGGG AGGCGAAGAA GGCGCTGCAG ACCATCGACA GCGAGCTGGC CGAGGCGCGC
CGCCGCGACA CCGCACTCGC CCAGGCGCAG CGCGACGAGC CGGTGCGGTT CGGGAGTTTC
GACGGGCGCA TCGCCGAGCT CGACCGCCGC ATCCAGGCGC TGATCCCGCG CGTCGCCGCG
CTGAGCCGCG AACAGCAGCA GGTGGTGCAG GACATCGCGG TGGCGGCGCT GACGCGCCAG
CAGGAGCGCC TGACGGCCTA CCTCACCCAG GCCCGCTTCG CAGTCGCTCA GCTGCATGAC
CGTGCCACCC TGGCCAAGGA GACCGACCGT GCGTCCGCGC AGTAG
 
Protein sequence
MGSRSFLSRA TKRLLATAAF AALLPAGGAL AAEPQPQRVI KDPHYGDTLF HFFQDHYFTS 
ITTLMTSQHF DRVPAHADDA EVLRGGLLLS YGLHREAGEI FAQLIEKGAS PAVRDRAWYY
LAKIRYQRGF LAEAEQAIDR VENHLPPALE DDRGLLKANL LMARGDHAGA AAVLNGMAKR
PGAGQYARFN LGVALVRSGD TAGGSALLDE IGRAPQPDEE FRTLRDKANV ALGFAALQDE
RAEAARGYLE RVRLESLHAN KALLGFGWAA AALKQPAKAL VPWTELAQRD GSDAAVLEGR
IALPYAYAEL GALGQALAGY NEAIAAYDRE AAHLNESIAA IRAGKLVEGL IDRNPGDEMG
WFWTLRELPE LPHAGHLAQV LAEHEFQEAF KNYRDLRFLS NNLQHWADNL GVFGDMLANR
RQAFAQRLTQ VQAGAKANES GLDAVQQRRD ALAGDLARAE SQADGVAFAD ARQRELLTRI
DDVRAALKAQ AGDPQFATAP DRVRLAAGAL SWQLAQDYPA RVWEAKKALQ TIDSELAEAR
RRDTALAQAQ RDEPVRFGSF DGRIAELDRR IQALIPRVAA LSREQQQVVQ DIAVAALTRQ
QERLTAYLTQ ARFAVAQLHD RATLAKETDR ASAQ