Gene Mpe_A1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1944 
Symbol 
ID4786705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2080492 
End bp2081613 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID640090514 
Producthypothetical protein 
Protein accessionYP_001021137 
Protein GI124267133 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.085708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACGG TCCGCCGTCT CCTGTATGCC GACGTCCTGG GCGCGGTGAC CTTCGTCGCC 
GTGTCCTTCC TGTCGCTGTT CTTCTTCATC GACTTCGTGG AGGAGCTCGA CGACATCGGC
CGTGGCGCCT ACCGCGTGCA CCATGCGGCC CTGTACTGCC TGCTGGAGCT GCCGGGGCGC
CTGTACGAAC TGCTGCCGAT CGCGGTGTTG ATCGGCACCA TCTACGCGAT GGCCAGGCTG
GCGCAGTCTT CAGAGTTCAC CATTCTGCGC ACCGGCGGGC TGGGTCCGGG CCGGGCCCTG
TCGCTGCTCG CCAAGATCGG ACTGGCGTTC GCGGTGCTGA CCTTCGTGGT CGGCGACTAC
GTCGGCCCCT ATTTCGATGC CAAGGCGCAG ACGCTCCGTT CCACGCTGCG CGGCTCGGCC
TCCGGCGGCG GCAACAACAG CGCCTGGCTG AAGGACCGCC GCGCCGCCGC GCCGGGGGAG
CTGCCCGCCG GAGAGCGCAT CGATTCGATC AACATCGGCA ACGTCGGGCC CGACGGTCTG
CTCGACGACG TGCGCATCTA CGAGTTCAGC GAGGAAGGCC AGCTGCTGGC GCGTGTGGCC
GCGGAGCATG CCGTGGTCGA AGACGGTGCC TGGCGCCTTA AAAAGGTGCG TCTCACGCGC
TGGCATGCGG CGAGCGGCGA CGGCCTGCCG GTCGACGAGC GGCGCGACGA GCTGCGTTGG
CCGACCCGCT TGACGCCTTC GGTGGTTGGT GCGGCCGTCT CGCCGCTCAA GAGCATGTCG
ACCGTCGATC TCTACCGCTA CATGAGCCAT CTGTCCCAGA ACGAGCAGGC CGCGCAGCGC
CAGGAGATCC AGTTCTGGAA GAAGGCGCTG TACCCGCTGG CCTGCCTGGT GATGGTGGGC
CTGGCCCTGC CGTTCGCCTA CCTGCACGCG CGCGCCGGTG GCGTCAGCGT CAAGGTGTTC
GGCGGCATCC TGCTGGGCAT CAGCTTCGTG CTGCTGAACA ACGTGTCGAC CCACCTCGGG
TTGCTGCGCG ATTGGACGCC CTGGATCGCA GCCGCAGCAC CCGGCGCTTT CTACCTGCTG
CTGTCGATGG CTGCCTTCAG CTGGCTCGTG CGCTACCGGT GA
 
Protein sequence
MRTVRRLLYA DVLGAVTFVA VSFLSLFFFI DFVEELDDIG RGAYRVHHAA LYCLLELPGR 
LYELLPIAVL IGTIYAMARL AQSSEFTILR TGGLGPGRAL SLLAKIGLAF AVLTFVVGDY
VGPYFDAKAQ TLRSTLRGSA SGGGNNSAWL KDRRAAAPGE LPAGERIDSI NIGNVGPDGL
LDDVRIYEFS EEGQLLARVA AEHAVVEDGA WRLKKVRLTR WHAASGDGLP VDERRDELRW
PTRLTPSVVG AAVSPLKSMS TVDLYRYMSH LSQNEQAAQR QEIQFWKKAL YPLACLVMVG
LALPFAYLHA RAGGVSVKVF GGILLGISFV LLNNVSTHLG LLRDWTPWIA AAAPGAFYLL
LSMAAFSWLV RYR