Gene Mpe_A0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0889 
Symbol 
ID4787212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp933730 
End bp935109 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content68% 
IMG OID640089450 
Producthypothetical protein 
Protein accessionYP_001020086 
Protein GI124266082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCACC TCCTGCGTTT CGACCGCGAC TACAGCCGCC GCGCCTTCCT CGACGCCACC 
GGCAAGGGAC TGCTGCGCGC CGGCGTGTTC GGGTCGGCCT GGGCGGCCTT CCTGCGCACC
GGCTCGGTCG CAGCGGCGTA TCCGGACGAG TTGCTGTCGA TCGAGGTCTA CACGAAGGGG
CGGCTCAAGC CCGGTGACGT GATCGACGCG GGCAACGTGG AACTGGTGAA GGACCTGCTC
GACCCGGTCC GCTACCGGCA GATCGCCACG ATGGGTCGCA AGCTGCTGCT CGCGCCGACC
ACCACCGACC TGAGCCACCT GAACCCGCTG CCTTACCTGG AGGCCACGGC GCGCAACCGC
GGCAAGGCGC GCTTCGATGC CACCGGCAAC ATCGTCACCA CCGAAGGCAA GCCCTGGATC
GGCGGCAATC CGTTTCCGGA GCCCACGTCG GCGCTGGAGA TGTTCGCCGC CCACACGCTG
AGCTGGGGCC GGCACGACGT GTCGGTCTAT GCCAGCAAGG AATACGACCT CGACCCCGCG
GGCAATCTGC AGTACCAGTA CTCGTCGGTG TGGGCCGAGA TGTCCACGGT AGCGCGCACC
GTCATCGATC CCAAGCCCTA CTGGGCCGGT GAAGCGGACA AGCTGCGCTA CCAGTCGGTG
CTGTTCACCG AGCCGGCCGA TGCGCGTGGC ACCATCTTCC TCAACATCTG GGCCTACGAC
CAGAACCAGT TCCCGCAGCT CTACGGCTAC CTGCCGGCCT TCAAGCGGGT GCGCAGTTTC
CCGACCAATC AGCGCTTCGA GCCGCTGGTG GCCGGCGCCG AGCTCTACCT GTCGGACGCC
TGGGCGGCCG GCGATCCGTT CCTGACCTGG GGCAACTATC AGGTCGTGCA CCGTGGGCCG
CACCTCGCAG CGGTCTCGCG CGGATGGACG TCGACCCATC CGAACTGGGA GCACACCACC
CATGGCGGCC CGAAGGGCAA CCTGTTCTGG GACCACGCGG TCGAACTGGT GCCCGAGGTG
ATCGTCATCG AGGCCGAGCC GGTGCGTTAC CCGCGCGCGC CGGTCGGGCG CAAGCGCGTG
TGGTTCGACG CGCGCACGCT GGTGCCGTTC CAGATGGTGT CCTACGACCG TCGCGGCGAA
CTGTTCCGCC ACTTCGACGC GTCCTTCGCG TACTACGACG ACGGCAAGGC GCGCGTGATG
GACGGCAAGG AGCCCTACTG GTCCTGGGCC ACCGTGCATG CCTTCAACGT GCAGACCAAC
CGCATGACCC GCATCGAGCA GGTGCGCGAG GTGCCGGGCG GCCACTCGAT GCGCGTCAAC
GACCCCAGCG TCTACGACAA GTACCTGACC ACTTCCGCGA TGCAGCGGCT GGGCAACTGA
 
Protein sequence
MAHLLRFDRD YSRRAFLDAT GKGLLRAGVF GSAWAAFLRT GSVAAAYPDE LLSIEVYTKG 
RLKPGDVIDA GNVELVKDLL DPVRYRQIAT MGRKLLLAPT TTDLSHLNPL PYLEATARNR
GKARFDATGN IVTTEGKPWI GGNPFPEPTS ALEMFAAHTL SWGRHDVSVY ASKEYDLDPA
GNLQYQYSSV WAEMSTVART VIDPKPYWAG EADKLRYQSV LFTEPADARG TIFLNIWAYD
QNQFPQLYGY LPAFKRVRSF PTNQRFEPLV AGAELYLSDA WAAGDPFLTW GNYQVVHRGP
HLAAVSRGWT STHPNWEHTT HGGPKGNLFW DHAVELVPEV IVIEAEPVRY PRAPVGRKRV
WFDARTLVPF QMVSYDRRGE LFRHFDASFA YYDDGKARVM DGKEPYWSWA TVHAFNVQTN
RMTRIEQVRE VPGGHSMRVN DPSVYDKYLT TSAMQRLGN