Gene Mpe_A1674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1674 
Symbol 
ID4785755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1799761 
End bp1800903 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content66% 
IMG OID640090243 
Producthypothetical protein 
Protein accessionYP_001020871 
Protein GI124266867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.252915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCATGG TCGAAGGCAT CGAGATGTAT GTGGCGCGCC TGGGCGATGG CTATATCGTC 
AAGCGCATGC CGGACACGGG AAGCCACCAC GCGCCCGACT GCCCGTCCTA CGAACCGCCG
GCCGAGTTCT CCGGCCTGGG GCAGGTACTG GGCAGCGCGA TCACCGAGGA CCCGGCCACC
GGCGAGACGA CGCTCAACCT GGACTTCCCG CTGACCAAGA TGCCCGGCCG GTCGACGATC
CCTCCCACTG GCGGAGAGGG CGATAGCGTT TCCTCCACCG GGACCAAGCT CTCACTGCGC
GGCCTGCTGC ACTACCTGTG GGACCAGGCC GAGCTGACGC GCTGGCATCC CGGCTTCGTT
GGCAAGCGAA CCTGGGCGAC AGTGCGAAGG CACCTGCTCC ACGCAGCCGA GCACAAGCTC
GCCCGCGGCG ACGCCCTGCG CGCCCGGCTC TACGTGCCGG AACCGTTCTT TATTGAAGAG
CGCGACGCGA ACAATGCCCG CCGCCTGGCG CAGTGGCAAA GCGCCGTGCC TGCGCCCGGA
AAGGAACAGC AGCTGATGCT GCTGATCTGC GAGGTCAAGG AGATCGTGCC AGCACGCTAC
GGCTTCAAGG CGATCGTGAA GCACCTGCCC GACCAGGCCT TCGCAATCGA CGAGCAGCTG
TACAGGCGGC TCGGCCGGCG CTTCGAGTCC GAACTGGCGC TGTGGGGTGC CAGCGACGAC
ATCCGCATGG TGATGATCGC CACCTTTGGC GTGAGCAGCG CCGGCGTTCC GGCGATCCAC
GAGCTGTCCC TGATGCCAGT GACGCGGCAA TGGCTGCCGG TCGAAGATGG GTTCGAGAAG
CAGCTCCTGG ACAAGCTCGT CGGTGAGAGC CGCGCCTTCG TGAAGGGCCT GCACTACAAC
CTCGGAAAGA AGGACAGGAT CGCGAGCGCC GCGCTCACCG ACTGCGAAGG GTCGGCGCCG
ATGCTGTTCA TCGTTCCCGC CGGCTTCGAC GAAGCTGTGC CTGTCTACGA GACTGGCCAA
CCGAGCTGGA TATGGTGTCG GTCCAGCGAG TCGATGCCTT CTTTCCCGCC ACCTCGCACC
AATTCGCACA GGGCGGCGCC CACGCACGCA GAACGAGCAA CCTGGCCAGC TCACGCAGGA
TGA
 
Protein sequence
MCMVEGIEMY VARLGDGYIV KRMPDTGSHH APDCPSYEPP AEFSGLGQVL GSAITEDPAT 
GETTLNLDFP LTKMPGRSTI PPTGGEGDSV SSTGTKLSLR GLLHYLWDQA ELTRWHPGFV
GKRTWATVRR HLLHAAEHKL ARGDALRARL YVPEPFFIEE RDANNARRLA QWQSAVPAPG
KEQQLMLLIC EVKEIVPARY GFKAIVKHLP DQAFAIDEQL YRRLGRRFES ELALWGASDD
IRMVMIATFG VSSAGVPAIH ELSLMPVTRQ WLPVEDGFEK QLLDKLVGES RAFVKGLHYN
LGKKDRIASA ALTDCEGSAP MLFIVPAGFD EAVPVYETGQ PSWIWCRSSE SMPSFPPPRT
NSHRAAPTHA ERATWPAHAG