Gene Mpe_A1952 details


Gene Information

Locus tag: Mpe_A1952
Symbol:
ID: 4784738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2088678
End bp: 2089769
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 67%
IMG OID: 640090522
Product: hypothetical protein
Protein accession: YP_001021145
Protein GI: 124267141
COG category: [R] General function prediction only
COG ID: [COG3173] Predicted aminoglycoside phosphotransferase
TIGRFAM ID:
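
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both are assumptions about how this record reports its fields, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2088678
end_bp = 2089769
gene_length_bp = 1092
protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks hold: 2089769 - 2088678 + 1 = 1092, and 1092 / 3 - 1 = 363.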


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.126845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCCC AGTCCGGAAC GAAACCCGTC ACCTCGCAGC ACGCATTCGA CGTTGGCGCA 
CTGCAGTCCT ATCTCGAATC CGCGCTCGAC GGCTTCCGCG GTCCGCTCAG CGTCGAGCAA
TTCAAGGGCG GCCAGTCGAA CCCGACCTTC AAGCTGCTCA CCCCGCAGCG CAGCTACGTG
ATGCGCAGCA AGCCGGGGCC GGTCGCGAAG CTGCTGCCGT CGGCACATGC GATCGAACGC
GAGTTCACGG TGATGCGAGC ACTCGCCGGA AGCGAAGTAC CAGTGGCCCG GATGCACCTG
CTGTGCGAGG ACGAATCGGT GATCGGCCGG GCCTTCTACG TGATGGAGTA CGTGGAGGGC
CGCGTGCTGT GGAGCCAGGC ACTGCCCGAC ATGACCACCG TGCAGCGTGG TGAGATCTAC
GACGAAATGA ATCGCGTGAT CGCGGCGCTG CACCGTGTGG ACTACGCCGC TTGCGGTCTG
GCCGGCTACG GCAAGCCCGG CAACTACTTC GAGCGTCAGA TCGGACGCTG GAGCCGCCAG
TACCAGGCCT CTCTGGGTCC CGGCGGACCG GAGCCGATCG ACGCAATGGA GCGCCTGATC
GACTGGCTGC CGGATCACAT CCCGGCCAGC GCACGCGATG AATCCCAGAC CCGCATCGTT
CACGGCGACT ACCGGCTCGA CAACCTGATC TTCCACCCGA GCGAGCCGCG CATCGTCGCG
GTGCTCGACT GGGAACTGTC GACGCTGGGA CACCCTCTGG CCGACTTCAG CTACCACTGC
ATGAGCTGGC ACATCTCGCC CGGCACCTTC CGCGGCATCG GCGGGCTCGA CGTCGCGGCG
CTCGGCATCC CGACCGAAGC CGAGTACATG CAGCGCTATT GCGAACGCAC CGGCCGCGGC
GGCACCGACG CGCTGGTGAC AGACTGGAAC TTCTACCTGG CATACAACCT GTTCCGCATG
GCCGGCATCC TGCAGGGCAT CGCCAAGCGT GTGCTGGACG GCACCGCCGC GAGCGAGCAG
GCCCGACAGG CCGCTGCCGG CGCTCGACCG CTAGCCGAAC TGGGCTGGGC GATCGCACAA
CGGCAGCGCT GA
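
The GC content field in this record is a percentage of G and C bases over the gene sequence. As a minimal sketch of how such a figure can be computed from the space-separated blocks shown above, the helper names `clean_sequence` and `gc_percent` below are hypothetical and not part of any database tooling. The example call uses only the first row of the gene sequence, so its result is not expected to equal the whole-gene value listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGAAGCCC AGTCCGGAAC GAAACCCGTC ACCTCGCAGC ACGCATTCGA CGTTGGCGCA"
print(f"{gc_percent(first_row):.1f}% GC over {len(clean_sequence(first_row))} bases")
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the GC content reported in the Gene Information section.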
 
Protein sequence
MEAQSGTKPV TSQHAFDVGA LQSYLESALD GFRGPLSVEQ FKGGQSNPTF KLLTPQRSYV 
MRSKPGPVAK LLPSAHAIER EFTVMRALAG SEVPVARMHL LCEDESVIGR AFYVMEYVEG
RVLWSQALPD MTTVQRGEIY DEMNRVIAAL HRVDYAACGL AGYGKPGNYF ERQIGRWSRQ
YQASLGPGGP EPIDAMERLI DWLPDHIPAS ARDESQTRIV HGDYRLDNLI FHPSEPRIVA
VLDWELSTLG HPLADFSYHC MSWHISPGTF RGIGGLDVAA LGIPTEAEYM QRYCERTGRG
GTDALVTDWN FYLAYNLFRM AGILQGIAKR VLDGTAASEQ ARQAAAGARP LAELGWAIAQ
RQR
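
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to reproduce that mapping without external libraries; it builds the codon table from the compact amino-acid string used in NCBI's genetic code listing (table 11 assigns the same amino acids as the standard code and differs in which codons may act as starts). The function name `translate` is hypothetical, translation stops at the first stop codon, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino-acid string
# for the standard/bacterial code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final 12 bases of the gene sequence from the record.
print(translate("ATGGAAGCCC AGTCCGGAAC GAAACCCGTC"))  # MEAQSGTKPV
print(translate("CGGCAGCGCT GA"))                     # RQR
```

The two outputs match the first ten and last three residues of the protein sequence above; the final TGA codon is the stop and contributes no residue.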