Gene Mpe_A2600 details

Gene Information

Locus tag: Mpe_A2600
Symbol:
ID: 4787037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2773034
End bp: 2774083
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 65%
IMG OID: 640091171
Product: TIS1021-transposase protein
Protein accession: YP_001021789
Protein GI: 124267785
COG category: [L] Replication, recombination and repair
COG ID: [COG3039] Transposase and inactivated derivatives, IS5 family
TIGRFAM ID:
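
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(2773034, 2774083)
print(length_bp)                     # 1050
print(protein_length_aa(length_bp))  # 349
```

Both results agree with the Gene Length and Protein Length values listed in the record.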


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.90039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTGCG GCGAATTCAG CGGCGATGAT GCTGCCATGA AGCAGACGAG TTTTGCCACT 
GCCGAGTACG CCGGCAAGAA GCGCCAGACG CGCCGGGAGC GCTTCCTGGC CGAGATGAAC
GTGGTGGTTC CGTGGGCGCG GCTTGAGGCG CTGATCGAGC CGCACTACCC GAAGAGCGGC
AAGGTGGGCC GACCGCCGAT TGGCGTGCCG CGGATGCTGC GCATGTACTT CCTGCAGCAG
TGGTACACGC TGGCCGACGA GGCACTGGAA GACGCGCTGT ACGACAGCCA GGCCATGCGC
GAGTTCATCG GCATCGACCT TGGGCGGGAG AACGTACCCG ACGCCACAAC GCTGCTGAAG
TTCCGCCGCC TGCTCGAGCA GCACGACTTG ACGTCGGCCA TCCTGGCCGA GGTCAACGCG
CACCTCACCG AGCGTGGGCT GCTGATGCGC CAGGGCACGG TGGTGGACGC CACCATCATT
GCCGCGCCAA GTTCGACGAA GAACGAGGAC GGCAAGCGCG ACCCCGAGAT GCACCAGACC
AAGAAGGGGA ACCAGTGGCA CTTCGGGATG AAGATGCACT CGGGCGTGGA TGCCGAGTCG
GGTCTGATCC ACAGCGTGGT CTGCACCGCG GCCAACGAGG CTGACGTGGC GCACGCGCAC
GAACTGCTGC ATGGCCAGGA GAGCCAAGTT CACGGCGACA GCGGCTACAC CGGCATCCAG
AGGCGAGACG AGATCACGAC GGCGCAGGAA GAGGGCAGGC TGCGCCAGGA CATGGATTGG
CGTATCGCCA TGAAGCGCGG CCAACTCAAG GCCATGCCCG AAGGGCCGGC CAAGGCGATG
CACGAGTGGT TCGAACGGCG CAAGGCTCAG GTGCGGGCCA TCGTCGAACA CCCGTTCCAC
GTCATCAAGA ACCTGTTCGG CTACCGCAAG GTCAGCTACC GCGGGATCTC CAAGAACGAA
GCTCGCGCGA AGGCGCACGC TGCGCTGGCC AACTTGTACA TCGCCCGGCG CCGATTGCTG
GCCCAAGGCC TCAGTGCGTC TGCTGCATGA
 
Protein sequence
MACGEFSGDD AAMKQTSFAT AEYAGKKRQT RRERFLAEMN VVVPWARLEA LIEPHYPKSG 
KVGRPPIGVP RMLRMYFLQQ WYTLADEALE DALYDSQAMR EFIGIDLGRE NVPDATTLLK
FRRLLEQHDL TSAILAEVNA HLTERGLLMR QGTVVDATII AAPSSTKNED GKRDPEMHQT
KKGNQWHFGM KMHSGVDAES GLIHSVVCTA ANEADVAHAH ELLHGQESQV HGDSGYTGIQ
RRDEITTAQE EGRLRQDMDW RIAMKRGQLK AMPEGPAKAM HEWFERRKAQ VRAIVEHPFH
VIKNLFGYRK VSYRGISKNE ARAKAHAALA NLYIARRRLL AQGLSASAA
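
The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any downstream use. The following is a minimal sketch of how that could be done, together with a GC percentage calculation of the kind that a GC content field summarizes. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCGTGCG GCGAATTCAG CGGCGATGAT GCTGCCATGA AGCAGACGAG TTTTGCCACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Applying the same two helpers to the full gene sequence block gives values that can be compared with the Gene Length and GC content fields.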