Gene Mpe_A1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1964 
Symbol 
ID4784750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2102353 
End bp2104314 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content72% 
IMG OID640090534 
Producthypothetical protein 
Protein accessionYP_001021157 
Protein GI124267153 
COG category[S] Function unknown 
COG ID[COG4121] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03197] tRNA U-34 5-methylaminomethyl-2-thiouridine biosynthesis protein MnmC, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.117707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGG CGCCGATCAC GCCAGGCCGC CTGGCCTTCT CACCCGACGG CGTGCCGCTG 
GCGCCGGAGT TCGGCGATGT CTACCACCCG GCGGCGGGCG CGCTGCAGCA GGCGCACCAC
GTCTTCCTCG GCGGTAACCG GCTGCCGGCG CGCTGGGGCG GTCGCGGGCG CTTCGTGATC
CTCGAGACCG GCTTCGGCCT GGGCAACAAC TTCCTCGCCA CCTGGGACGC CTGGCAACGT
GACCCGCAGC GCTGCGAGCG GCTGGTGTTC GTCTCGATCG AGAAGCATCC GCTGACGCGC
GAGGACCTGG CTCGTGCGCA CGCGGCTTCA CCGCTGCCCG AATTGGCCCG TGCACTGGTT
TCGGCCTGGC CCTTGTCGAC GCCGAATTTG CATCCGATCG CCTTCGAAGG CGGTCGCGTG
CAACTGCTGC TGGGTTTCGG TGACGTGGCC TTGCTGCTGC CGCAATTGGT GGTGTCGGTC
GACGCCTTCT TCCTCGACGG GTTCGCCCCG GCCCGCAATC CTGAGATGTG GGAGCCGCGC
CGGCTTCAGC GCCTGGGCCG GCTCGCCGCG CCCGGCGCAA CCGCGGCGAC CTGGAGCGCT
GCGCGGGTCG TGCGTGACGG GCTCTCGGCA GCCGGCTTCA CGGTGGAAAC CACAGCCGGT
ACCGGCGGCA AGCGCGACAT CACGGTGGCG CGATTCACAC CAAGGCACAT CGCGGTGCCC
CCGCCCGGTG GCTGGCATGC GCACGACGCG GCCTCACGCG AAGCCTTGGT GATCGGCGCC
GGTCTGGCCG GCTGCGCGGC AGCCTGGGCG TTGTCGCAAC AGGGCTGGCA GTGCCAGCTG
CTGGATCGTG CGGCGGAGCC GGCCGACGTC ACGTCTGGCA ATCCGGCCGG CCTGTTCCAC
GGCAGCTTCC ACCGCGACGA CGGTCCGCAT GCCCGCACAC TGCGAGCCGC GGCACTGGCG
ACCGAACGCC TGGCCGGCGC GTGGATCGCG CAGGGCCGAG TGTCCGGCCA GCTCGCTGGC
TGCCTGCGCC TCGAATCGAG GTGGTCGGAC GACGCGGCGC GCGCGGCAAT GGCGGCCCAG
CAGATCGCCC CCGGCTATAT CGACTGGATG GACCGGGCCG TCGCGAGCAC GCTCTCCGGC
CTCGCCCTGC CGAGCGGCGC CTGGTTCTAC CCGGGCGGCG GCTGGCTCGC GCCGCGCGAC
TATGCGCGCG AACTGCTTGC GCGCAGTGGC TCCCTCTTTC GGGGCGGCAT CGACGTGGCG
ACCATCGAGC GGCATAGCGG CTTGTGGCGC GTGCTCGACG AACAGCGCCA GGTGATCGCC
GAAGCACCGG TGCTGGTGCT GGCCAACGGG CTCGGTGCGA ACGGCCTGCT GGCCTCCGGC
CGCGGTGAGG TGCCGTGGCC GCTGACGGCG GTGCGCGGAC AGATCAGCAG CCTGGCGACC
GACGGTCACC CCGCCACCCT GCCCTGCCCG CGCCTGCCGG TCGCCGGCGG CGGCTATGTG
CTGCCGCAGA CGGGCGGTCG GCTGCTCTTC GGCGCCACCA GCCAGCCCGA CGACATCGAT
CCCGCGCTGC GCGATGCCGA CCACCGATTC AATCTGCAAC AGCTCGCAGG ACTGTCGGGC
TGCGACGTCG AAGCCTGGTC CAGCCTGCCC TGGCAGGGCC GCGTGGGGTG GCGCGCAGTG
ACGAGCGATC GGCTGCCGCT GATCGGTGCA GTGCCCGACC TGGAGGCGCT GGACCGCACT
TCGCGCGCCG ACCAGCCGCG CTTCGTGCCG CGCCAGCGCG ACGCGCGAGG CGGTCTCTAT
GTCTTCACCG GCCTCGGTTC GCGCGGCATC ACCTGGGCCG CGCTCGGTGG CCAGTTGCTG
GCCTCGTGGA TCAGCGGCGC GCCCTGCCCG CTCGAAGCCG ATCTGCGCGA CGCGCTCGAC
CCCGCGCGCT ACGCGCTGCC GCGCTGGCGC AGCGACTCGT AG
 
Protein sequence
MKTAPITPGR LAFSPDGVPL APEFGDVYHP AAGALQQAHH VFLGGNRLPA RWGGRGRFVI 
LETGFGLGNN FLATWDAWQR DPQRCERLVF VSIEKHPLTR EDLARAHAAS PLPELARALV
SAWPLSTPNL HPIAFEGGRV QLLLGFGDVA LLLPQLVVSV DAFFLDGFAP ARNPEMWEPR
RLQRLGRLAA PGATAATWSA ARVVRDGLSA AGFTVETTAG TGGKRDITVA RFTPRHIAVP
PPGGWHAHDA ASREALVIGA GLAGCAAAWA LSQQGWQCQL LDRAAEPADV TSGNPAGLFH
GSFHRDDGPH ARTLRAAALA TERLAGAWIA QGRVSGQLAG CLRLESRWSD DAARAAMAAQ
QIAPGYIDWM DRAVASTLSG LALPSGAWFY PGGGWLAPRD YARELLARSG SLFRGGIDVA
TIERHSGLWR VLDEQRQVIA EAPVLVLANG LGANGLLASG RGEVPWPLTA VRGQISSLAT
DGHPATLPCP RLPVAGGGYV LPQTGGRLLF GATSQPDDID PALRDADHRF NLQQLAGLSG
CDVEAWSSLP WQGRVGWRAV TSDRLPLIGA VPDLEALDRT SRADQPRFVP RQRDARGGLY
VFTGLGSRGI TWAALGGQLL ASWISGAPCP LEADLRDALD PARYALPRWR SDS