Gene Mpe_A3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3478 
Symbol 
ID4786296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3689009 
End bp3690085 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content71% 
IMG OID640092058 
Productnitrilase 
Protein accessionYP_001022666 
Protein GI124268662 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.42672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCCCC CTCGCACCGT CCGCGCTGCC GCGGTCCAGA TCGCGCCCGA TCTCGAGCGG 
CCCGAGGGCA CGCTCGAGCG CGTGCTCGCG GCGATCGACG AGGCGGCCGG GCGCGGCGCG
GGGATCGTGG TGTTCCCCGA AACCTTCGTG CCCTACTACC CCTACTTCTC GTTCGTGCTG
CCGCCGGTGC TGCAGGGCGC GCCGCACCTG CGGCTGATGG AGCACGCGGT GGTGGTGCCC
GGGCCGGTCA CGCAGGCGGT CGGCGAGCGG GCCCGCGCGC GCGGCATCGT CGTCGTGCTC
GGCGTCAACG AGCGAGACCA CGGCAGCCTC TACAACACCC AGCTGGTGTT CGACGCCGAC
GGTGCGCTGA TCCTGAAGCG CCGCAAGATC ACCCCGACCT ATCACGAGCG CATGGTCTGG
GGCCAGGGCG ACGGCGCCGG GCTGAAGGTG GTGGACAGCG CGGTCGGCCG CGTCGGCGCG
CTGGCCTGCT GGGAGCACTA CAACCCGCTG GCGCGCTACG CGCTGATGAC GCAGCACGAG
GAGATCCACT GCGCGCAGTT TCCCGGCTCG ATGGTCGGGC AGATCTTCGC CGACCAGATG
GCGGTGACGA TTCGCCACCA CGCGCTGGAG TCGGGCTGCT TCGTCGTCAA CGCCACCGGC
TGGCTGACCG ACGCGCAGAT CGCCGCGATC ACGCCCGACG CCGGCCTGCA GAAGGCGCTG
CGCGGCGGCT GCCACACCGC CATCGTCTCG CCCGAGGGCA AGGACCTGTG CACGCCGCTG
ACCGAGGGCG AGGGCATCGT CTATGCCGAC CTCGACATGG CGCTGATCGC CAAGCGCAAA
CGCATGATGG ACTCGGTGGG CCACTACGCG CGCCCCGAGC TGCTGAGCCT CCTGATCGAC
GACCGCCCGG CCACGACCTC GACGTCGATG ACCGCGGCCG CCCTTGCCCC TGCCGTTCCC
GCGACCTTCC GGAGTTCCTC CCATGAGCAC GCCGCCCCTC AGCCCCGCCA CGCCCCTGTC
GCCGGAGAGC CGCCGGCTGA TGACCGAGCT GCAGTCCTTC GGGTTGCGGC TGGCTGA
 
Protein sequence
MSPPRTVRAA AVQIAPDLER PEGTLERVLA AIDEAAGRGA GIVVFPETFV PYYPYFSFVL 
PPVLQGAPHL RLMEHAVVVP GPVTQAVGER ARARGIVVVL GVNERDHGSL YNTQLVFDAD
GALILKRRKI TPTYHERMVW GQGDGAGLKV VDSAVGRVGA LACWEHYNPL ARYALMTQHE
EIHCAQFPGS MVGQIFADQM AVTIRHHALE SGCFVVNATG WLTDAQIAAI TPDAGLQKAL
RGGCHTAIVS PEGKDLCTPL TEGEGIVYAD LDMALIAKRK RMMDSVGHYA RPELLSLLID
DRPATTSTSM TAAALAPAVP ATFRSSSHEH AAPQPRHAPV AGEPPADDRA AVLRVAAG