Gene Mpe_A2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2551 
Symbol 
ID4785212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2719427 
End bp2720386 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content73% 
IMG OID640091119 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001021739 
Protein GI124267735 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.138061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.668245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA TCTTTCGGTC TGAGCCGCCG GCGCCCCTGC CGCGCCCGGC CGACGCCCTG 
CACGGGCCGG TCGAGCTGGA GGTCGTGCAC GTCACCGAGT ACCGTTATTC CAGCCCGGTC
GAGCTGGCGC AGCACATTGC CGTGCTGCGG CCGCGTGACG ACGCCGCACA ACAACTGCTG
GGCTTCGAGA TGGAGATCGA GCCGGCGCCG GCGCAGCAGC ACGACGACCG CGACGTCTAC
GGCAACAGCC GCCGCGTGTT CACGCTGACG GCTTCGCACC AGCACCTGCG GGTCTGCGCC
ACCAGCCGCG TGCGTGCCCA CGCGCCCGCG CGCTCCGACG CCGCGGCGCT GCCCTGGGAG
GCCTGCCGCG CCCACTGGCG CTACGAGTCG GGCCGCCCGC TGGACGCGGC GGTCGAGTTC
ACCTTCCCGT CCACGCTGGT GCCGCACCAC GCGGCGCTGC GCGACTGGGC CCTGCCGTCC
TTCCCGCCGG GCCGCGGCAT CGACGAGGCG GCCACCGAGC TGATGCACCG GCTGCACGCC
GATTTCACCT ATGCGCCGCA CAGCACCGAG GTCGGCACCC CGGTGCTGCA GGCCTTCGAG
CAGCGGCGTG GCGTGTGCCA GGACTTCGCC CACGTGATGA TCGGCAGCCT GCGGGCGCTG
GGCCTGTCGG CCCGCTATGT CAGCGGCTAC CTGCTGACCG AGCCGCCGCC GGGCCAGCCG
GTACTGCAGG GTGCCGATGC CTCGCATGCC TGGGTGGCGG TGGCGGTGCC GCGCGAGGAC
GGCTCGGCCG ATTGGCTGGA ACTCGACCCC ACCAACGACT GCGAGGCCGG TCTGACCCAT
GTGCGCCTCG CACTGGGGCG CGACTATGCC GACGTGACGC CGCTGTGCGG CGTGATCCGT
GGTGGCGGCC GCCACCAGCT CGACGTGCGC GTGGGGACGC GCGTGGTGCC TGCGGCATGA
 
Protein sequence
MNSIFRSEPP APLPRPADAL HGPVELEVVH VTEYRYSSPV ELAQHIAVLR PRDDAAQQLL 
GFEMEIEPAP AQQHDDRDVY GNSRRVFTLT ASHQHLRVCA TSRVRAHAPA RSDAAALPWE
ACRAHWRYES GRPLDAAVEF TFPSTLVPHH AALRDWALPS FPPGRGIDEA ATELMHRLHA
DFTYAPHSTE VGTPVLQAFE QRRGVCQDFA HVMIGSLRAL GLSARYVSGY LLTEPPPGQP
VLQGADASHA WVAVAVPRED GSADWLELDP TNDCEAGLTH VRLALGRDYA DVTPLCGVIR
GGGRHQLDVR VGTRVVPAA