Gene Mpe_A3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3809 
Symbol 
ID4785920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp4027008 
End bp4028030 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content71% 
IMG OID640092392 
Productarabinose-5-phosphate isomerase 
Protein accessionYP_001022997 
Protein GI124268993 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.127508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCCAC CCTCCGCTTC TTCTTCCTCT TCCTCCTCGT ATTCCCCGCA GCGCAGCGTC 
GAGATGGGCG CGCAGGCGCT GGCGGTCGAG GCGCAGGCGC TGGGCGCACT GCAGCAGCGC
ATCGTCGGGC CGATGGCCGA CGCGTTCGCA CGGGCGGTCG CGGCCATGCT GGTATGCCGC
GGCCGCGTGG TCGTGATGGG CATGGGCAAG AGCGGCCACG TGGGCCGCAA GATCGCCGCG
ACACTGGCCT CGACCGGCAC GCCGGCGATG TTCGTGCACC CTGCCGAGGC GAGTCACGGC
GACCTGGGCA TGGTGACCCC GTCCGACATC GTGCTGGCGA TCTCGAACTC CGGCGAGAGC
GACGAGCTGG CGGCCATCCT GCCGGTGCTC AAGCGGCTGG GCGTCATGCT GATCGCGATC
ACCGGCCGGG CCGACTCCAA CCTCGCGCGC CATGCCGAGC TGGTGCTCGA CAGCGCGGTC
GCACAGGAGG CCTGTCCGCT GAACCTGGCA CCGACGGCCA GCACCACCGC GCAGATGGCG
CTGGGTGACG CCCTCGCCGT CGCGCTGCTC GATGCCCGCG GCTTCAAGGA GGAAGACTTC
GCGCGCTCGC ATCCTGGCGG TTCGCTGGGG CGCAAGCTGC TGACGCACGT GCGCGACGTG
ATGCGCGGCG GCGACGCGGT GCCGAGCGTG GGGCCGGCAA CGGCGTTCAC CGACCTGATG
CGCGAGATGA GCGCGAAGGG CCTGGGCGCC ACAGCGATCG TCGATGACGC CGGCCGCGTG
CAGGGCATCT TCACCGACGG CGACCTGCGC CGCCTGATCG AGAAGGGCGG CGACCTGCGC
GCGCTGACGG CCGCGGAGGT GATGCATCCG GCGCCGCGCA CGGTGCGCGA CGACGCACTG
GCCGTCGATG CCGCCGACCT GATGGAGACG CACCGCATCA CCAGTGTGCT CGTGGTCGAT
GCCCAGGGCG TGCTGGTCGG TGCGCTGAAC ATCAACGATC TGCTGCGCGC GAAGGTCATC
TGA
 
Protein sequence
MTPPSASSSS SSSYSPQRSV EMGAQALAVE AQALGALQQR IVGPMADAFA RAVAAMLVCR 
GRVVVMGMGK SGHVGRKIAA TLASTGTPAM FVHPAEASHG DLGMVTPSDI VLAISNSGES
DELAAILPVL KRLGVMLIAI TGRADSNLAR HAELVLDSAV AQEACPLNLA PTASTTAQMA
LGDALAVALL DARGFKEEDF ARSHPGGSLG RKLLTHVRDV MRGGDAVPSV GPATAFTDLM
REMSAKGLGA TAIVDDAGRV QGIFTDGDLR RLIEKGGDLR ALTAAEVMHP APRTVRDDAL
AVDAADLMET HRITSVLVVD AQGVLVGALN INDLLRAKVI