Gene Mpe_A2856 details


Gene Information

Locus tag: Mpe_A2856
Symbol:
ID: 4785550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3039356
End bp: 3041062
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 69%
IMG OID: 640091427
Product: GGDEF domain-containing protein
Protein accession: YP_001022045
Protein GI: 124268041
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
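
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3039356, 3041062)
print(length, protein_length(length))  # 1707 568
```

Under these assumptions the computed values agree with the 1707 bp and 568 aa listed in the record.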


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.592717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTGG CCTCGAACCC GGAAGCGTCC GCTGCACGTG CCGCGGAGCC CGCGGCTGCG 
GAGTTGGCCA AGGCGGCGCT GCGCCGCCTT GCGCTGGAGC GCAAGCAACC GACGCCCGAG
AACTTTGCGT GGGCCTACCG GACGGAGCGC GGTGACGCAC CGGGGCCCGC GGCGGGGGTT
GCTGCTGCCG TGCCGGACAG CGCCGAAGGC GACGGCGAAG CGTGGGCCGC TCTGTTCCAG
AGCGCCCTGC GGGGCGTGGA GCGTGGCGGC CGCAACTGGA CCGCCGCTCG ACGCAAGGAC
AGTCTGCAGC GGGTGCTCGG GGGCAGCCGC AGTGACGCCA AGCGGCTGCA GCAAAGGCTC
AAGCAGCTCA TCGCCAGCTG GGACAGTGAT GCGCTCGACG AGGCACCGAG CGATCTGGCA
CCTTTCGACG CGGCAGCACC CGTGGGCGAC GTCGCGCCGA TGGTCGTTGC AGAGGCTTCG
CCGGCGCCCG TCTCGATGGC CTGGTCGCGC GTTGCCGCCG AACTCGGTGC GACCGTTCAG
GCGGCCTTGC CGTCGAGCGA GGCGCGCAGC CGCGAGGTGG CCGAGCGGCT TGCGCAGTTG
CAGGCGCGCA TTGCTGCCGA CGGCCCCGAT GGTGTTGTCG AAGACGTGGC GCAGGCTTGC
ACCGAGGCGC AACGGGTGTT GCAGCATCGC CAGCACCTGA CGAACCAACT GGGAGGGCTG
TGCCAGGAAC TCACCGAAGG TCTGGCCGAC CTGTCCGAGG ACGATTCCTG GGTCAAAGGG
CAGTGCGAGG TGATGCGCAG TCAGCTGTTC GAGAGCGACG GTGGCCTGAC GGCAAGGGGC
GTGCGATCAG TCAGCCAACT GCTGCTCGGT ACGCGCGAGC GCCAGCAGGA ACTGCGCGCG
GCCCGCAAGG CGGCCCGAGA TGCCTTGAAG GACTCGATCC ACGAGATGCT CGACGAGATT
GCCTCGCTGG GTGCACAGAC CGGGCGCTTC TCGTCTCAAT TGGACGGGTA CGCCGACGAG
ATCGACCGTG CCGAGTCTCT TGAGAGCCTG GCCGGGACGG TGCGACAGAT GGTGGCTGAA
AGCCGGGCAG TGCACGAAGG GGTCAGCCAG GCGCAGGCCA GACTGGCTCT GGAGCACAGC
CGATCGGCCG AGATGCAGGC GCGGGTGGTC GAATTGGAGG ACGAGATCCG TCGGCTGTCG
GACGAAGTGT CCACGGACCC CTTGACGCAG ATCGCCAACC GCCGCGGCCT GCAGCAGGCC
TTCGAGACGG AGCAGGCCAA GAACCAGCGG GCTGCACAGG GTGAGGGCGC GACGCTCGCC
GTGGCGCTGA TCGACATCGA CAATTTCAAG AAGCTCAACG ATCGGCTGGG CCATGCGAGC
GGCGATGCGG CGCTGCAGTT CCTGACGCAC CGCGTCGCGC AGGCGCTGCG CCCCGGCGAC
ACACTGGCGC GCTACGGTGG CGAGGAGTTC GTGGTGCTGC TGCCGGGCAC GCCGCTCGAC
GAGGCGCAGC GCGTGCTCAC CCGCGTGCAG CGTACGCTCA GCGCGGAGCT GTTCATGAGC
GATGAGCAGG GGCAGGTGTT CGTGACCTTC TCGGCCGGCG TCAGCCTCTA CCGGCCCGGC
GAACGGCTCG AGCAATCGCT GGAGCGCGCC GACGAGGCGC TGTACGAGGC CAAGCACAGC
GGCAAGAACC GCACCTGCGC CGCCTGA
 
Protein sequence
MGVASNPEAS AARAAEPAAA ELAKAALRRL ALERKQPTPE NFAWAYRTER GDAPGPAAGV 
AAAVPDSAEG DGEAWAALFQ SALRGVERGG RNWTAARRKD SLQRVLGGSR SDAKRLQQRL
KQLIASWDSD ALDEAPSDLA PFDAAAPVGD VAPMVVAEAS PAPVSMAWSR VAAELGATVQ
AALPSSEARS REVAERLAQL QARIAADGPD GVVEDVAQAC TEAQRVLQHR QHLTNQLGGL
CQELTEGLAD LSEDDSWVKG QCEVMRSQLF ESDGGLTARG VRSVSQLLLG TRERQQELRA
ARKAARDALK DSIHEMLDEI ASLGAQTGRF SSQLDGYADE IDRAESLESL AGTVRQMVAE
SRAVHEGVSQ AQARLALEHS RSAEMQARVV ELEDEIRRLS DEVSTDPLTQ IANRRGLQQA
FETEQAKNQR AAQGEGATLA VALIDIDNFK KLNDRLGHAS GDAALQFLTH RVAQALRPGD
TLARYGGEEF VVLLPGTPLD EAQRVLTRVQ RTLSAELFMS DEQGQVFVTF SAGVSLYRPG
ERLEQSLERA DEALYEAKHS GKNRTCAA
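
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the gene is read in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGGGCGTGG CCTCGAACCC GGAAGCGTCC"))  # MGVASNPEAS
print(translate("GGCAAGAACC GCACCTGCGC CGCCTGA"))     # GKNRTCAA
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TGA codon is the stop that is not counted in the 568 aa protein length.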