Gene Mpe_A2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2213 
Symbol 
ID4784818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2364808 
End bp2366733 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content68% 
IMG OID640090781 
ProductGGDEF domain-containing protein 
Protein accessionYP_001021404 
Protein GI124267400 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.658691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.864608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTACC GAAATCGCCG TGTCGCGGAC CGTCCCCAGC GGATGTGTCC CGACGCGATT 
TCAATGACCG AAAACTCCTC CGCCCGCTCA CCCCTCACGT CGTCGGTCAC GCTCCACAGC
GCGCTGCTGG CGGCGGCCGT GCTGTTGCTC GCCGTGCTCG GCGTCGTGAC ACAGCTGAGT
CTCGACACCG CCCGCCGCAA CGAGGTGCAG CGCGCACTGC GCGATACCGA ACGCAACACG
CAGAAGCTGG CCGTGCGCGT CGGCGAAGTG CTCGACCGTG TCGACCAGAC CACGCTGCTG
GTCAAGTCGC TGCACGAGAC CGGCAACCCG ACGAGCCTGT CCGGGCTGCG CGCCGCCGGA
CTCGTGACGC TGGACACGAC CCGCGCCCTG CTCATCACCG ATCGACATGG CGTCGTGCAG
GAAAGCACCT CGCCCGACGT TGCGCTGAAC GTCGCCGACG AAGACGATTT CAAGCGTCAC
CTCCACGATC CCGTCCTCGG TCTGAGCATC GGCGCGCCTC AGCCCGATCA TCTGAATGGC
GGTTGGATGC TGCCGGTGAT GCGCCGCCTG ACCCACGACG GCCAGTTCGA CGGCGTCGTC
GTCGCGATGC TCGATCCCGG CTCTCTGACC AAGGGCTTCG ATCATGGCGA GGCGCCGGGC
ACCGTGATCA CGGTGATCGG CCTCGACAAC ATCAACCGCT CACGGCGTCT GGACGGCAGC
ATCAGTTTCG GCGAGAAGGT CGACGCCCAG AAGGTGCTGC AGCGCTCGCG CGAGATCCGC
GAGACGCTCC AGCCCTTCTA CAGCCAGGTC GACGGCACGG CCCGCTTCTT CACGGCGCTG
CCGATCGATC GCTATCCGAT GGTCGCCGTC GTGGCCGTCT CGGCCGACGC CGTGACAGCC
GGCTATCAGC AGACGCGGCT TCGTCTCCTC GGCTGGTCCG CGGCCGTCGC ACTGCTGATC
GTGTTCGGAA CGCTGGTGCT GTGGCGGCAG GCCCGCGGGC TCGACAGCAG TCGGCGCGAG
GCCCGTCGGG CCAAGGCGCT CTACGTCGCC ACACTCGACG GCAGCCTGGA TGCACTGTGG
CTGATGCGTG CCGAGCGGGA TGCGTCGGGG CAGACACACG ATTTCGTGAT CACCGACGCC
AACCGCCGTG CGGGCGCCAT GCTCGGGCTG GACCCCGCTG CGATGATCGG GCGTCGCGCG
ACCGAGTTGG TGCCGTCGAT CCGCGAAGAT GGCCTGCTGA ACCTGCTGCT CACGGTGCTG
CAACGGCAGA AGCCGATGGA TGTCGAGGCC CAGGCCGTCG CGACCTCGAT GCGCGGACGC
TGGATGCACT TCCAGGTCGT CCCGGTCGAG GAGGGCGTGG CGCTGATCAC GCGCGACATC
GACGACCGCA AGCGCGCCGA AGCACAGCTC GCCGACATTG CACGCCGGGA TGCCTTGACG
CAGTTGCCGA ACCGCCGTCA TTTCGAGGAA CAGCTCGAAC TGGCCGCTGC CCGCGCTCAA
CGCAGTGGTC GGCCGATGGC CCTCGTCTAC CTCGATCTCG ACGGCTTCAA GCGCGTCAAC
GACACGCTCG GTCACGAAGC GGGGGATCGG TTGCTGATCT CGGTCGCGCT GCGCCTGACA
GCCTGCGTGC GTGTGACCGA TCTGGTCAGC CGGCTCGGGG GTGACGAGTT CACCGTGATC
CTGGAAGAGT CGGGAACGGC CGAGGATCGT CTTCAGCTGT GCGAGCGCAT CCTCGCCCAG
CTGTCCGAAC CACACGTGTT GGCCGGCCAG GCAACCGTGT CGACGCCGAG CCTGGGCATG
GCCGTCTACC TGCCGGGTGA ATCGCTCGAC AGCCTGCGCA AGCGTGCCGA CGGCGCGATG
TACGACGCCA AGCGGGCCGG CAAGGCCTGC CTGCGGATCG CGGCCTCTGC CAGCACGGCG
AGTTGA
 
Protein sequence
MRYRNRRVAD RPQRMCPDAI SMTENSSARS PLTSSVTLHS ALLAAAVLLL AVLGVVTQLS 
LDTARRNEVQ RALRDTERNT QKLAVRVGEV LDRVDQTTLL VKSLHETGNP TSLSGLRAAG
LVTLDTTRAL LITDRHGVVQ ESTSPDVALN VADEDDFKRH LHDPVLGLSI GAPQPDHLNG
GWMLPVMRRL THDGQFDGVV VAMLDPGSLT KGFDHGEAPG TVITVIGLDN INRSRRLDGS
ISFGEKVDAQ KVLQRSREIR ETLQPFYSQV DGTARFFTAL PIDRYPMVAV VAVSADAVTA
GYQQTRLRLL GWSAAVALLI VFGTLVLWRQ ARGLDSSRRE ARRAKALYVA TLDGSLDALW
LMRAERDASG QTHDFVITDA NRRAGAMLGL DPAAMIGRRA TELVPSIRED GLLNLLLTVL
QRQKPMDVEA QAVATSMRGR WMHFQVVPVE EGVALITRDI DDRKRAEAQL ADIARRDALT
QLPNRRHFEE QLELAAARAQ RSGRPMALVY LDLDGFKRVN DTLGHEAGDR LLISVALRLT
ACVRVTDLVS RLGGDEFTVI LEESGTAEDR LQLCERILAQ LSEPHVLAGQ ATVSTPSLGM
AVYLPGESLD SLRKRADGAM YDAKRAGKAC LRIAASASTA S