Gene Mext_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3804 
Symbol 
ID5832210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4223977 
End bp4225773 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content66% 
IMG OID641369596 
ProductEAL domain-containing protein 
Protein accessionYP_001641249 
Protein GI163853206 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0052374 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGAA CGGATTGCCG CGAGCGTCTC GAAGAAGATC GTTTGAATGC GCTCCAGGCA 
CTCAACCTCC TCGACACGCC GCCGAGCGAG AGCTTCGACC GCATCACACG GATGGCCAGC
CAGATCTTCA ACCTGCCAAT CTCGGCGGTG TCGTTGACGG ATCGCGACCG GCAATGGTTC
AAGTCGCGCA TCGGTGTCGA TCACTGCTCC ATCCCACGGG ATAAGGCGCC GTGCGGGCAG
GTCGCCGAAA GCGCCGAAGT CCTCGTGATC CCCGATTTCG CGCAGGACGC CTTCTACGCC
GACAGCGTGC TCGGCCGCTC CGGCATCCGC TTCTATGCCG GCGCACCGCT GGTCACGCGC
GAGGGCTACA GCCTGGGTGC CCTCTGTGTC CTTGGAACCG AGCCCCGCAC GGTCTCTCCG
GCCGAGATCG CCGCCCTCAC GGACCTCGCC GCGATGGTCA TGGCGCAGAT CGAGTTGCAG
CACGCTTTCG GCCGGGTCGA TCCGTTGAGT GGCCTGCCCA GCCGGAACCA GTTCCTGGAC
GACCTCGCCG ATCTCGCCGC CGAGCATCCG GATGAGGCCA GGATCGCTGT GCTCATCGAT
CTCGGCCGAC CCGAGCAGGT CAGTGCCTAT AGCCGCGTCA TGGGTCCGGG CCGCGTTGAC
GATCTGGTGC GGGAGGCCGC GCGGGAGTTG CGGCGCCTCA TCGGCCCAGG GCGCAAACTC
TACCACACGG CGGCCACGCA GTTCACCTTT CTCGCTCCAC GCGGGGCTCA GCAGGACGAC
TATGTCCGCC TCCTCGCGGA CGAGCACCGG CAGGCGCGGC AACGCTCCAT CACTGGGATG
CTGCTGACGA GCGCGATCGG TGTGAGCGTG TTCAAGCCTT GCACGACGGC GCCACAAGAC
GCGCTGCGCT CCCTCTACAG CGCGGTTCAG GACGCGCGCT CGTTGCACGA TCTCATCAGC
GTCTACTCGT CCGTTGCCGA CGAGGCCTAC CAGCGCCGGT ACCAACTGCT CCAGGATTTC
GGGCCGGCCC TCGGTGCCGA CGACCAGCTA CGCCTCGTTT TCCAGCCACG CATCGACCTG
TCCACCGGTC GATGCATCGG CGCGGAGGCG CTGCTGCGCT GGGACCATCC AGAGTTGGGG
CCCGTGTCCC CCGGCGAGTT CGTGCCGGTG ATCGAACTCT CGCCCCACGC GCAGGCGATG
ACGGCCTTCG TTCTCGAGAG GGCGCTGGCG CAGGCGCGCC GCTGGCAGGA TGCCGGGCAC
AGCTTGGTGA TGTCGGTCAA CATCTCAGCC GCGAACTTGA TCGAAGCTGG CTTCGGCCAG
TCGGTCGAGG CCGGCCTTCG GCGCCACGGC CTTGCGCCCG GGCAGTTGGA ACTGGAAGTG
ACTGAGAGCG CGATCATGCA AAATGCTGAA CAGGCGCGAC GTCAGTTGGA CTTGCTGGCC
GCGGCCGGCA TTCGCTTGGC GATTGACGAC TTTGGGACGG GCTACAGCAG CTTGGCCTAC
CTGCAGGACA TCCCAGCGCA CGTCGTGAAG ATCGATCAAA GCTTCGTGCG CAAGCTTGCG
GATGGCGAGC GGGAACGATC GCTCGTCCAC TCGATGATCC ACCTCTCGCA TGATCTCGGC
TACCGGGTGG TCGCGGAGGG CATCGAGACG GCGGAGGCAG CCGACCAAGT CAGGGCGATG
GCCTGTGATG AGGCGCAAGG CTATCTCTTC GCCCGCCCGC TGGAGATCGG GACATTTGAG
ACGTGGCTCA GGGAGCATGA GCAGGACGCC CGGTACGAGC CGGCACTGGC CAGCTGA
 
Protein sequence
MSGTDCRERL EEDRLNALQA LNLLDTPPSE SFDRITRMAS QIFNLPISAV SLTDRDRQWF 
KSRIGVDHCS IPRDKAPCGQ VAESAEVLVI PDFAQDAFYA DSVLGRSGIR FYAGAPLVTR
EGYSLGALCV LGTEPRTVSP AEIAALTDLA AMVMAQIELQ HAFGRVDPLS GLPSRNQFLD
DLADLAAEHP DEARIAVLID LGRPEQVSAY SRVMGPGRVD DLVREAAREL RRLIGPGRKL
YHTAATQFTF LAPRGAQQDD YVRLLADEHR QARQRSITGM LLTSAIGVSV FKPCTTAPQD
ALRSLYSAVQ DARSLHDLIS VYSSVADEAY QRRYQLLQDF GPALGADDQL RLVFQPRIDL
STGRCIGAEA LLRWDHPELG PVSPGEFVPV IELSPHAQAM TAFVLERALA QARRWQDAGH
SLVMSVNISA ANLIEAGFGQ SVEAGLRRHG LAPGQLELEV TESAIMQNAE QARRQLDLLA
AAGIRLAIDD FGTGYSSLAY LQDIPAHVVK IDQSFVRKLA DGERERSLVH SMIHLSHDLG
YRVVAEGIET AEAADQVRAM ACDEAQGYLF ARPLEIGTFE TWLREHEQDA RYEPALAS