Gene Emin_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1214 
Symbol 
ID6263453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1312880 
End bp1314370 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content41% 
IMG OID642611692 
Productdiguanylate cyclase with GAF sensor 
Protein accessionYP_001876101 
Protein GI187251619 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.302028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000035245 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCGGTT TTAATGACAG GCACAGGCCC AAATTGCAGC TTTACCATTT TCATAAAAGG 
CTGAACCAGA CTTTCCAAAA TACAAGCGCC TTGTTAAACC AGGCTTTGCC TTTTATGCAA
AAAATTTTGG GGCTGGACCG TATTTACTTT TTTAATTGGG AAAAAAACCG CGAGCTTCTT
TCACTTACCA TGCTTTGCAA AGACGGCTAC TGCATGGACA TGCAGGAAAC TATTTCCTCA
ACAGGCAAGC AGGAAATCAT GGCTGATCTT CTAGCGAAAG GGTATTCCTT AAAAAGTGAT
TTAAGCTACC CCGCGATTTA TGTTTTTTTA CAATGGAAGG CGCCCGCCGC TTACGGTAAA
AACGGCGGCA ACTCCGCCAT GCAGGAAAAG TTTGGCGTGT TAAGGCTTGA ACGTTTTAAC
AAATCTAAAA AATTTAGTGA AAAAGAAATC AGGTTAATCA AGGGGCTTGT CAGCGAAATT
TCCCATAATA TGATTAACAC GGAAATAGAT CAGGACAACT CAGAACGCCT TAGGCTTGCC
ACCACGTTAA ATGATTTAGC CGCGGTGTTT GCTTCTTCCA TGCGTTTTAA CGACGCTATT
GAAGTTATTT TGCGCGGCGT GCAAAAAACT TTTAAATTTG ACCGTGTAAG AATGTATTTG
TTTGATTATG AAGGTGCAAA CATACGCGCC TCTTTAAGCA CTGATATAGC GGGCAATGTT
TCCAGAAGGG ACGGTAATAT TGACCCCGCC GAAATAAAAA ACGTTTCAAA CATGGAAGAG
TCTTTCAGCT CGCGCGTACT TAACCTGCCG CTTAATGTGC AGGGGAAAAG GGTGGGTATT
TTAATTTTTG ACAATCTTCT TTCCCGCCGC GATATTACGT ATTTGGATTT TTTGCATGTT
AAACAGTTTT CCTCCCAAAT CGCGCTGGCG GTTGATAACG CAGTTTTGTT CGAGCGCGTG
CAGGACCTTT ATAATTATGA CGAACTTACC AAACTGCCCG TAAGAAGGTA TTTTAATGAA
AAACTGATAG AGGAAATTTA CCGCTCCGAG CGGTTTGAGC TTACAATGTC GGTTATTATT
TTAGATATTG ACCATTTTAA AACAATTAAC GATACTTTCG GCCACAGTAC GGGCGATATT
GTTTTAAAAT CCGTAAGCGA TACTATATTA AAAAGTTTAA GACAGACTGA CTTTCCCTGC
CGCTACGGCG GCGATGAAAT CATGATTATG CTTCCGCGCA CAAGCGGGCA GGAAGCTAAG
TATACCGCAA GACGTTTATC TGAGGGTATT AAAAAAATCA AAATACCGGA GCAGTACACC
AACGGACGGG AGTATATCAT TTCAATTACC CAGGGTATAG CAGTGTACCC TCAGGATTCT
TCAGACGCTA TTGATTTATT TAATAAAGCG GACAGGGCTT TATATTACGC TAAAAACAAA
GAACGCGGCA CATACGCTCT TTATAATGAA ATACCTCCCG AAAGTAAATA G
 
Protein sequence
MFGFNDRHRP KLQLYHFHKR LNQTFQNTSA LLNQALPFMQ KILGLDRIYF FNWEKNRELL 
SLTMLCKDGY CMDMQETISS TGKQEIMADL LAKGYSLKSD LSYPAIYVFL QWKAPAAYGK
NGGNSAMQEK FGVLRLERFN KSKKFSEKEI RLIKGLVSEI SHNMINTEID QDNSERLRLA
TTLNDLAAVF ASSMRFNDAI EVILRGVQKT FKFDRVRMYL FDYEGANIRA SLSTDIAGNV
SRRDGNIDPA EIKNVSNMEE SFSSRVLNLP LNVQGKRVGI LIFDNLLSRR DITYLDFLHV
KQFSSQIALA VDNAVLFERV QDLYNYDELT KLPVRRYFNE KLIEEIYRSE RFELTMSVII
LDIDHFKTIN DTFGHSTGDI VLKSVSDTIL KSLRQTDFPC RYGGDEIMIM LPRTSGQEAK
YTARRLSEGI KKIKIPEQYT NGREYIISIT QGIAVYPQDS SDAIDLFNKA DRALYYAKNK
ERGTYALYNE IPPESK