Gene Elen_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1803 
Symbol 
ID8416107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2113754 
End bp2115313 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content63% 
IMG OID645024774 
Productdiguanylate cyclase 
Protein accessionYP_003182157 
Protein GI257791551 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain
[TIGR01168] Gram-positive signal peptide, YSIRK family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00715144 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.974304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA CAACAGGCAA AAACCTGAGA ACAACGCGGT TCTCGTTCCG GCGCATGGCC 
TTGTACGGAG TCGCGCTCGC GTTGGCGCTG CTTATCGCAT CCGCTGTGAT GGGCGCGGTT
CTGGCCGTGC GGGATATGCG CGATCGGGCC GACGTATCGC TGACGAACGC CAAGGCGCGC
ATCGAGTCGA GCGTCGCGGA GTCTTTCAAG CTGCTGGAAT CGTTGGCCGA GCAGCCGACG
CTGTACGAAC GCTCCGTCTG GGTCATGGAC AAAGTGACGA TGCTCGACCA GGTGAACGAG
CATTTCGGCT ACTTCCTGCT ATGCTACGTC GATGACGAGA TGAACGTGTG GGACGTGACA
GGTTCGGCGA GCCTCGCCAG CCGCGATTTC ATGCAGAAAT GCTATTCCAC GGGTCAGGGC
TTGGTCACCG ACAGCTTCGC GGCCGGCGCC GACGGCGTGA CGCTCAACTA CGTCGTGCTC
GTGCCCCTCT TCGACGGCGG CGAGATGACG GGATCGCTGT TCGTCTCGCT GTACTTCGAC
GATATGGTGC GCATCCTCGC CGAGAGCGCC GTCGGCCCCG ATGTGGGATC GGTGCTCATC
GGAAGCCGGG GCCAGACGAT GTCGGCCACG TCGGGTTTCG TGTACGACGA CATGTTCCTC
GACCCGCTTC GCAGCAGCAT CGCGTTCGGT ATGACCGCCG ATGTCGTGGA GCGGGAGCTG
ATGGCGCTCA ACCCGGTGTC CTTCTGGACC GTAGACGGAT TGGATGTACG ATACTACACG
GCCGTTCCTA TCGCCGATAC CGCATGGGAC GCCGTATGCG TGACGAGCTT CTGGGACGCG
TACACCAAGG TCATGGCCGC GCTCGCTCCG CTGATCGCCG CCGGTTTGGC GATCGTCGCG
GGCGTGTTCC TGTTGCTTCG CCGCGATTTC ATGTGTCAAA TGGAAAACGC CCGCATGCTT
GAGAAGTCCG TCGAGGAGCT GCAGAGGAAA GTGTACGACG ATGGGCGATC GGCTGAAGCC
GACATCGCCG ACATCCTCGA GCTCACCTCG TCGGGCCTGT CCGACGGGCT GACCGGCACC
GTCACGCGCT CGGTGTTCTC CAGCAAGCTG GCGAGTGCGC TCGAGAACGC GCGGGACGGC
GGATCCCTGT ACGCCCTCTG CTTCATCGAC CTCGACGACT TCAAGACGAT CAACGACACG
TATGGCCATG CGACCGGCGA CGCGGCTTTG AAATCCATCG GCTACGCCCT GCGCGGCTAC
GAGCGGCGCT ACGACGGCAT GGTGGGACGT TACGGCGGCG ACGAGTTCGT GATGCTCATG
ACCGACATCG ACGACGAAGG CGAGCTGCGC GCCGTGCTCG ACGAGATGGT GGGCGACCTT
CATGTGGACA TCCAGGTGGG CGACGCGGTG GTCTCGGTGC ATTGCAGCAT CGGCGCGGCC
GTGTGGGACC GGGTTTCCGA CGCCGATGCG CTTTTGGGGC AGGCCGACAA CGCTCTGTAT
CGCGTCAAGC AGCATGGCAA GGAAGGGTAT TTCGTGTTCG GCGAAGAGGA TGCGCAGTGA
 
Protein sequence
MAKTTGKNLR TTRFSFRRMA LYGVALALAL LIASAVMGAV LAVRDMRDRA DVSLTNAKAR 
IESSVAESFK LLESLAEQPT LYERSVWVMD KVTMLDQVNE HFGYFLLCYV DDEMNVWDVT
GSASLASRDF MQKCYSTGQG LVTDSFAAGA DGVTLNYVVL VPLFDGGEMT GSLFVSLYFD
DMVRILAESA VGPDVGSVLI GSRGQTMSAT SGFVYDDMFL DPLRSSIAFG MTADVVEREL
MALNPVSFWT VDGLDVRYYT AVPIADTAWD AVCVTSFWDA YTKVMAALAP LIAAGLAIVA
GVFLLLRRDF MCQMENARML EKSVEELQRK VYDDGRSAEA DIADILELTS SGLSDGLTGT
VTRSVFSSKL ASALENARDG GSLYALCFID LDDFKTINDT YGHATGDAAL KSIGYALRGY
ERRYDGMVGR YGGDEFVMLM TDIDDEGELR AVLDEMVGDL HVDIQVGDAV VSVHCSIGAA
VWDRVSDADA LLGQADNALY RVKQHGKEGY FVFGEEDAQ