Gene Elen_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2229 
Symbol 
ID8416552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2616659 
End bp2619604 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content62% 
IMG OID645025215 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_003182579 
Protein GI257791973 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0465115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.264216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTTG CCTGGGCGTT CGTCATCGCG TTCACCTGTA CGGTCGCAAC GACCACGTAT 
CCCGGATCCT CGCGGGCTTT CGCCGACGAG TTCGCCACGT TCGGCAAGCG CGTCGTGACC
GTCACCTACT TCGAAGACGG CGACTATATG TCCACCGACG AAGAGGGGCG CTACGTTGGG
TACAACATCG AGTATCTCAA CGAAATCGCA CGCTACGCCG ACTGGACATA TGAATACGTC
AAGTACCCTA GTTGGGAAGA GGCCTGTGCC GCGCTTGAAG CGAGCAAGGT GGATCTGCTT
CCCATGGTGT ACTACACCGA AGACCGAGAG AGGCGGATGA TCTTCTCGGC ATCATCGCTC
TGCGAGATTT CCACCACGCT CAACGTGAGG CTCGACGACA CGCGTTACGC CTACGAAGAC
TTCAAAACGT TTTCCGGCAT GCGCGTGGGG GTAATCGCGA ACAGCCAGGA CGCCGAAGCG
TTCGCACAAT ACAGCGAGAA GAACGGCTTT TCGGCGGATA TCGTGGCCTA CAGCGCTACC
GGAGACCTGC TGAGAGCGCT CGACGAAGGA GCGGTGGATG CCATTGCCAT CACATATCTC
GGGACGAATT CGCGCTTCCG AACCGTAGCT CAGTTCGTCC CCGAGCCTTT GTACATCGCC
CTCTCGCCCG AACGCACCGA CATAGCCGAT GAGCTCGACA GCGCGATGAG CCGCCTCAAA
CTGCGCGACC CCGACTTCGC CACGTTGCTG TACGACCGCT ACTTCGGCAT CAACACCGAC
CAGGATCCCG TCTTCACCGA AGACGAGTAC GCCTATCTGG CGTCGGCTCC CACCCTGCGC
GTGGCCTACG ATTCGTATCG CGCGCCGCTT TCCTACACCG ACCCCGAAAC CGGCGCGTTC
GCCGGCGCGG TCGCATTGCT GTTCGAAGAC ATCGCGCAGA TCACCGGGCT GAAGTTCGAA
TTCGTGGCTG CGGACTGCCA CGACGAGGCC GTCCGCCTCG TCGAACGCGG CGACGCCGAC
ATCGTGTACG ACGTCGATCG AGAATCCGAC CCCCAGGCGA TCAAAAGCCT CGACACCACC
GGCCCCTATC TGCGCGACCC CATGGCCCTT GTCGCAGGAC CGAATCCTTC CGGTTCGCGC
GTCGCGCTGC CAAGCGGCTT CTCGCTCGCC GCGAGCACGG CGTACTCTTC GTACGCCGAA
TACGACATCG TGTACTGCGA CACCCCGAAA GATTGCTTCG ATGCAGTGCT TGAAGGCAAA
GCCGATATCG CCTTTGCCGA TACCCACGTG GCGAACTACT TGCTGGCGGA ACCCCAATAC
GAGAGCCTGA GCGTCACCAC CATCACCTCC TTCTTCAACA GCATGAGCAT CGGCGTGAGC
CGCAACGCCG ACCGACGACT GGTGAGCATC CTCGACCGCT GCGTGCAGTA CACCGCCGAA
AGCAAGATGA CCACCTGGCT TTCGCAAAGC AGCCTCGCCG TACATCCCAT CAGCCCTTTC
GACTTCTTGC GCCAGTATCC GGTGCAATTC ATGGCCGGCA TCGTCGCACT GCTCGGCTCC
GTGCTCGGCG TTGCCCTGTA CGTCAGCCAT GTGAAGCTTC GCGCGGCGCG GCGAGTGGAG
GACTTCTCGT TCACCGATCC GCTGACCGAA GGCTGGAGTC TCGCCCGCTT CCGCTCCGAG
GTGGGCGCTC AGATGGCGAA CGCCCGCGAT GGCGCTTACG CCATCGTGTA CCTCGACGTA
AAAAGCTTCA AAGGCTTCAA CGCAGCATTC GGCTACGCCA CGGGCGATCG CGTGCTGCTC
GACCTCAACG GCACGCTGGC CGGCATGAAA GCACCCGACG AACGATACGC GCACGTCATC
GCAGACGAAT TCGTGCTGTT GGTTCGCTGG AGAGGCTGGG ACGCGCTGCT GGAGCGCTTC
GATGAACTGG ATCGTCGCTT CAACAGCACC GAAACGCTTA CCGAGCTATC GCACCGGCTC
ATGCTGCAGG CCGGCGTCTG CATCATCGAG CGCAGCGCCG AAACGCCGCG CATCGACGTG
CAAACCATCA TCGAGTTCGT GGACGCCGCA CGCTACGCCC GGGACAGCAT CGGGGAGGCC
TCGCGCAGCA CCGCAGCATT GTACTCGGCG AGCATGAAGG ATCGCGACAT CGCCGAACGC
GCGCTGGTGG CCGCCGCGCA CGACGCGCTC GAGCGCGGGG AGTTCACGGC CTATTACCAG
CCGAAGGTGG AAATAGCGAC GAACCGCCTC GTGGGCTTCG AAGCGCTCGT TCGTTGGGAA
TCGCCTGAAC GCGGCCTCGT GCCGCCCGAC GAGTTCATTC CCCTGTTCGA GCGCACCGGC
CTGGTCGTCG ACTTGGACTT GCAGGTATTT CGCCTCGTCT GCGCCCGTAT CCGGGAGCAG
CTGGACGCGG GAGAACACCC GCTCGTCATC GCCTGCAACT TCTCGCGCTT GCACATGCGA
AACGACGCGT TCCCTGAAAC GGTGAAGAGC ATCGTCGACG GATTCGGCGT GCCCATCGAG
CTGCTGGAGC TCGAACTGAC CGAGAACATC GTCATGGAAG ACCTCGAACG CGCCGAACGA
CTGTGCCGCC GTTTAAAGGA TCTGGGCTTC CGCATCGCCA TTGACGACTT CGGCAGCGGG
TACTCGTCGC TGGGCACGCT GCAGAACCTG CCGATCGACG TGCTGAAGCT CGACCGCAGC
TTCCTCATGA GCAGCGAGAG CGGCGAGCGC TGCAAGGCCA TCCTGGACGG CGTGGTGTCC
ATCGCCGACA AGCTGGCCGT GAACGTGGTG GTGGAAGGCG TGGAAACGCG CGATCAGGCG
TCCATGCTCG TGCGCATGGA CGATCGCATC ATCGCGCAGG GGTTCCTCTA CTCGCGCCCC
GTCCCTCGGG ACGTCTCGGA CGCGCAGTTC GCCGTCGGCT TCATCGAGCC GAACGAACGC
CCGTAG
 
Protein sequence
MMFAWAFVIA FTCTVATTTY PGSSRAFADE FATFGKRVVT VTYFEDGDYM STDEEGRYVG 
YNIEYLNEIA RYADWTYEYV KYPSWEEACA ALEASKVDLL PMVYYTEDRE RRMIFSASSL
CEISTTLNVR LDDTRYAYED FKTFSGMRVG VIANSQDAEA FAQYSEKNGF SADIVAYSAT
GDLLRALDEG AVDAIAITYL GTNSRFRTVA QFVPEPLYIA LSPERTDIAD ELDSAMSRLK
LRDPDFATLL YDRYFGINTD QDPVFTEDEY AYLASAPTLR VAYDSYRAPL SYTDPETGAF
AGAVALLFED IAQITGLKFE FVAADCHDEA VRLVERGDAD IVYDVDRESD PQAIKSLDTT
GPYLRDPMAL VAGPNPSGSR VALPSGFSLA ASTAYSSYAE YDIVYCDTPK DCFDAVLEGK
ADIAFADTHV ANYLLAEPQY ESLSVTTITS FFNSMSIGVS RNADRRLVSI LDRCVQYTAE
SKMTTWLSQS SLAVHPISPF DFLRQYPVQF MAGIVALLGS VLGVALYVSH VKLRAARRVE
DFSFTDPLTE GWSLARFRSE VGAQMANARD GAYAIVYLDV KSFKGFNAAF GYATGDRVLL
DLNGTLAGMK APDERYAHVI ADEFVLLVRW RGWDALLERF DELDRRFNST ETLTELSHRL
MLQAGVCIIE RSAETPRIDV QTIIEFVDAA RYARDSIGEA SRSTAALYSA SMKDRDIAER
ALVAAAHDAL ERGEFTAYYQ PKVEIATNRL VGFEALVRWE SPERGLVPPD EFIPLFERTG
LVVDLDLQVF RLVCARIREQ LDAGEHPLVI ACNFSRLHMR NDAFPETVKS IVDGFGVPIE
LLELELTENI VMEDLERAER LCRRLKDLGF RIAIDDFGSG YSSLGTLQNL PIDVLKLDRS
FLMSSESGER CKAILDGVVS IADKLAVNVV VEGVETRDQA SMLVRMDDRI IAQGFLYSRP
VPRDVSDAQF AVGFIEPNER P