Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2229 |
Symbol | |
ID | 8416552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2616659 |
End bp | 2619604 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025215 |
Product | diguanylate cyclase/phosphodiesterase |
Protein accession | YP_003182579 |
Protein GI | 257791973 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0465115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.264216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTTG CCTGGGCGTT CGTCATCGCG TTCACCTGTA CGGTCGCAAC GACCACGTAT CCCGGATCCT CGCGGGCTTT CGCCGACGAG TTCGCCACGT TCGGCAAGCG CGTCGTGACC GTCACCTACT TCGAAGACGG CGACTATATG TCCACCGACG AAGAGGGGCG CTACGTTGGG TACAACATCG AGTATCTCAA CGAAATCGCA CGCTACGCCG ACTGGACATA TGAATACGTC AAGTACCCTA GTTGGGAAGA GGCCTGTGCC GCGCTTGAAG CGAGCAAGGT GGATCTGCTT CCCATGGTGT ACTACACCGA AGACCGAGAG AGGCGGATGA TCTTCTCGGC ATCATCGCTC TGCGAGATTT CCACCACGCT CAACGTGAGG CTCGACGACA CGCGTTACGC CTACGAAGAC TTCAAAACGT TTTCCGGCAT GCGCGTGGGG GTAATCGCGA ACAGCCAGGA CGCCGAAGCG TTCGCACAAT ACAGCGAGAA GAACGGCTTT TCGGCGGATA TCGTGGCCTA CAGCGCTACC GGAGACCTGC TGAGAGCGCT CGACGAAGGA GCGGTGGATG CCATTGCCAT CACATATCTC GGGACGAATT CGCGCTTCCG AACCGTAGCT CAGTTCGTCC CCGAGCCTTT GTACATCGCC CTCTCGCCCG AACGCACCGA CATAGCCGAT GAGCTCGACA GCGCGATGAG CCGCCTCAAA CTGCGCGACC CCGACTTCGC CACGTTGCTG TACGACCGCT ACTTCGGCAT CAACACCGAC CAGGATCCCG TCTTCACCGA AGACGAGTAC GCCTATCTGG CGTCGGCTCC CACCCTGCGC GTGGCCTACG ATTCGTATCG CGCGCCGCTT TCCTACACCG ACCCCGAAAC CGGCGCGTTC GCCGGCGCGG TCGCATTGCT GTTCGAAGAC ATCGCGCAGA TCACCGGGCT GAAGTTCGAA TTCGTGGCTG CGGACTGCCA CGACGAGGCC GTCCGCCTCG TCGAACGCGG CGACGCCGAC ATCGTGTACG ACGTCGATCG AGAATCCGAC CCCCAGGCGA TCAAAAGCCT CGACACCACC GGCCCCTATC TGCGCGACCC CATGGCCCTT GTCGCAGGAC CGAATCCTTC CGGTTCGCGC GTCGCGCTGC CAAGCGGCTT CTCGCTCGCC GCGAGCACGG CGTACTCTTC GTACGCCGAA TACGACATCG TGTACTGCGA CACCCCGAAA GATTGCTTCG ATGCAGTGCT TGAAGGCAAA GCCGATATCG CCTTTGCCGA TACCCACGTG GCGAACTACT TGCTGGCGGA ACCCCAATAC GAGAGCCTGA GCGTCACCAC CATCACCTCC TTCTTCAACA GCATGAGCAT CGGCGTGAGC CGCAACGCCG ACCGACGACT GGTGAGCATC CTCGACCGCT GCGTGCAGTA CACCGCCGAA AGCAAGATGA CCACCTGGCT TTCGCAAAGC AGCCTCGCCG TACATCCCAT CAGCCCTTTC GACTTCTTGC GCCAGTATCC GGTGCAATTC ATGGCCGGCA TCGTCGCACT GCTCGGCTCC GTGCTCGGCG TTGCCCTGTA CGTCAGCCAT GTGAAGCTTC GCGCGGCGCG GCGAGTGGAG GACTTCTCGT TCACCGATCC GCTGACCGAA GGCTGGAGTC TCGCCCGCTT CCGCTCCGAG GTGGGCGCTC AGATGGCGAA CGCCCGCGAT GGCGCTTACG CCATCGTGTA CCTCGACGTA AAAAGCTTCA AAGGCTTCAA CGCAGCATTC GGCTACGCCA CGGGCGATCG CGTGCTGCTC GACCTCAACG GCACGCTGGC CGGCATGAAA GCACCCGACG AACGATACGC GCACGTCATC GCAGACGAAT TCGTGCTGTT GGTTCGCTGG AGAGGCTGGG ACGCGCTGCT GGAGCGCTTC GATGAACTGG ATCGTCGCTT CAACAGCACC GAAACGCTTA CCGAGCTATC GCACCGGCTC ATGCTGCAGG CCGGCGTCTG CATCATCGAG CGCAGCGCCG AAACGCCGCG CATCGACGTG CAAACCATCA TCGAGTTCGT GGACGCCGCA CGCTACGCCC GGGACAGCAT CGGGGAGGCC TCGCGCAGCA CCGCAGCATT GTACTCGGCG AGCATGAAGG ATCGCGACAT CGCCGAACGC GCGCTGGTGG CCGCCGCGCA CGACGCGCTC GAGCGCGGGG AGTTCACGGC CTATTACCAG CCGAAGGTGG AAATAGCGAC GAACCGCCTC GTGGGCTTCG AAGCGCTCGT TCGTTGGGAA TCGCCTGAAC GCGGCCTCGT GCCGCCCGAC GAGTTCATTC CCCTGTTCGA GCGCACCGGC CTGGTCGTCG ACTTGGACTT GCAGGTATTT CGCCTCGTCT GCGCCCGTAT CCGGGAGCAG CTGGACGCGG GAGAACACCC GCTCGTCATC GCCTGCAACT TCTCGCGCTT GCACATGCGA AACGACGCGT TCCCTGAAAC GGTGAAGAGC ATCGTCGACG GATTCGGCGT GCCCATCGAG CTGCTGGAGC TCGAACTGAC CGAGAACATC GTCATGGAAG ACCTCGAACG CGCCGAACGA CTGTGCCGCC GTTTAAAGGA TCTGGGCTTC CGCATCGCCA TTGACGACTT CGGCAGCGGG TACTCGTCGC TGGGCACGCT GCAGAACCTG CCGATCGACG TGCTGAAGCT CGACCGCAGC TTCCTCATGA GCAGCGAGAG CGGCGAGCGC TGCAAGGCCA TCCTGGACGG CGTGGTGTCC ATCGCCGACA AGCTGGCCGT GAACGTGGTG GTGGAAGGCG TGGAAACGCG CGATCAGGCG TCCATGCTCG TGCGCATGGA CGATCGCATC ATCGCGCAGG GGTTCCTCTA CTCGCGCCCC GTCCCTCGGG ACGTCTCGGA CGCGCAGTTC GCCGTCGGCT TCATCGAGCC GAACGAACGC CCGTAG
|
Protein sequence | MMFAWAFVIA FTCTVATTTY PGSSRAFADE FATFGKRVVT VTYFEDGDYM STDEEGRYVG YNIEYLNEIA RYADWTYEYV KYPSWEEACA ALEASKVDLL PMVYYTEDRE RRMIFSASSL CEISTTLNVR LDDTRYAYED FKTFSGMRVG VIANSQDAEA FAQYSEKNGF SADIVAYSAT GDLLRALDEG AVDAIAITYL GTNSRFRTVA QFVPEPLYIA LSPERTDIAD ELDSAMSRLK LRDPDFATLL YDRYFGINTD QDPVFTEDEY AYLASAPTLR VAYDSYRAPL SYTDPETGAF AGAVALLFED IAQITGLKFE FVAADCHDEA VRLVERGDAD IVYDVDRESD PQAIKSLDTT GPYLRDPMAL VAGPNPSGSR VALPSGFSLA ASTAYSSYAE YDIVYCDTPK DCFDAVLEGK ADIAFADTHV ANYLLAEPQY ESLSVTTITS FFNSMSIGVS RNADRRLVSI LDRCVQYTAE SKMTTWLSQS SLAVHPISPF DFLRQYPVQF MAGIVALLGS VLGVALYVSH VKLRAARRVE DFSFTDPLTE GWSLARFRSE VGAQMANARD GAYAIVYLDV KSFKGFNAAF GYATGDRVLL DLNGTLAGMK APDERYAHVI ADEFVLLVRW RGWDALLERF DELDRRFNST ETLTELSHRL MLQAGVCIIE RSAETPRIDV QTIIEFVDAA RYARDSIGEA SRSTAALYSA SMKDRDIAER ALVAAAHDAL ERGEFTAYYQ PKVEIATNRL VGFEALVRWE SPERGLVPPD EFIPLFERTG LVVDLDLQVF RLVCARIREQ LDAGEHPLVI ACNFSRLHMR NDAFPETVKS IVDGFGVPIE LLELELTENI VMEDLERAER LCRRLKDLGF RIAIDDFGSG YSSLGTLQNL PIDVLKLDRS FLMSSESGER CKAILDGVVS IADKLAVNVV VEGVETRDQA SMLVRMDDRI IAQGFLYSRP VPRDVSDAQF AVGFIEPNER P
|
| |