Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0953 |
Symbol | |
ID | 8415243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1160512 |
End bp | 1161879 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023917 |
Product | putative manganese-dependent inorganic pyrophosphatase |
Protein accession | YP_003181314 |
Protein GI | 257790708 |
COG category | [C] Energy production and conversion |
COG ID | [COG1227] Inorganic pyrophosphatase/exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0157605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00000184283 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGCAC CGATCATCGT CGTGGGCCAC AAGAACCCGG ACAACGATTC CATCTCCTCG GCAGTGGGCT ACGCCTACCT GAAGAACGAG CTGGCGCGCC GTGCGGCCGG CGAGGGGGAG CCGTTCCAGA CGTACGTTCC CGCCCGCTTG GGGCCGTTGC CTCCGGAGAG CGCCTGGGTT CTGGAAGAGA GCGGTATCCC CGCGCCCGAG ATCGTGGGTC ACGTGCATGC GCGCGTCGGA GACGTGATGA CGCCGAGCCC TATATCCATC AGCCATAACG CCACGCTGCT CGAAGCGGGT CGCCTGCTGC GCCAGTACAA CGTGCGCGCG CTCGTGGTGA CGAACGACGA CGGCACCTAC CGTGGACTCA TCACCACGCG CATGATCGCC GAGCGCTACA TCGCCGCCAC CGACGCCCTC GAGGACGGAG GGGCGAACGA GATGGCGGTC GCCGGCGACC TCATCGCCTC GCTCGGTCAG AAGGTGGACG AGATCACCGA GACCGATGTG CTCATCCTCG ACAAGGAGGG CCTGCTCAAG GAGGCTATCG AAGACCTCAT GGCCAGCGCG TTGCGCGAGG CCGTCGTGCT GAACGACGAC GGCCTCGCCA TCGGCATCGT CACGCGCTCG GACGTGGCCG TGCGCCCGAA GCGCAAGGTG GTGCTCGTGG ACCACAACGA GACGCGCCAG GCCGCCAACG GCATCGAGGA GGCCGAGGTC GTCGAGATCG TCGACCATCA TCGCATCGCC GACGTGATGA CTGCCAACCC CATCCAGTTC CTCAACCTTC CCGTGGGCTC CACGGCGACC ATCGTCACGA TGGAGTTCCG CCGCCACAAC GTGGAGATGC CTCCGGCCAT CGCGCGCGTG CTGCTGTCGG CCGTGATGAC AGACACCGTC ATCCTCAAGT CGCCCACCGC CACGCCGACC GATCACGAGC AGGTAGCCTA CCTCGCCGGC ATCGCGGGCG TCGATCCCAC CGAATTCGGC CTTGCCGTGT TCAAGTGCCG CGGCGGCGAG GACGACATGC CCGTCGACAA GCTCGTCGGC GCCGACGCCA AGGAGTTCCA GATCGGCGAC GCCACCGTTC TCATCGCGCA GCACGAGACG GTGGATCTTC CCGCCGTCAT GAAACGCGAA GAGGAGATCC GCGAGCATAT GCGCCGTCTG CGCGACGACC ACGGCTACGA GTTCGTGCTG CTGCTGGTCA CCGATATCGT GGCCGAGGGC AGCCAGTTCA TGTGCGAGGG CAACCGCCGC ATCGTCAACC GCGTGTTCGG CATCCATTGC ACGGGCGAAG GCGGCACCTG GATGCCCGGC ATCCTCAGCA GGAAGAAGCA GGTGGCGGCG AAGATCCTAG GAGCATAG
|
Protein sequence | MSAPIIVVGH KNPDNDSISS AVGYAYLKNE LARRAAGEGE PFQTYVPARL GPLPPESAWV LEESGIPAPE IVGHVHARVG DVMTPSPISI SHNATLLEAG RLLRQYNVRA LVVTNDDGTY RGLITTRMIA ERYIAATDAL EDGGANEMAV AGDLIASLGQ KVDEITETDV LILDKEGLLK EAIEDLMASA LREAVVLNDD GLAIGIVTRS DVAVRPKRKV VLVDHNETRQ AANGIEEAEV VEIVDHHRIA DVMTANPIQF LNLPVGSTAT IVTMEFRRHN VEMPPAIARV LLSAVMTDTV ILKSPTATPT DHEQVAYLAG IAGVDPTEFG LAVFKCRGGE DDMPVDKLVG ADAKEFQIGD ATVLIAQHET VDLPAVMKRE EEIREHMRRL RDDHGYEFVL LLVTDIVAEG SQFMCEGNRR IVNRVFGIHC TGEGGTWMPG ILSRKKQVAA KILGA
|
| |