Gene Elen_0953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0953 
Symbol 
ID8415243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1160512 
End bp1161879 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content66% 
IMG OID645023917 
Productputative manganese-dependent inorganic pyrophosphatase 
Protein accessionYP_003181314 
Protein GI257790708 
COG category[C] Energy production and conversion 
COG ID[COG1227] Inorganic pyrophosphatase/exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0157605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00000184283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGGCAC CGATCATCGT CGTGGGCCAC AAGAACCCGG ACAACGATTC CATCTCCTCG 
GCAGTGGGCT ACGCCTACCT GAAGAACGAG CTGGCGCGCC GTGCGGCCGG CGAGGGGGAG
CCGTTCCAGA CGTACGTTCC CGCCCGCTTG GGGCCGTTGC CTCCGGAGAG CGCCTGGGTT
CTGGAAGAGA GCGGTATCCC CGCGCCCGAG ATCGTGGGTC ACGTGCATGC GCGCGTCGGA
GACGTGATGA CGCCGAGCCC TATATCCATC AGCCATAACG CCACGCTGCT CGAAGCGGGT
CGCCTGCTGC GCCAGTACAA CGTGCGCGCG CTCGTGGTGA CGAACGACGA CGGCACCTAC
CGTGGACTCA TCACCACGCG CATGATCGCC GAGCGCTACA TCGCCGCCAC CGACGCCCTC
GAGGACGGAG GGGCGAACGA GATGGCGGTC GCCGGCGACC TCATCGCCTC GCTCGGTCAG
AAGGTGGACG AGATCACCGA GACCGATGTG CTCATCCTCG ACAAGGAGGG CCTGCTCAAG
GAGGCTATCG AAGACCTCAT GGCCAGCGCG TTGCGCGAGG CCGTCGTGCT GAACGACGAC
GGCCTCGCCA TCGGCATCGT CACGCGCTCG GACGTGGCCG TGCGCCCGAA GCGCAAGGTG
GTGCTCGTGG ACCACAACGA GACGCGCCAG GCCGCCAACG GCATCGAGGA GGCCGAGGTC
GTCGAGATCG TCGACCATCA TCGCATCGCC GACGTGATGA CTGCCAACCC CATCCAGTTC
CTCAACCTTC CCGTGGGCTC CACGGCGACC ATCGTCACGA TGGAGTTCCG CCGCCACAAC
GTGGAGATGC CTCCGGCCAT CGCGCGCGTG CTGCTGTCGG CCGTGATGAC AGACACCGTC
ATCCTCAAGT CGCCCACCGC CACGCCGACC GATCACGAGC AGGTAGCCTA CCTCGCCGGC
ATCGCGGGCG TCGATCCCAC CGAATTCGGC CTTGCCGTGT TCAAGTGCCG CGGCGGCGAG
GACGACATGC CCGTCGACAA GCTCGTCGGC GCCGACGCCA AGGAGTTCCA GATCGGCGAC
GCCACCGTTC TCATCGCGCA GCACGAGACG GTGGATCTTC CCGCCGTCAT GAAACGCGAA
GAGGAGATCC GCGAGCATAT GCGCCGTCTG CGCGACGACC ACGGCTACGA GTTCGTGCTG
CTGCTGGTCA CCGATATCGT GGCCGAGGGC AGCCAGTTCA TGTGCGAGGG CAACCGCCGC
ATCGTCAACC GCGTGTTCGG CATCCATTGC ACGGGCGAAG GCGGCACCTG GATGCCCGGC
ATCCTCAGCA GGAAGAAGCA GGTGGCGGCG AAGATCCTAG GAGCATAG
 
Protein sequence
MSAPIIVVGH KNPDNDSISS AVGYAYLKNE LARRAAGEGE PFQTYVPARL GPLPPESAWV 
LEESGIPAPE IVGHVHARVG DVMTPSPISI SHNATLLEAG RLLRQYNVRA LVVTNDDGTY
RGLITTRMIA ERYIAATDAL EDGGANEMAV AGDLIASLGQ KVDEITETDV LILDKEGLLK
EAIEDLMASA LREAVVLNDD GLAIGIVTRS DVAVRPKRKV VLVDHNETRQ AANGIEEAEV
VEIVDHHRIA DVMTANPIQF LNLPVGSTAT IVTMEFRRHN VEMPPAIARV LLSAVMTDTV
ILKSPTATPT DHEQVAYLAG IAGVDPTEFG LAVFKCRGGE DDMPVDKLVG ADAKEFQIGD
ATVLIAQHET VDLPAVMKRE EEIREHMRRL RDDHGYEFVL LLVTDIVAEG SQFMCEGNRR
IVNRVFGIHC TGEGGTWMPG ILSRKKQVAA KILGA