Gene Elen_1617 details


Gene Information

Locus tag: Elen_1617
Symbol:
ID: 8415916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1915147
End bp: 1916679
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 67%
IMG OID: 645024586
Product: RNA binding metal dependent phosphohydrolase
Protein accession: YP_003181974
Protein GI: 257791368
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR03319] conserved hypothetical protein YmdA/YtgF
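
The start, end, and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1915147
end_bp = 1916679

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1533
print(protein_length)  # 510
```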


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000205751
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000531756
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGAAT TCATCTGGCT GATCATCGGA GCCGTCGTCG GCATCGCCCT CGGCTTCGTC 
GTCACCCGCT ACCTGGTCAA CGCATCCACG AAGCGAGCGG CGCAAGAGGC CGAATCCGTG
GTGAACGACG CGAAACGACA AGCCGAAACG CTGCGCCGCG AGGCCATCAT CGAAGCGAAG
GACGAAGCTC TCAAGCTGAA GCAGGATGCG CAAGCCGAGA GCAAGGAGCG TCTGCGCGAA
GTGCGTTCCG CTGAGAACCG CATCTCGCAG CGCGAGGAAT CGCTCGACCG CCGCGTCGAA
TCGCTCGACG CGCGCGAGCA TCAGATCTCA TCCATGCAGG GCCAGCTCGA GCGTCGCGAA
CGCGATCTCG AAGAGGCCAC GCGGGAGGTG AACTACCGCC TCGAGCGCGT GGCCGGCATG
ACGCCGGACG AGGCGAAGGC GGAGCTGCTC GACACCCTCA AGGACGAGGT GACGCACGAG
TCCGCGGCCA TCATCCGCGA TGCCGAGGCG CGCGCGAAGG CGGAGGCCGA CAAGAAGGCC
CGCTCCATCC TCAGCCTCGC CATCCAGCGC GTGGCGGCCG ACCACTCGGC CGAAACCACC
GTGTCCACCA TCCATATCCC CTCCGACGAC CTCAAGGGCC GCATCATCGG CCGCGAGGGC
CGCAACATCC GCTCGTTCGA ACAGCTGACC GGAACGAACC TCATCATCGA CGACACCCCG
GAGTGCGTGA CCATCTCGTG CTTCGATCCG GTCCGTCGCG AGATCGGCCG CGTTACGATG
GAGAACCTCG TGGCCGACGG CCGCATCCAT CCGGCGCGCA TCGAGGAGAT GTTCGGCAAG
GCCGAGGCGT TCGTGAACCA GCGCGTCCAG GAAGCGGGCG AGCAGGCCAC GTTCGACACC
GGCATCCACG ATCTGCACCC CGAGCTCGTG CGCACGCTGG GCCGTCTGCG CTACCGCACC
TCGTACGGCC AGAACGTGCT GAACCACTCG CTGGAGGTGG CCTACCTCTC CGGCGTCATG
GCTTCCGAAC TGGGGCTGGA TCCCATCCCG GCCAAGCGCG CCGGCCTGCT GCACGATTTG
GGCAAGGCGG TCGACCACGA GGTGGAGGGC AGCCACGCCG TCATTGGAGC CGACCTGGCC
CGCCGTTTCG GCGAGCGACC CGAGATCGTG CACGCCATCG AGGCGCACCA CAACGACGTG
GAGCCGTCCA GCGTGCTGGC CGTGCTCGTT CAGGCGGCCG ATGCCGTGTC CGCGGCGCGT
CCCGGCGCCC GCAAGGAGAC ACTCGACGCC TACGTGAAGC GCCTCGAGAA GCTGGAGGAG
ATCGCCAGCT CGTACAAGGG CGTGGAGCGC ACGTACGCCA TTCAGGCGGG TCGCGAGGTG
CGCGTGATGG TGGAGCCCGA CACGGTGGAC GAAGCCGCCA CCACGGTGCT TGCGCACGAC
ATCGCGCAGC GCATCGAGAA CGAGATGCAG TATCCCGGCC AGGTGAAGGT CGTGGTCATC
CGCGAGAGCC GCGCGGTCGG CGTCGCGAAG TAG
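
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first block of the listing. The 67% GC content reported above is the record's own value and is not recomputed here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted listing into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the spaces between blocks and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First ten-base block of the gene sequence above.
first_block = "ATGCCCGAAT"
print(gc_fraction(first_block))  # 0.5
```

Passing the full listing, with its spaces and line breaks, to `gc_fraction` would give the whole-gene value to compare against the reported figure.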
 
Protein sequence
MPEFIWLIIG AVVGIALGFV VTRYLVNAST KRAAQEAESV VNDAKRQAET LRREAIIEAK 
DEALKLKQDA QAESKERLRE VRSAENRISQ REESLDRRVE SLDAREHQIS SMQGQLERRE
RDLEEATREV NYRLERVAGM TPDEAKAELL DTLKDEVTHE SAAIIRDAEA RAKAEADKKA
RSILSLAIQR VAADHSAETT VSTIHIPSDD LKGRIIGREG RNIRSFEQLT GTNLIIDDTP
ECVTISCFDP VRREIGRVTM ENLVADGRIH PARIEEMFGK AEAFVNQRVQ EAGEQATFDT
GIHDLHPELV RTLGRLRYRT SYGQNVLNHS LEVAYLSGVM ASELGLDPIP AKRAGLLHDL
GKAVDHEVEG SHAVIGADLA RRFGERPEIV HAIEAHHNDV EPSSVLAVLV QAADAVSAAR
PGARKETLDA YVKRLEKLEE IASSYKGVER TYAIQAGREV RVMVEPDTVD EAATTVLAHD
IAQRIENEMQ YPGQVKVVVI RESRAVGVAK
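
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is an illustration, not the database's own pipeline. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for internal codons. The special handling of alternative start codons is omitted, an assumption that is harmless here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGCCCGAAT TCATCTGGCT GATCATCGGA"))      # MPEFIWLIIG
# The final line of the gene sequence gives the protein's last ten residues.
print(translate("CGCGAGAGCC GCGCGGTCGG CGTCGCGAAG TAG"))  # RESRAVGVAK
```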