Gene Elen_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1968 
Symbol 
ID8416279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2305140 
End bp2307506 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content69% 
IMG OID645024945 
Producttrehalose-phosphatase 
Protein accessionYP_003182321 
Protein GI257791715 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR00685] trehalose-phosphatase
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB
[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAACA CTCACAGCAC CGCGGCGACC CTTCTGCGCG ACACGCCGCT CCCCTTACCC 
CTCGCGCGCA TCGACCGCCC CCGGGAAGAG AGCGCGGCAG CTCGGCGCGC GCAGCATGAC
GCGCGCAGCG TCGTCGCGCC CGAGGCCGGG CGGCTGATCA TCGTGTCGAA CCGGCTGCCC
TACCGTTTCG AAGACGACGG CGAAGGCGGC TTCGAGCTCG AGCGCAGCGT GGGCGGCCTG
GCCACGGGCC TGGGCCCCTT GCACGAGCAG GACGGCAACC TCTGGATCGG CTGGGCCGAC
GCGGATGCGG GCATGGACGA CGACGAGCGC GCCCGGCTGG CCGCGGCGTT CGACGAGCGC
GATTGCCGCG CGGTGTTCTT GGACGACCGC GACGCCGAAG CCTACTACGA GGGCTTCTCC
AACTCGGCCA TCTGGCCGCT GTTCCACGGG TTCCCCCAGT TCACCCGCTT CGACGAGGAA
GAATGGGAGG CATACCGGCG CGTGAACGAA CGGTTCTGCG AGGCAGCACT GGCAGAAGCG
CGCCCGGGCG ACACGCTGTG GATCCAGGAC TACCACCTCA TGCTGCTGCC GTCGATGCTG
CGAGAGGCAC TGCCGGATGC GTCCATCGGG TTCTTCCTGC ACATCCCCTT CCCCGACTAC
GAGACGTTCC GCACGCTGCC GTGGCGCGAC GAGATCGTCC GCGGCGTGCT GGGAGCCGAC
CTCATCGGCT TCCATGCCTA CGACTACGTG CGCCATTTCC TGTCCAGCTG CCGGCGCGTG
GCGGGCATCG AGAACACGAG CGGCACGCTC ACGGTTGACG GACGCGTGGT GCAGGTGGAC
GCCTTCCCGC TCGGCATCGA CTACGCGCGC TTCCGCGACG CCGCACGCAC ACCGGAAGTG
CAAGCAGCAG TGGAAACGCT CGCGGCCGAG AAGGGACACG AAGGTTGCAA GGTGATGCTG
TCGGTGGAAC GGCTGGACTA CTCGAAGGGC ATTCCCGAGC GGCTGGACGC CTTCGACGCG
TTCCTCGACA AGCACCCCGA GTGGAAGGGT CGCGTGGTGC TCATGCTGGT CACCGTGCCG
TCGCGCGAGA ACGTGGCGTC GTACCGCGCG CTCAAGAAGC GCATCGACGA GCTGGTGGGC
CAGGTGAACG GCAAACACTC AACGATGGAC TGGACGCCTG TGGACTACTA CTACCGCTCG
TTGCCGTTCG AGCAGCTGGT CAGCCTGTAC GCCGCGAGCG ACGTGATGCT GGTCACCCCG
CTGCGCGACG GCATGAACCT CGTATGCAAG GAGTACCTGG CCTGCCACGA CGGCGACGGA
GGCGTGCTGG TGCTGTCGGA GATGGCCGGC GCGTCCTACG AGCTGCACGA GGCTCTGTGC
GTGAACCCGT TCGACCGCGC GGGCATCGCC CGTGCCATGC AGGAGGCGCT CACCATGCCA
CCCGACGAGC AGCGCAAGCG CAACGCACCC ATGCAACAGC GCCTGGCACG CTACACGTCG
AAGAAGTGGG CGCGCGAGTT CCTCGATGCC GTGGCCGACG TGAAGCGCCG CCAGGCGGGC
ATGAGCGCGC ATCTGCTGGG CCCCACGTCG GCCGGCCGGC TGCTGGAGGC GTACCGCCGC
GCGGATCGGC GCGCGCTGCT GCTGGACTAC GACGGCACGC TCATGCCCTT CTCGGACGAC
CCCGCGCGCG TCGCGCCCGA CGAACGGCTG CTGGACGTGC TGGCGCGCCT GGGCGGCAGC
GCGGACAACG ACGTGGTGGT GGTGTCGGGG CGCGACCACG CCACGCTCGA GGCATGGCTG
GGCCGGCTGC CCGTCGACCT CGTGGCCGAG CACGGCGTTT GGTTCGCCGC GCAAGCGGAC
GGAGGCACGG CGAGCGGGCG GGCATGGACG CTGCAGGAAC CGCTCGACAA CAGCTGGAAG
GACGCCATCC GCCCCGTGCT GGCCGACTTC GTCGACCGCA CGCCCGGATC GCTGCTGGAA
GAGAAGGACT ACTCGCTGGT GTGGCATTAC CGCATGTGCA GCCAAGAGCT GGCCGAACGG
CGCGTCATCG AGATCAGATG CGCACTGGGA GACGGCCTGG CCGACCGCGG CATCGCGCTC
ATGGACGGCA ATAAGGTGAT CGAGGTGAAG CCGCGCGGCG TCGACAAGGG GCACGCGGCC
CACCGCTGGT TCCGCGACCC CGCCTACGGC TTCTTGCTGG CGGCCGGCGA CGACCGCACC
GACGAGGACG TGTTCGAAGC CGCGCCCGAC GACGCGTGGA CCGTCAAGAT AGGCGGCGGC
CCCACCCGCG CCCGCTTCGC GCTCAAGAAC AGCGCCGAGA TGCGCCAGCT GCTGGAAGCG
ATGGCCGAAG CCGAGCCGGT CCGCTAG
 
Protein sequence
MPNTHSTAAT LLRDTPLPLP LARIDRPREE SAAARRAQHD ARSVVAPEAG RLIIVSNRLP 
YRFEDDGEGG FELERSVGGL ATGLGPLHEQ DGNLWIGWAD ADAGMDDDER ARLAAAFDER
DCRAVFLDDR DAEAYYEGFS NSAIWPLFHG FPQFTRFDEE EWEAYRRVNE RFCEAALAEA
RPGDTLWIQD YHLMLLPSML REALPDASIG FFLHIPFPDY ETFRTLPWRD EIVRGVLGAD
LIGFHAYDYV RHFLSSCRRV AGIENTSGTL TVDGRVVQVD AFPLGIDYAR FRDAARTPEV
QAAVETLAAE KGHEGCKVML SVERLDYSKG IPERLDAFDA FLDKHPEWKG RVVLMLVTVP
SRENVASYRA LKKRIDELVG QVNGKHSTMD WTPVDYYYRS LPFEQLVSLY AASDVMLVTP
LRDGMNLVCK EYLACHDGDG GVLVLSEMAG ASYELHEALC VNPFDRAGIA RAMQEALTMP
PDEQRKRNAP MQQRLARYTS KKWAREFLDA VADVKRRQAG MSAHLLGPTS AGRLLEAYRR
ADRRALLLDY DGTLMPFSDD PARVAPDERL LDVLARLGGS ADNDVVVVSG RDHATLEAWL
GRLPVDLVAE HGVWFAAQAD GGTASGRAWT LQEPLDNSWK DAIRPVLADF VDRTPGSLLE
EKDYSLVWHY RMCSQELAER RVIEIRCALG DGLADRGIAL MDGNKVIEVK PRGVDKGHAA
HRWFRDPAYG FLLAAGDDRT DEDVFEAAPD DAWTVKIGGG PTRARFALKN SAEMRQLLEA
MAEAEPVR