Gene Elen_1340 details

Gene Information

Locus tag: Elen_1340
Symbol:
ID: 8415638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1604181
End bp: 1606190
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 71%
IMG OID: 645024309
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_003181698
Protein GI: 257791092
COG category: [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00197] yjeF N-terminal region
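
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1604181
END_BP = 1606190
GENE_LENGTH_BP = 2010
PROTEIN_LENGTH_AA = 669


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2010
print(coding_codons(GENE_LENGTH_BP))   # 669
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.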


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.774889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTCT ACGTCTCCAA CATCCTGTTC GCCGCGCTCT CGTTTCCGCT GATCGCCTTC 
TTCATCACCC TTCCCTACAT GATCTACCAG TACCGCAGGT TCGGCTCTAT ACCCTGGCTG
CGCACGCTTG TGGTGTACTC GTTCGCGTTC TACCTCTTAT GCGCGTACTT CCTCGTGCTG
CTGCCGTTGC CCGAAGACCG CTCGGCCGTC GTGCCCTACG CGCAGACGCC GCAGCTCGTG
CCGCTCAACT TCGTGCGCGG GTTCCTGGCC GAAACGACGT TCTCGCTGAG CGATCCGTCC
ACCTGGCTGG CCGCGCTGCG CGACCCGTAC GTCTACGAGG CGTTCTTCAA CGTGCTGCTG
CTGGTGCCGC TGGGCATGTA CCTGCGCTAC TATTTCCGCC GCACGTGGTG GCAGACGCTC
GCCATCGGTT TTCTGGTGAC GCTGTCGTTC GAGACGACCC AGCTCACCGG GCTCTGGGGC
CTCTACGAGC ATCCGTACCG CCTGTTCGAC GTCGACGACC TCATGCTGAA CACGCTGGGA
GCGATGATCG GATTCTGGAC GGTTGGGCCC GCGATGCGCG TGCTGCCCGA CATCCGGCTC
GTGAACGAGG AGGCGCGCGA GGCGGGCATG CGCGCCAGCG TGACGAAGCG CGCGCTGTCG
TTCTTCATCG ATCTGGCGAT CACCCTGGCT GCGGCCGGGG CGGCGACGGC CGCAGCCGAA
GCGCTGGGCG CTCGCGCAGC CGTCGAGGCC GCCGGCGCGA GCTGGGGCAC GGCGGTGCAG
GCCGCGGACG CGGTATCGTT CGCAGCGTTC TTCGCGCTGG TTCCCGCACT CACCCGCGGG
CAGACGCTGG CGCAGAAGCT GCTGAGGCTG CGCATCGTGA GGACGGACGC GATCCCGGCG
CGCTGGTACC AGTACCTCGC ACGTTACGGG CTGTTGGCGT TGTTCGGCTG GGCGCCGTTC
GCGCTGCTGT TCGGCGTGCT GGACCTCGAC GCGGCACAGG TCGGCGAGAT GAACGCCCTC
GCGGCGTTCG CCGCCGAACA CCGGGCGGCC GTCGTGGGGG CGTGGACGGC CTTCATGACG
GCCTGGGCCG TGTCGCTGGC CGTGCGCGCC GTGCGAGCCG GCGCGAGGAA GCGGTCGTTC
GTCATGCTGA ACGGCGTGCT GTCGGGAACG CGCGTCATGA CGGAGGCCGG CGTGGAGCTG
GCGCGCGAGC GCCGCGGCGT GCTCGACGTC GACGAGATGG CCGCGCTCGA GCGTGCCGTG
GCCGAGGACG GCACGCCCCT TGCCGAGCTC ATGGACCGGG CGGGGCGTGC CGTGGCCGAC
GAGGTGCGCG CCTGGGTGCC CGACCCGGCG CCGGTCGTGG TGCTTTCAGG ATCGGGCAAC
AACGGCGGCG ACGGGTGGGT GGCGGCCCGC GTGCTGGCGG AGGCCGGCTA CCCCGTGACG
CTGGTCGCCC CCGACCTGGC CGAGCGCCTG CATGCCGAAC CGGCGCGTTC CGCGGCGCTG
GAAACGTTCG CCCGAGCCGC CGAGGACGGC CTTCCGCTGT CCGTGCTCAT CGCGCCCGAC
GCCGACGTGC TGGCCGATGC CGTCGACGAA GCCGAGGCCG TGGTGGACGC GCTGCTGGGC
ACCGGTTTCT CCGGCGGCGA GGTACGCGAG CCGTACGCCG GATGGATACG GGCCGCCAAC
CGCCGGCGCT TCGAGGGGAA GCGCGGCAAG GGCCGCGGGC GGCATCGCAA GCGCACGCAC
GAGCGAGGCG AGCACGAACG GCCGCGGCGC TCGCTGCCTG CGAAAGCCAA GGACGCCCCG
TTCGCCGTGG CGGCAGACGT CCCCAGCGGC CTCTCGGCGC AAACCGGAGC CGCGGCGCGG
CCGACGTTCG CCGCCGATGC TACCGTGACG ATGCTCGCCT ACAAGCCGGG CCTCGTCGCC
TCCGCGGGCG CCCCATGGGT CGGCGCCGTG AAGCTGGCGA AGCTCGGCGT GGACGCATCG
AAGTACTTGG AAGCCGAAGA GCGGGCGTAA
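
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. As a minimal sketch (the helper names are hypothetical), the GC content listed in Gene Information could be recomputed like this:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not the gene itself: 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence as one multi-line string gives a value that can be compared against the GC content field in the record.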
 
Protein sequence
MNVYVSNILF AALSFPLIAF FITLPYMIYQ YRRFGSIPWL RTLVVYSFAF YLLCAYFLVL 
LPLPEDRSAV VPYAQTPQLV PLNFVRGFLA ETTFSLSDPS TWLAALRDPY VYEAFFNVLL
LVPLGMYLRY YFRRTWWQTL AIGFLVTLSF ETTQLTGLWG LYEHPYRLFD VDDLMLNTLG
AMIGFWTVGP AMRVLPDIRL VNEEAREAGM RASVTKRALS FFIDLAITLA AAGAATAAAE
ALGARAAVEA AGASWGTAVQ AADAVSFAAF FALVPALTRG QTLAQKLLRL RIVRTDAIPA
RWYQYLARYG LLALFGWAPF ALLFGVLDLD AAQVGEMNAL AAFAAEHRAA VVGAWTAFMT
AWAVSLAVRA VRAGARKRSF VMLNGVLSGT RVMTEAGVEL ARERRGVLDV DEMAALERAV
AEDGTPLAEL MDRAGRAVAD EVRAWVPDPA PVVVLSGSGN NGGDGWVAAR VLAEAGYPVT
LVAPDLAERL HAEPARSAAL ETFARAAEDG LPLSVLIAPD ADVLADAVDE AEAVVDALLG
TGFSGGEVRE PYAGWIRAAN RRRFEGKRGK GRGRHRKRTH ERGEHERPRR SLPAKAKDAP
FAVAADVPSG LSAQTGAAAR PTFAADATVT MLAYKPGLVA SAGAPWVGAV KLAKLGVDAS
KYLEAEERA
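
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the compact 64-letter layout (bases ordered T, C, A, G) and translates until the first stop codon. It assumes that translation table 11 assigns the same amino acids as the standard genetic code and differs only in which start codons it permits; the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"

# Amino acids for all 64 codons, with each codon position cycling through TCAG.
# Assumption: these assignments apply to translation table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    dna = "".join(blocked.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAACGTCT ACGTCTCCAA CATCCTGTTC"))  # MNVYVSNILF
```

The output matches the first block of the protein sequence. The final codon of the gene, TAA, maps to a stop in this table, which is why the protein listing has one residue fewer than the number of codons.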