Gene Elen_1983 details


Gene Information

Locus tag: Elen_1983
Symbol:
ID: 8416294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2323610
End bp: 2326879
Gene Length: 3270 bp
Protein Length: 1089 aa
Translation table: 11
GC content: 67%
IMG OID: 645024960
Product: Non-specific serine/threonine protein kinase
Protein accession: YP_003182336
Protein GI: 257791730
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
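
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2323610, 2326879)
print(length_bp)                  # 3270
print(protein_length(length_bp))  # 1089
```

Both values agree with the Gene Length (3270 bp) and Protein Length (1089 aa) fields of this record.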


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTATCCG AAGACGACAT CAGCAGGCAC TGCCATGCGC GCACGTTGCA ACGGGCACGG 
TCCATCGCAG CCTCCGACCG CAACATCCTC ACAAAGCAGG TCCGCTACAA CCCTCCCGAG
ACCACGCTGT CCGCGTTCGT TGCCAGCAGC AGCGGCTGGA ACGACCGTTA CCGCACCTCC
GTCACCTTCG ACGAGGACGA AGGCGACCTG GTCGACTACG CCTGCACCTG CCCCGCCTAC
CGCGAGTATG ACGGCATGTG CAAGCACTGC GTGGCTCTGG CGTTGACGTA TCTGGACGCG
CCTGAGAAGT TCATGGGCTA CCGAGCCCAT CGCGCGCCGA CCACGTCTTC ATGCCTGCTC
GAGCTCATGG AGCGCAGCAA AGCCGCTGCA GAAGCTGAGG AACAGGGCGG CATCGATCTG
GAAGCAACCG TCGTGTACGG CTATCGGTCG TGGTCGGCGC ACTTCAAGGT CGTCGGCCCC
CAGGGCTCCT ACGTCATGAA GAGCATCTCG GACTTCGTCG GCCGCATGCG GCGCGGCGAG
CGGTTCTCCT ACGGCAAGAA GCTCGCGTTC ACGCACGTCC CGGCGATGCT CGCCGAGTCG
GCGCGCCCCA TCGCCCGGTT CCTCGACCGC GCCGTGGCGC TGCGCGAGCA GGCCACCGGC
AGCGCGTTTT GGCGCTACCG CGGACGCGAC GAGATCGGAC GCGACCTCGA CCTGTCGGAC
TACGAGCTCA TCGAGCTGTT GGATCTGCTG GACGGCCGCC CCTTCACCGT CGAAGGCACC
GACTACGGAA CGCGCTCCCT CACCCGCGCG CATGTCGTCA GCGCCGACCC CGATGTCGAG
GTCTCGGTGC GGCGCACCGA CGACGGCGGC TACGCCATCG AAGCCGACGA GCTGCCTTTC
GTGGCGCAGG GCGACCGCAT GTACCTGTGG CAGGGCGAGA CGTTCTTCCG CTGCTCGGCC
GACTTCGCGC GCACGGCGGG CTTCCTGCGC ACCGTGTACG AGAACGACGA TGCCCGCCTG
TTCGTCAGCC TCGCCGACAT GCCCCTGTTC TGCGCCACCG TGCTGCCCGC CATCGAGAAG
CGCCTCCATG TGGAAACGCC GACCGAGATC GGGCTGTTCC GCCCCGTGCC GTGCAAGCTG
GAGTTCTACT TCGACAAGAC CGACCGCGAC GTCACCTGCG ACGCGCAAGC CGTGTACGGC
GAACGCCGCT ATCCGCTGCT CGACTCCCCC GCTCGCGACG AGGCGGGCCC CCTGCGCGAC
GAGAAGCTGG AGGGTCGCGC GAGGCGGCTG GTCAAGCAGT ATTTCGACAC ATTAGAAGCT
CCCCCATCCA TCATGCTGAG CGACGAGACG GCGGTGGCCG ACCTCGTGTT CGGCGGCCTT
GTCCAGTTCC AGGCGCTGGG CCAGGCGTTC ACCACGCCCG CGTTCGACCG GCTGCTCGTA
GACAAGAAGC CACGCATCTC GGTGGGCATC TCGCTGGCGG GCAACCTCAT CAACCTGGCC
GTGTCCACCG ACGACCTGCC CCCGGCCGAG GTGGCGGCGC TGCTGGCGAG CTACCGGCGG
CGCAAGCGCT TCCACCGTCT GAAGAGCGGT GCGTACCTCG ATCTGACGGA GTACGACCTC
GCTCAGCTCG ACCACCTTGC GGAGGACTTC GGCTTCACGC CCAAACAGCT GGCCGCAGGC
GCGGTGGAGC TTCCCGCCTA CCACGCGTTC TACCTCGACG AGCAGTTCAA GGGCGCAGAG
CGCAACCGCT CGTTCATGCG CTACCTGGAA AGCTTCCGGG CATCGGCCGG CGAGCCGTGC
CCGGTGCCCG ACCAGCTGGC CGCCACGCTG CGCCCCTACC AGGCGGAAGG GCTGCGCTGG
ATGAGCGCGT TGGCCGACCG CGATCTCGGC GGCATCCTGG CCGACGAGAT GGGCCTGGGC
AAGTCGGTGC AGCTCATCGC GTTTCTGCTG GCACGCCAGA GCGAGGCGCG CGCCGTTGGG
CCCAGCCTCA TCGTGTGCCC CGCCTCCCTC GTGTACAACT GGATGGCCGA GTTCGAGCGC
TTCGCCCCGA CCCTAGACGT GCGCGCCGCG GTGGGCGCCA AGCGCGAGCG CATGCGCATC
CGCGCCGAGG CGTGCGAGAG AGACGCACGC GAAAGCGAGC TTGCCCGCGA CGGGCGCTGC
TGCGACGTGC TGATCACCTC CTACGACCTG CTGCGCATCG ACGCGGAGGA CTTCGCCCGG
CGCGAATTCT ACTGCTGCGC GCTCGACGAG GCGCAGTACG TGAAGAACCA CGCCACCAAG
ACGGCGCGCG CCGCGAAGCG CGTGCGGGCG CGCCACCGCT TCGCGCTGAC CGGAACGCCG
ATGGAGAACC GTCTGAGCGA GCTGTGGAGC ATCTTCGACT TCCTCATGCC CGGGCTTCTG
GGATCCTACA TGCGCTTCCG CGAGCACTTC GAGCTGGACA TCACCGGAGG CGACGAGGAC
GCCGCCCGCC GTCTGCGCTC CCTCGTGGCA CCGTTCATGC TGCGCCGCTT GAAGGCCGAC
GTGCTGCAAG ACTTGCCCGA CAAGCTGGAA TCGGTGGTGT ACGTCCCCAT GGAGGCCGAG
CAGCAGCGCC TGTACGCCGC CCACGAGCAG CAGCTGCGCG ACGCGCTGAC CCTGCAGAAG
AACAACCGCA ACAACAAGCA GTTCCACGAG CGCAAAGTGG AGGTGCTCGC CGAGCTGACG
AAGCTGCGCC AGCTGTGCTG CGACCCGCGC CTTCTGTACG AGAACTACGC CGGGCACGCG
GCAAAGCTGG ACGCCATCGC AGAGATCGTG GAATCGGCCA TGGACGCCGG CGAGAAGACG
CTTGTGTTCT CCCAGTTCAC AAGCTTCCTT TCGCTGATCG CCGAAGTGCT GGACGCGCAC
GGCGTGCCCT ACTTCACCAT CACGGGAACC ACGCCGAAGA AGCGCCGGCT CGATCTGGTG
AACGCGTTCA ACGACGACGA CACGCCCGTG TTCCTCGTGT CGCTCAAAGC GGGTGGCACC
GGGCTCAACC TCACCGGCGC GTCGGTGGTG GTGCACGCCG ACCCCTGGTG GAACGCCGCT
GCGCAGAACC AGGCCACCGA CCGCGCGCAC CGCATCGGCC AGACGCAGGT GGTGAGCGTC
CACAAGGTCA TCGCGAAGGA CACCGTCGAA GAGCGCATCC TGCATTTGCA GGATGCGAAG
ACCGACCTCG CCGACCAGGT GATCGGCGCC GGCGGCGTGT CGCTGGCGAG CCTGAGCCAG
GAGGAGCTGC TCGATTTGCT GGACGGATGA
 
Protein sequence
MLSEDDISRH CHARTLQRAR SIAASDRNIL TKQVRYNPPE TTLSAFVASS SGWNDRYRTS 
VTFDEDEGDL VDYACTCPAY REYDGMCKHC VALALTYLDA PEKFMGYRAH RAPTTSSCLL
ELMERSKAAA EAEEQGGIDL EATVVYGYRS WSAHFKVVGP QGSYVMKSIS DFVGRMRRGE
RFSYGKKLAF THVPAMLAES ARPIARFLDR AVALREQATG SAFWRYRGRD EIGRDLDLSD
YELIELLDLL DGRPFTVEGT DYGTRSLTRA HVVSADPDVE VSVRRTDDGG YAIEADELPF
VAQGDRMYLW QGETFFRCSA DFARTAGFLR TVYENDDARL FVSLADMPLF CATVLPAIEK
RLHVETPTEI GLFRPVPCKL EFYFDKTDRD VTCDAQAVYG ERRYPLLDSP ARDEAGPLRD
EKLEGRARRL VKQYFDTLEA PPSIMLSDET AVADLVFGGL VQFQALGQAF TTPAFDRLLV
DKKPRISVGI SLAGNLINLA VSTDDLPPAE VAALLASYRR RKRFHRLKSG AYLDLTEYDL
AQLDHLAEDF GFTPKQLAAG AVELPAYHAF YLDEQFKGAE RNRSFMRYLE SFRASAGEPC
PVPDQLAATL RPYQAEGLRW MSALADRDLG GILADEMGLG KSVQLIAFLL ARQSEARAVG
PSLIVCPASL VYNWMAEFER FAPTLDVRAA VGAKRERMRI RAEACERDAR ESELARDGRC
CDVLITSYDL LRIDAEDFAR REFYCCALDE AQYVKNHATK TARAAKRVRA RHRFALTGTP
MENRLSELWS IFDFLMPGLL GSYMRFREHF ELDITGGDED AARRLRSLVA PFMLRRLKAD
VLQDLPDKLE SVVYVPMEAE QQRLYAAHEQ QLRDALTLQK NNRNNKQFHE RKVEVLAELT
KLRQLCCDPR LLYENYAGHA AKLDAIAEIV ESAMDAGEKT LVFSQFTSFL SLIAEVLDAH
GVPYFTITGT TPKKRRLDLV NAFNDDDTPV FLVSLKAGGT GLNLTGASVV VHADPWWNAA
AQNQATDRAH RIGQTQVVSV HKVIAKDTVE ERILHLQDAK TDLADQVIGA GGVSLASLSQ
EELLDLLDG
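
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The Python sketch below is one possible way to strip the whitespace, compute GC content, and run basic consistency checks between a gene sequence and its protein sequence. The helper names are hypothetical and are not part of the source database. The stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def check_cds(gene_text: str, protein_text: str) -> dict:
    """Basic sanity checks relating a CDS to its translated protein."""
    gene = clean_sequence(gene_text)
    protein = clean_sequence(protein_text)
    return {
        "first_codon": gene[:3],
        "length_multiple_of_3": len(gene) % 3 == 0,
        "ends_with_stop": gene[-3:] in STOP_CODONS,
        # One codon per residue, plus the trailing stop codon.
        "codon_count_matches_protein": len(gene) // 3 - 1 == len(protein),
    }


# Toy input in the same block layout as the record above.
print(gc_fraction("GGCC AATT"))        # 0.5
print(check_cds("ATGGGA TGA", "MG"))
```

To check this record, pass the full Gene sequence and Protein sequence blocks to `check_cds`. The result of `gc_fraction` on the gene sequence can be compared with the GC content field.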