Gene Elen_1907 details


Gene Information

Locus tag: Elen_1907
Symbol:
ID: 8416211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2237638
End bp: 2239551
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 67%
IMG OID: 645024877
Product: threonyl-tRNA synthetase
Protein accession: YP_003182260
Protein GI: 257791654
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
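
The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal Python sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields above.
start_bp = 2237638
end_bp = 2239551

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 1914 bp and 637 aa, matching the Gene Length and Protein Length fields.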


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0000167344
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACATCG TACTTCCCGA CGGCTCGAAC AAGGGGCTCG CAGACGGCTC CACCGTCGCC 
GACGTGGCCG CGTCCATCGG CGCCGGCCTG GCGAAGGCCG CGCTCGCGGG CATCGTGAAC
GACCAGCCGG TCGATCTCTC GGCTCCCGTG GCCGAAGGCG ACTCGGTTGC CATCGTCACG
GCCAAGAGCG ACGAGGGGCT GGAGCTTCTG CGCCACTCCA CCGCGCACGT CATGGCCGCC
GCGCTCGTCG ACCTCTACGG CGACGTGCAG TTCGGTGTGG GGCCGGCCAT CGAGGACGGT
TTCTACTACG ACGTGAAGCT CGACCGCGCG CTGTCGCCGG ACGACTTTGC CGCCATCGAG
GCCCGCATGG CCGAGATCGT GAAGGCCGAC GAAGCCTTCG AGCGCCGCGT GGTCACGCGC
GCCGAGGCCG AGGAGATCTT CGCGAGCCAG CCGTTCAAGC TGGAGCTCAT CGCCGAGCTG
CCGGAGGACG CCGTCATCAC CACGTACACC ATCGGCAAGT TCACCGACCT GTGCCGCGGC
CCGCACCTGC CCTCTACCGG CAAGCTGGGC GCGTTCAAGT TGACCAAGCT GGCCGGCGCG
TACTGGCGCG GCGATGCCGA GCGCGAGATG CTCACCCGCA TCTACGGCAC CGCCTTCTTC
AAGCAGAAGG AGCTGGACGA GCACCTGCGC AACCTGGAGG AGGCCGAGAA GCGCGACCAT
CGCAAGCTGG GCCGTGAACT GGGGATCTAC ACCATGGACC CGCTGGCCGG CGTGGGTCTG
CCCATGTACC TGCCCAAGGG CGCGCGCGTG ATCCGCACCA TGCAGGAGTG GCTGCGCCGC
GACCTGTACG AGCGCGGCTA CGAGGAGGTC ATCACGCCGC ACGTGTACAA CGCCGACGTG
TGGAAGACCT CGGGCCACTA CGGCTTCTAC AAAGAGAACA TGTACTTCTT CAACATCAAC
GAGGGCACCG ACGAGGATCC GCGCCTCACC GAGTACGCCG TGAAGCCCAT GAACTGCCCG
GGCCACGTGA TGCTGTACAA GAGCGAGCTG CATTCCTACC GCGACCTGCC GCTGCGCTAC
TTCGAGTTCG GCACCGTGTA CCGCCACGAG ATGAGCGGCG TCGTGCACGG CCTGCTGCGC
GCCCGCGGCT TCACTCAGGA CGACGCGCAC GTGTTCTGCA CGCGCGACCA GGTAGTGGAC
GAGGTGGTGG CCATCCTCGA CCTCGTGGAC CACATCATGT CCACGTTCGG GTTCCAGTAC
GAGGCCGAGA TCTCCACGCG CCCGGAGAAG AGCATCGGAA CCGACGACAT GTGGGAGCAC
GCCACCAACG CGCTCAAGGA AGCCTGCGCG CGCCACGAGC TGGCCTACGA CATCAACGAG
GGCGACGGCG CGTTCTACGG CCCGAAGATC GACATCAAGG TGAAGGACGC CATCGGGCGC
ACCTGGCAGT GCTCCACCGT GCAGGTGGAC TTCAACATGC CCGAGCGCTT CGAACTGACC
TACCGCACCG AGGACAACAC CGAGGAGCGC CCCTGGATGC TCCATCGCGC CATCTTCGGC
TCCATCGAGC GCTTCCTCGG CATCCTCATC GAGCACTACG CCGGGGCGCT GCCGCTGTGG
CTGGCCCCCG TGCAGGTGGC CGTGCTGCCG CTGGCCGACC GTCACAACGA GGCCGCGGCC
GAGCTGGCGA AGCAGCTGAA GGCCGCGGGC GGCCGCATCG AGGTGTACGA CCAGAACGAG
CCCATGCGCG TGAAGATCGC GAAGGCGCAA AGCCAGAAGA TCCCGTACAT GGTGGTGCTG
GGCGACAAGG AGATCGAGAA CGGCACGGTC AGCGTGCGCG AGCGCCACGA GGGCGACCTC
GGCGCGTGGC CGGTCGAGCA GCTGGTGGAG AAGCTGCGCG AGGCCGCGCT GTAG
 
Protein sequence
MNIVLPDGSN KGLADGSTVA DVAASIGAGL AKAALAGIVN DQPVDLSAPV AEGDSVAIVT 
AKSDEGLELL RHSTAHVMAA ALVDLYGDVQ FGVGPAIEDG FYYDVKLDRA LSPDDFAAIE
ARMAEIVKAD EAFERRVVTR AEAEEIFASQ PFKLELIAEL PEDAVITTYT IGKFTDLCRG
PHLPSTGKLG AFKLTKLAGA YWRGDAEREM LTRIYGTAFF KQKELDEHLR NLEEAEKRDH
RKLGRELGIY TMDPLAGVGL PMYLPKGARV IRTMQEWLRR DLYERGYEEV ITPHVYNADV
WKTSGHYGFY KENMYFFNIN EGTDEDPRLT EYAVKPMNCP GHVMLYKSEL HSYRDLPLRY
FEFGTVYRHE MSGVVHGLLR ARGFTQDDAH VFCTRDQVVD EVVAILDLVD HIMSTFGFQY
EAEISTRPEK SIGTDDMWEH ATNALKEACA RHELAYDINE GDGAFYGPKI DIKVKDAIGR
TWQCSTVQVD FNMPERFELT YRTEDNTEER PWMLHRAIFG SIERFLGILI EHYAGALPLW
LAPVQVAVLP LADRHNEAAA ELAKQLKAAG GRIEVYDQNE PMRVKIAKAQ SQKIPYMVVL
GDKEIENGTV SVRERHEGDL GAWPVEQLVE KLREAAL
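
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, self-contained Python translator. The function name `translate` and the compact codon-table layout are illustrative choices. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons, which this sketch does not model). Only the first ten and the last three codons of the gene sequence are used.

```python
# Compact codon table: codons are ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)

print(translate("ATGAACATCG TACTTCCCGA CGGCTCGAAC"))  # first ten codons
print(translate("GCGCTGTAG"))                         # last three codons
```

The first call yields MNIVLPDGSN, matching the start of the protein sequence. The second yields AL*, the final two residues followed by the stop codon.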