Gene Elen_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2233 
Symbol 
ID8416556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2622808 
End bp2624313 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID645025219 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_003182583 
Protein GI257791977 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.056881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.890291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGAC TCTACAATAC GAAGACCCGC ACGAAGGTCG ACTTCGAGAC GCTCGAGCGC 
GGCAAGGTGG GCATGTACGT GTGCGGACCT ACCGTGTACA ACTACATTCA CATCGGCAAC
GCCCGCACGT TCATCAGCTT CGACGTGATC CGCCGTTACC TTATGTGGCG CGGCTTCGAC
GTGACGTTCG TGCAGAACGT CACCGACGTG GACGACAAGA TCATCGGCAA GTCGCTTGAG
GAAGGCCGCA GCGCTGCCGA GGTGGCCGCC GAGTACACCG AGGCGTTTAT CGAGGACATG
CGCGCGGCCG GCGTGCTGGA TCCCGATATC CGTCCGAAGG CCACCGAGGA GATCCCCGCA
ATGATAGAGC TCATCCAAGA GCTCATCGAC GGAGGCCACG CTTACGATGC CGACGGCGAC
GTGTACTTCA ACGTGCGTTC GTTCCCGGCT TACGGCGAGC TGTCGGGCCG CAACGTAGAC
GAGATGGAGA GCGGCCACCG CGAGCTGCGC GCCGACGGCA AGGGCGTGGA GGATCGCAAG
CGCGACCCGC TGGACTTCGC GCTGTGGAAG GCCGCCAAGC CCGGCGAGCC GGCGTGGGAG
TCGCCCTGGG GCATGGGCCG TCCCGGCTGG CACATCGAGT GCTCGGCCAT GTCGCGCAAG
TACCTGGGGC TTCCCTTCGA CATCCACGGC GGCGGCGCCG ACCTCGTGTT CCCGCACCAC
GAGAACGAGC GCGCGCAGAG CGAGGCCGCG TGCGGCTGCA CGTTCGCGAA CTACTGGATG
CACGGCGGCA TGCTGCAGAT CAACTCCGAG AAGATGAGCA AGTCGCTGGG CAACTTCAAG
CTGCTGCGCG ACGTGCTGAA GGTCACCGAT CCGAAGGTGC TGCGCTTCCT TATGCTGCAG
ACGCACTACC GCAGCCCGCT CGACTTCTCC GACGAGCGCC TGGCCGAGGC GGGCGCGGCG
CTGTGCCGTA TCGAGAACGC GGTGAAGAAC CTCGATTGGC AGCTGCAGAA CGCCCAGGAC
ATCCCCTCGC CGCTGGACAC GCGCGAGCTG ATGAAGCGTA CGAAGGAGGC GAAGCTGGCG
TTCATCCTGG CCATGGACGA CGACTTCAAC ACCTGCAAGG CGCTGGGCGA GGTGTTCGAC
TTCGTGGCCG CGGTGAACGC GCAGACGGCC GACAGGACTA TCTCGCTGTC CGACGTGCCG
CCCGTGCGCG ATGCCCGCGG CGTCATCGTG GAGCTCATGG GCGTGTTCGG CATCGACGTG
GAGGCCGCAT CCGCCTCGTG CGCCGCAGGA GGGTACCCGC CCGAGGTCGT GGGGTTGGCG
GCCGATATCG CGGGCTACGA AGGCGCGGAT GCCGCCGAGG CCGTGGACGC CCTGCTGGCG
GCCCGTGCCG ACGCCCGCGC CGCGAAGGAT TGGAGCCGTG CCGACGCCGT CCGCGACGGC
CTGTGCGGTT TGGGCTTCGT GATCGAGGAC ACCCCGCAGG GTGCGCGCGT GACCTACGAG
GGGTAA
 
Protein sequence
MIRLYNTKTR TKVDFETLER GKVGMYVCGP TVYNYIHIGN ARTFISFDVI RRYLMWRGFD 
VTFVQNVTDV DDKIIGKSLE EGRSAAEVAA EYTEAFIEDM RAAGVLDPDI RPKATEEIPA
MIELIQELID GGHAYDADGD VYFNVRSFPA YGELSGRNVD EMESGHRELR ADGKGVEDRK
RDPLDFALWK AAKPGEPAWE SPWGMGRPGW HIECSAMSRK YLGLPFDIHG GGADLVFPHH
ENERAQSEAA CGCTFANYWM HGGMLQINSE KMSKSLGNFK LLRDVLKVTD PKVLRFLMLQ
THYRSPLDFS DERLAEAGAA LCRIENAVKN LDWQLQNAQD IPSPLDTREL MKRTKEAKLA
FILAMDDDFN TCKALGEVFD FVAAVNAQTA DRTISLSDVP PVRDARGVIV ELMGVFGIDV
EAASASCAAG GYPPEVVGLA ADIAGYEGAD AAEAVDALLA ARADARAAKD WSRADAVRDG
LCGLGFVIED TPQGARVTYE G