Gene Elen_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1944 
Symbol 
ID8416251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2279719 
End bp2281446 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content66% 
IMG OID645024917 
Productarginyl-tRNA synthetase 
Protein accessionYP_003182297 
Protein GI257791691 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.567724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.240014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTC GCGAACAACT TGAACAGCTG ATCGACGCGG CCGTCGCGGC CGCGTGCGAG 
GACGGAACGC TCACGCTCGA GCAGGCTCCC GAGGCGGCTC TCGAGCGCCC GCGCGACGAG
AGCAACGGCG ACTGGGCGTC CACCGTCGCC ATGCGTTCGG CAAAGCTTGC CAAGAAGAAT
CCTCGCGAGA TCGCTCAGAT CATCGTCGAC CACCTGCCCG AGAACGACAT GATCGCTTCC
GTCGACATCG CCGGCCCCGG CTTCATCAAC ATTCGTCTGG CGAACGCCGT CCTGCAGGGC
GTTGTGGCGG CGGCTCGCGC CGAGAAGGAC GACTTCGGCA AGGGCGAGAT TCCCGAGGGC
GAGCGCAAGA TCAACCTGGA GTACATCTCG GCGAACCCCA CCGGCCCCTT GCACGTGGGC
CATGGCCGCT GGGCCGCGCT GGGCGATGCC ACGGCGCGCG TCATGCGCCA TGCGGGCTAC
GACGTGTTCG AAGAGTTCTA CATCAACGAT GCCGGCACGC AGATGGACAA CTTCGGCGAG
TCGGTGGCCG TGCGCTACCA GCAGCTGCTG GGCCGCGACG TGGAGATGCC CGAGGCGTGC
TATGCAGGCT CTTACGTGAA GGATATCGCG CAGACCATCA TCGATGAGGA CGGCGACAAG
TGGCTCGATG CCGACCCGAA GGAGCGCATG GAGAACTTCC GCGAGCGCGC CTATGCCTAC
GAGTTGGCCG AGCAGCACCG CGTCACCGAG CGGTTCGGCA CCACCTTCGG ATGCTGGTTC
TCCGAGCGCT CGCTGTACGT GCCCGATGAG GACGGCTTGA GCGCGGTGGA CCGCAGCCTC
AAGGCCATGG ACGAGAAGGG CTACATCTAC GTCGAGGACG GCGCCACCTG GTTCAGGTCC
AGCGCGTTCG ACGACGAGAA GGATCGCGTG CTCATCAAGG CCAACGGCGA GATGACGTAC
TTCATGAGCG ACGTGGCGTA CCACTACAAC AAGATGGAGC GCGGCTTCGA CCACCTCATC
AACATCTGGG GCGCCGACCA CCACGGCTAC ATCGCCCGCT GCGAGGCCAT GCTGGCCGCG
TGGGGCTGGC CCGGCGCGCT CGAGATCATG CTCGGCCAGC TGGTGAACCT GTTCCGCGAC
GGCGAGGCCG TGCGCATGTC GAAGCGCACG GGCGAGATGA TCACGTTCGA GGAGCTCATC
GACGAGGTGG GCGTGGACGC CACCCGCTAC CTCATGCTGG CGAAGTCCTC CGACCAGCCC
ATCGACTTCG ACATCGAGGT GGCGAAGAAG AAGGACGCGT CGAACCCGGT GTACTACGTG
CAGTACGCGC ACGCGCGCAT CTGCTCGATC CTGCGCAAGG CGGCCGACCC GGCCGATGCC
GAAGCGGCCG CGAACGGCGA CATGTCGATG GACGAGCTGG CCTCGAAGGT GATCCCGGCG
AACGTGGACC TCTCGCCGCT CACGCACGAG TCCGAGCTGG CGCTGATGCG CAAGATGGAC
GACTTCGGCC CGCTCGTGGC TCAGGCGGCT CGCGACCGCG CGGCGTTCCG TTTGACGCAC
TACGCTCAGG ATCTGGCCTC GCTGTTCCAT TCGTTCTACA CGAACTGCCA CGTCATCGGC
GAGGAGGAGG CTGTGACGAA CGCGCGTCTT GCCCTCGTGG ACGCCACGCG CATCGTGCTG
GCCAAGACGC TGGACCTGCT GGGCGTTTCC GCTCCGGCCA AGATGTAG
 
Protein sequence
MQIREQLEQL IDAAVAAACE DGTLTLEQAP EAALERPRDE SNGDWASTVA MRSAKLAKKN 
PREIAQIIVD HLPENDMIAS VDIAGPGFIN IRLANAVLQG VVAAARAEKD DFGKGEIPEG
ERKINLEYIS ANPTGPLHVG HGRWAALGDA TARVMRHAGY DVFEEFYIND AGTQMDNFGE
SVAVRYQQLL GRDVEMPEAC YAGSYVKDIA QTIIDEDGDK WLDADPKERM ENFRERAYAY
ELAEQHRVTE RFGTTFGCWF SERSLYVPDE DGLSAVDRSL KAMDEKGYIY VEDGATWFRS
SAFDDEKDRV LIKANGEMTY FMSDVAYHYN KMERGFDHLI NIWGADHHGY IARCEAMLAA
WGWPGALEIM LGQLVNLFRD GEAVRMSKRT GEMITFEELI DEVGVDATRY LMLAKSSDQP
IDFDIEVAKK KDASNPVYYV QYAHARICSI LRKAADPADA EAAANGDMSM DELASKVIPA
NVDLSPLTHE SELALMRKMD DFGPLVAQAA RDRAAFRLTH YAQDLASLFH SFYTNCHVIG
EEEAVTNARL ALVDATRIVL AKTLDLLGVS APAKM