Gene EcSMS35_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2552 
SymbolgltX 
ID6144265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2610388 
End bp2611803 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content53% 
IMG OID641617423 
Productglutamyl-tRNA synthetase 
Protein accessionYP_001744592 
Protein GI170679600 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0008] Glutamyl- and glutaminyl-tRNA synthetases 
TIGRFAM ID[TIGR00464] glutamyl-tRNA synthetase, bacterial family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000112104 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA AAACTCGCTT CGCGCCAAGC CCAACAGGCT ATCTGCACGT TGGCGGCGCG 
CGTACTGCTC TTTACTCTTG GCTTTTTGCA CGTAACCACG GCGGTGAGTT CGTGCTGCGT
ATTGAAGACA CCGATCTTGA GCGTTCCACG CCGGAAGCTA TCGAAGCCAT TATGGATGGC
ATGAACTGGC TGAGCCTGGA GTGGGATGAA GGTCCGTACT ACCAGACCAA ACGTTTTGAT
CGCTACAATG CGGTGATCGA TCAGATGCTG GAAGAGGGCA CTGCTTATAA ATGCTATTGC
TCTAAAGAGC GCCTGGAAGC GCTGCGCGAA GAGCAAATGG CGAAAGGTGA GAAGCCGCGT
TATGACGGTC GCTGCCGCCA CAGCCATGAG CATCACGCTG ATGATGAACC GTGTGTCGTG
CGTTTTGCTA ACCCGCAGGA AGGTTCTGTT GTTTTTGACG ATCAGATCCG TGGTCCGATC
GAATTCAGCA ACCAGGAGCT GGATGATCTG ATTATCCGTC GTACCGATGG TTCTCCAACC
TATAACTTCT GTGTAGTGGT TGACGACTGG GATATGGAAA TCACGCACGT CATCCGTGGT
GAAGACCATA TCAACAACAC GCCGCGCCAG ATCAACATCC TGAAAGCGCT GAACGCGCCC
GTACCGGTTT ACGCGCACGT TTCTATGATC AACGGTGATG ACGGTAAAAA ACTGTCCAAA
CGTCACGGGG CGGTCAGCGT AATGCAGTAT CGTGATGACG GTTATTTGCC AGAAGCACTG
CTGAACTATC TGGTGCGTCT GGGCTGGTCC CACGGCGATC AGGAAATCTT CACTCGTGAA
GAGATGATCA AATACTTCAC TCTGAATGCC GTCAGCAAAT CTGCCAGTGC GTTCAACACC
GACAAGCTGC TGTGGCTAAA CCATCACTAC ATTAACGCAC TGCCGCCGGA GTATGTGGCT
ACCCACTTAC AGTGGCACAT CGAGCAGGAA AATATCGATA CCCGTAACGG CCCGCAACTG
GCTGATCTGG TAAAACTGCT GGGTGAACGC TGCAAAACGC TGAAAGAGAT GGCACAGACT
TGCCGTTATT TCTACGAAGA TTTTGCTGAG TTCGATGCCG ACGCCGCGAA AAAACATCTG
CGTCCGGTAG CGCGTCAGCC GCTGGAAGTG GTTCGTGACA AACTGGCCGC GATTACTGAC
TGGACCGCTG AAAACGTTCA TCACGCTATT CAGGCGACGG CGGATGAGCT GGAAGTGGGT
ATGGGTAAAG TTGGTATGCC GCTGCGTGTC GCCGTTACCG GTGCGGGGCA GTCTCCGGCA
CTGGACGTTA CCGTTCACGC GATCGGTAAG ACCCGCAGTA TCGAGCGTAT CAACAAAGCG
CTGGCTTTTA TTGCGGAACG CGAAAACCAG CAGTAA
 
Protein sequence
MKIKTRFAPS PTGYLHVGGA RTALYSWLFA RNHGGEFVLR IEDTDLERST PEAIEAIMDG 
MNWLSLEWDE GPYYQTKRFD RYNAVIDQML EEGTAYKCYC SKERLEALRE EQMAKGEKPR
YDGRCRHSHE HHADDEPCVV RFANPQEGSV VFDDQIRGPI EFSNQELDDL IIRRTDGSPT
YNFCVVVDDW DMEITHVIRG EDHINNTPRQ INILKALNAP VPVYAHVSMI NGDDGKKLSK
RHGAVSVMQY RDDGYLPEAL LNYLVRLGWS HGDQEIFTRE EMIKYFTLNA VSKSASAFNT
DKLLWLNHHY INALPPEYVA THLQWHIEQE NIDTRNGPQL ADLVKLLGER CKTLKEMAQT
CRYFYEDFAE FDADAAKKHL RPVARQPLEV VRDKLAAITD WTAENVHHAI QATADELEVG
MGKVGMPLRV AVTGAGQSPA LDVTVHAIGK TRSIERINKA LAFIAERENQ Q