Gene EcSMS35_3882 details

Gene Information

Locus tag: EcSMS35_3882
Symbol: glyQ
ID: 6147154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3950556
End bp: 3951467
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 56%
IMG OID: 641618708
Product: glycyl-tRNA synthetase subunit alpha
Protein accession: YP_001745847
Protein GI: 170683810
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0752] Glycyl-tRNA synthetase, alpha subunit
TIGRFAM ID: [TIGR00388] glycyl-tRNA synthetase, tetrameric type, alpha subunit
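
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3950556
end_bp = 3951467
gene_length_bp = 912
protein_length_aa = 303

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons. Assumption: the last codon is the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, residues)  # 912 304 303
```

Under these assumptions the listed gene length (912 bp) and protein length (303 aa) are consistent with each other.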


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.943552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGT TTGATACCAG GACCTTCCAG GGCTTGATCC TGACCTTACA GGATTACTGG 
GCTCGCCAGG GCTGCACCAT TGTTCAACCA TTGGACATGG AAGTCGGCGC GGGAACCTCT
CACCCAATGA CCTGTCTGCG CGCGCTGGGG CCAGAACCGA TGGCGGCTGC TTATGTTCAG
CCTTCTCGTC GCCCGACCGA TGGTCGCTAC GGCGAAAACC CCAACCGTTT ACAGCACTAC
TATCAGTTCC AGGTGGTCAT TAAGCCATCG CCGGACAATA TTCAGGAGCT GTACCTTGGT
TCTCTGAAAG AGCTGGGCAT GGACCCGACC ATCCACGACA TCCGTTTCGT GGAAGATAAC
TGGGAAAACC CGACGCTGGG TGCCTGGGGG CTGGGCTGGG AAGTGTGGCT GAACGGCATG
GAAGTGACGC AGTTCACTTA CTTCCAGCAG GTTGGTGGTC TGGAGTGTAA ACCGGTTACC
GGCGAGATCA CCTACGGTCT GGAACGTCTG GCAATGTACA TTCAGGGCGT AGACAGCGTT
TACGACCTGG TCTGGAGCGA CGGCCCGCTG GGTAAAACCA CCTACGGCGA CGTGTTCCAT
CAGAACGAAG TGGAGCAGTC CACTTACAAC TTCGAATACG CGGATGTGGA TTTCCTGTTC
ACCTGCTTTG AGCAGTACGA GAAAGAAGCG CAGCAGCTGC TGGCGCTGGA AAATCCGCTG
CCGCTGCCAG CCTACGAGCG TATTCTGAAA GCCGCCCACA GCTTCAACCT GCTGGATGCG
CGTAAAGCCA TCTCCGTCAC CGAGCGTCAG CGCTATATTC TGCGCATTCG CACCCTGACC
AAAGCAGTGG CAGAAGCATA CTACGCTTCC CGTGAAGCCC TCGGCTTCCC GATGTGCAAC
AAAGATAAGT AA
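
A GC percentage like the one listed in the record can be recomputed from a sequence in this layout. This is a minimal sketch, not the database's own method: the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G, T plus the whitespace used to group bases in blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove the spaces and line breaks used to group the bases.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above; paste the full sequence
# to compare against the GC content reported in the record.
first_line = "ATGCAAAAGT TTGATACCAG GACCTTCCAG GGCTTGATCC TGACCTTACA GGATTACTGG"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```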
 
Protein sequence
MQKFDTRTFQ GLILTLQDYW ARQGCTIVQP LDMEVGAGTS HPMTCLRALG PEPMAAAYVQ 
PSRRPTDGRY GENPNRLQHY YQFQVVIKPS PDNIQELYLG SLKELGMDPT IHDIRFVEDN
WENPTLGAWG LGWEVWLNGM EVTQFTYFQQ VGGLECKPVT GEITYGLERL AMYIQGVDSV
YDLVWSDGPL GKTTYGDVFH QNEVEQSTYN FEYADVDFLF TCFEQYEKEA QQLLALENPL
PLPAYERILK AAHSFNLLDA RKAISVTERQ RYILRIRTLT KAVAEAYYAS REALGFPMCN
KDK
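
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the pipeline that produced this record. It builds the codon-to-residue mapping from the compact string form of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in the start codons it allows, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the bases.
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence above.
gene_start = "ATGCAAAAGT TTGATACCAG GACCTTCCAG"
gene_end = "AAAGATAAGT AA"
print(translate(gene_start))  # MQKFDTRTFQ
print(translate(gene_end))    # KDK
```

Both fragments match the corresponding ends of the protein sequence listed above, and the gene ends in the stop codon TAA.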