Gene EcSMS35_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3881 
SymbolglyS 
ID6142855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3948477 
End bp3950546 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content55% 
IMG OID641618707 
Productglycyl-tRNA synthetase subunit beta 
Protein accessionYP_001745846 
Protein GI170682940 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0751] Glycyl-tRNA synthetase, beta subunit 
TIGRFAM ID[TIGR00211] glycyl-tRNA synthetase, tetrameric type, beta subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGA AAACTTTTCT GGTGGAAATC GGCACTGAAG AGCTGCCACC AAAAGCACTG 
CGCAGCCTGG CTGAGTCCTT TGCTGCGAAC TTTACTGCGG AGCTGGATAA CGCTGGCCTC
GCACACGGCA CCGTTCAATG GTTTGCTGCT CCGCGTCGTC TGGCGCTGAA AGTGGCTAAC
CTGGCGGAAG CGCAACCGGA TCGTGAAATC GAAAAACGCG GCCCGGCAAT TGCCCAGGCG
TTCGACGCTG AAGGCAAACC GAGCAAAGCG GCAGAAGGTT GGGCGCGTGG TTGCGGTATT
ACCGTTGACC AGGCTGAGCG TCTGACTACC GATAAAGGCG AATGGCTGCT GTATCGCGCC
CATGTGAAGG GCGAAAGCAC CGAAGCACTG CTGCCGAATA TGGTTGCGAC TTCGCTGGCG
AAACTGCCGA TCCCGAAACT GATGCGTTGG GGCGCAAGCG ACGTGCACTT CGTGCGTCCG
GTTCACACCG TGACCCTGCT GCTGGGCGAC AAAGTTATTC CGGCTACCAT TCTGGGTATT
CAGTCCGATC GCGTGATTCG CGGTCACCGC TTTATGGGCG AGTCGGAATT CACTATCGAC
AATGCCGATC AGTATCCGGA AATTCTGCGT GAGCGCGGGA AAGTCATCGC CGATTACGAA
GAACGTAAGG CGAAGATTAA AGCCGATGCC GAAGAAGCGG CGCGTAAGAT TGGCGGTAAC
GCTGACTTAA GCGAAAGCCT GTTGGAAGAA GTCGCTTCGC TGGTGGAATG GCCGGTTGTG
CTGACCGCAA AATTCGAAGA GAAATTCCTC GCGGTGCCGT CTGAAGCGCT GGTTTACACC
ATGAAAGGTG ACCAGAAATA CTTCCCGGTG TATGCGAACG ACGGCAAACT GCTGCCGAAC
TTTATCTTCG TTGCCAATAT CGAATCGAAA GATCCGCAGC AAATTATCTC TGGTAACGAG
AAAGTCGTTC GTCCACGTCT GGCGGATGCC GAGTTCTTCT TCAACACCGA CCGTAAAAAA
CGTCTGGAAG ATAACCTGCC GCGCCTGCAA ACCGTGTTGT TCCAGCAACA GCTGGGTACA
CTGCGCGACA AAACTGACCG CATCCAGGCG CTGGCTGGCT GGATTGCTGA ACAGATTGGC
GCTGACGTTA ACCACGCAAC CCGTGCGGGC CTGCTGTCCA AGTGCGACCT GATGACCAAC
ATGGTCTTCG AGTTCACCGA CACCCAGGGC GTTATGGGGA TGCACTACGC GCGTCACGAT
GGCGAAGCGG AAGATGTTGC CGTGGCGCTG AACGAGCAGT ATCAGCCGCG CTTTGCCGGT
GATGACCTGC CGTCTAACCC GGTAGCCTGT GCGCTGGCGA TTGCTGACAA GATGGATACT
CTGGCGGGTA TCTTCGGTAT CGGCCAGCAT CCGAAAGGCG ACAAAGACCC GTTTGCGTTG
CGTCGTGCCG CACTTGGCGT GCTGCGTATT ATCGTTGAGA AGAACCTCAA TCTTGACCTG
CAAACCCTGA CCGAAGAAGC AGTGCGTCTG TATGGCGATA AGCTGACTAA TGCCAACGTG
GTTGATGATG TTATCGACTT TATGCTCGGT CGCTTCCGCG CCTGGTATCA GGACGAAGGT
TACACCGTTG ACACCATCCA GGCGGTACTG GCGCGTCGTC CGACTCGTCC GGCTGATTTC
GATGCCCGAA TGAAAGCGGT ATCGCACTTC CGTACCCTGG ATGCAGCTGC AGCACTGGCG
GCGGCGAACA AGCGTGTATC TAACATTCTG GCGAAATCTG ACGAAGTGCT GAGCGACCGC
GTGAATGCCT CTACTCTGAA AGAGCCGGAA GAAATTAAAC TGGCGATGCA GGTTGTGGTG
CTACGTGACA AGCTAGAGCC GTACTTTGCT GAAGGTCGTT ACCAGGATGC GCTGGTCGAA
CTGGCTGAGC TGCGTGAACC GGTTGATGCC TTCTTCGATA AAGTGATGGT CATGGTTGAT
GACAAAGAAT TGCGAATCAA CCGTCTTACC ATGCTGGAGA AACTGCGCGA ATTATTCCTG
CGAGTTGCGG ATATTTCGCT GTTGCAGTAA
 
Protein sequence
MSEKTFLVEI GTEELPPKAL RSLAESFAAN FTAELDNAGL AHGTVQWFAA PRRLALKVAN 
LAEAQPDREI EKRGPAIAQA FDAEGKPSKA AEGWARGCGI TVDQAERLTT DKGEWLLYRA
HVKGESTEAL LPNMVATSLA KLPIPKLMRW GASDVHFVRP VHTVTLLLGD KVIPATILGI
QSDRVIRGHR FMGESEFTID NADQYPEILR ERGKVIADYE ERKAKIKADA EEAARKIGGN
ADLSESLLEE VASLVEWPVV LTAKFEEKFL AVPSEALVYT MKGDQKYFPV YANDGKLLPN
FIFVANIESK DPQQIISGNE KVVRPRLADA EFFFNTDRKK RLEDNLPRLQ TVLFQQQLGT
LRDKTDRIQA LAGWIAEQIG ADVNHATRAG LLSKCDLMTN MVFEFTDTQG VMGMHYARHD
GEAEDVAVAL NEQYQPRFAG DDLPSNPVAC ALAIADKMDT LAGIFGIGQH PKGDKDPFAL
RRAALGVLRI IVEKNLNLDL QTLTEEAVRL YGDKLTNANV VDDVIDFMLG RFRAWYQDEG
YTVDTIQAVL ARRPTRPADF DARMKAVSHF RTLDAAAALA AANKRVSNIL AKSDEVLSDR
VNASTLKEPE EIKLAMQVVV LRDKLEPYFA EGRYQDALVE LAELREPVDA FFDKVMVMVD
DKELRINRLT MLEKLRELFL RVADISLLQ