Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4598 |
Symbol | lysU |
ID | 6146751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4698166 |
End bp | 4699683 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619414 |
Product | lysyl-tRNA synthetase |
Protein accession | YP_001746526 |
Protein GI | 170682016 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1190] Lysyl-tRNA synthetase (class II) |
TIGRFAM ID | [TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.221304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAC GAGAAACACG GGGAGCCAAT GAGGCTATTG ATTTTAACGA TGAACTGAGA AATCGCCGCG AAAAACTGGC GGCACTACGT CAGCAAGGTG TGGCGTTTCC CAATGATTTT CGCCGCGACC ATACCTCTGA CCAGTTGCAC GACGAGTTTG ATGCGAAGGA TAACCAGGAA CTGGAATCCT TAAACATTGA AGTCTCGGTT GCTGGCCGAA TGATGACCCG TCGTATTATG GGGAAAGCCT CCTTTGTAAC GTTGCAGGAT GTCGGTGGCC GTATTCAACT GTACGTTGCA AGAGATAGCC TGCCAGAAGG TGTTTATAAC GATCAGTTTA AAAAATGGGA TCTGGGTGAC ATTATCGGTG CCCGCGGTAC GCTGTTTAAG ACGCAAACGG GTGAGCTTTC CATTCACTGT ACTGAGCTGC GCCTGCTGAC TAAAGCACTA CGTCCTTTAC CAGATAAATT CCATGGTCTG CAGGATCAGG AAGTCCGTTA TCGTCAACGT TATCTGGACC TCATCGCTAA CGATAAATCC CGTCAAACGT TTGTTGTCCG TTCAAAAATT CTGGCCGCTA TCCGTCAGTT CATGGTCGCG CGCGGCTTTA TGGAAGTAGA AACCCCGATG ATGCAGGTAA TTCCAGGTGG GGCATCTGCT CGCCCGTTTA TTACCCATCA TAACGCTCTG GATTTAGATA TGTACCTGCG TATCGCGCCG GAGCTGTATC TGAAACGTCT GGTTGTAGGC GGTTTTGAAC GGGTATTCGA AATCAACCGT AACTTCCGTA ATGAAGGTAT TTCTGTTCGC CATAATCCTG AGTTCACAAT GATGGAACTC TACATGGCAT ATGCGGATTA CCATGATTTG ATTGAACTGA CGGAGTCACT GTTCCGCACC CTGGCACAAG AGGTACTGGG TACCACTAAA GTCACTTATG GCGAGCATGT GTTTGATTTC GGCAAACCGT TTGAAAAACT CACCATGCGC GAAGCAATCA AAAAATATCG TCCAGAAACC GATATGGCGG ATCTGGATAA TTTTGATGCT GCTAAAGCAT TAGCTGAATC TATCGGAATT ACGGTAGAGA AAAGCTGGGG GCTGGGACGT ATTGTCACAG AGATCTTTGA TGAAGTGGCA GAAGCACATC TGATTCAGCC AACCTTTATT ACGGAATATC CGGCAGAAGT GTCCCCGCTG GCGCGCCGTA ATGATGTTAA CCCGGAAATC ACCGACCGTT TTGAATTCTT CATCGGTGGT CGTGAAATTG GTAATGGTTT TAGCGAATTA AACGACGCAG AAGATCAGGC TGAACGTTTC CAGGAACAGG TTAACGCTAA AGCTGCAGGT GACGACGAAG CCATGTTCTA TGACGAAGAT TACGTGACTG CGCTGGAATA TGGTCTGCCG CCAACCGCTG GTCTGGGTAT TGGTATCGAC CGAATGATTA TGCTGTTTAC TAACAGCCAT ACTATTCGCG ACGTTATTCT CTTCCCGGCG ATGCGCCCAC AGAAATAA
|
Protein sequence | MSERETRGAN EAIDFNDELR NRREKLAALR QQGVAFPNDF RRDHTSDQLH DEFDAKDNQE LESLNIEVSV AGRMMTRRIM GKASFVTLQD VGGRIQLYVA RDSLPEGVYN DQFKKWDLGD IIGARGTLFK TQTGELSIHC TELRLLTKAL RPLPDKFHGL QDQEVRYRQR YLDLIANDKS RQTFVVRSKI LAAIRQFMVA RGFMEVETPM MQVIPGGASA RPFITHHNAL DLDMYLRIAP ELYLKRLVVG GFERVFEINR NFRNEGISVR HNPEFTMMEL YMAYADYHDL IELTESLFRT LAQEVLGTTK VTYGEHVFDF GKPFEKLTMR EAIKKYRPET DMADLDNFDA AKALAESIGI TVEKSWGLGR IVTEIFDEVA EAHLIQPTFI TEYPAEVSPL ARRNDVNPEI TDRFEFFIGG REIGNGFSEL NDAEDQAERF QEQVNAKAAG DDEAMFYDED YVTALEYGLP PTAGLGIGID RMIMLFTNSH TIRDVILFPA MRPQK
|
| |