Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3660 |
Symbol | trpS |
ID | 6144938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3719423 |
End bp | 3720427 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618487 |
Product | tryptophanyl-tRNA synthetase |
Protein accession | YP_001745627 |
Protein GI | 170681086 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0180] Tryptophanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00233] tryptophanyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0133889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAAGC CCATCGTTTT TAGTGGCGCA CAGCCCTCAG GTGAATTGAC CATTGGTAAC TACATGGGTG CGCTGCGTCA GTGGGTAAAC ATGCAGGATG ACTACCATTG CATTTACTGT ATCGTTGACC AACACGCGAT CACCGTGCGC CAGGATGCAC AGAAGCTGCG TAAAGCGACG CTGGATACGC TGGCCTTGTA TCTGGCTTGT GGTATCGATC CTGAGAAAAG CACCATTTTT GTTCAGTCCC ACGTGCCGGA ACATGCGCAG TTAGGCTGGG CACTGAACTG CTATACCTAC TTCGGCGAAC TGAGCCGCAT GACGCAGTTT AAAGATAAAT CTGCGCGTTA TGCCGAGAAC ATCAATGCTG GTCTGTTTGA CTATCCGGTG CTGATGGCTG CGGACATCCT GCTGTATCAA ACTAATCTGG TACCGGTGGG TGAAGACCAG AAACAGCACC TGGAACTGAG CCGCGATATC GCCCAGCGTT TCAACGCGCT GTATGGCGAT ATCTTTAAAG TGCCGGAGCC GTTTATTCCG AAATCCGGCG CGCGCGTAAT GTCGCTGCTG GAGCCGACCA AAAAGATGTC CAAGTCTGAC GATAACCGCA ATAACGTTAT CGGCCTGCTG GAAGATCCGA AATCGGTAGT GAAGAAAATC AAACGTGCGG TCACTGACTC CGACGAGCCG CCGGTAGTTC GCTACGATGT GCAGAACAAA GCGGGCGTTT CCAACCTGCT GGATATCCTT TCTGCGGTAA CGGGCCAGAG CATCCCGGAA CTGGAAAAAC AGTTCGAAGG CAAGATGTAT GGTCATCTCA AAGGCGAAGT GGCTGATGCC GTTTCCGGTA TGCTGACTGA ATTGCAGGAA CGTTATCACC GTTTCCGCAA CGATGAAGCC TTCCTGCAAC AGGTGATGAA AGACGGCGCG GAAAAAGCCA GTGCACACGC TTCTCGTACG CTGAAAGCGG TGTACGAAGC GATTGGTTTT GTGGCGAAGC CGTAA
|
Protein sequence | MTKPIVFSGA QPSGELTIGN YMGALRQWVN MQDDYHCIYC IVDQHAITVR QDAQKLRKAT LDTLALYLAC GIDPEKSTIF VQSHVPEHAQ LGWALNCYTY FGELSRMTQF KDKSARYAEN INAGLFDYPV LMAADILLYQ TNLVPVGEDQ KQHLELSRDI AQRFNALYGD IFKVPEPFIP KSGARVMSLL EPTKKMSKSD DNRNNVIGLL EDPKSVVKKI KRAVTDSDEP PVVRYDVQNK AGVSNLLDIL SAVTGQSIPE LEKQFEGKMY GHLKGEVADA VSGMLTELQE RYHRFRNDEA FLQQVMKDGA EKASAHASRT LKAVYEAIGF VAKP
|
| |