Gene EcSMS35_3660 details


Gene Information

Locus tag: EcSMS35_3660
Symbol: trpS
ID: 6144938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3719423
End bp: 3720427
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 52%
IMG OID: 641618487
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_001745627
Protein GI: 170681086
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
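
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3719423, 3720427)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1005 334
```

Both results agree with the Gene Length and Protein Length values listed in the record.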


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0133889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAGC CCATCGTTTT TAGTGGCGCA CAGCCCTCAG GTGAATTGAC CATTGGTAAC 
TACATGGGTG CGCTGCGTCA GTGGGTAAAC ATGCAGGATG ACTACCATTG CATTTACTGT
ATCGTTGACC AACACGCGAT CACCGTGCGC CAGGATGCAC AGAAGCTGCG TAAAGCGACG
CTGGATACGC TGGCCTTGTA TCTGGCTTGT GGTATCGATC CTGAGAAAAG CACCATTTTT
GTTCAGTCCC ACGTGCCGGA ACATGCGCAG TTAGGCTGGG CACTGAACTG CTATACCTAC
TTCGGCGAAC TGAGCCGCAT GACGCAGTTT AAAGATAAAT CTGCGCGTTA TGCCGAGAAC
ATCAATGCTG GTCTGTTTGA CTATCCGGTG CTGATGGCTG CGGACATCCT GCTGTATCAA
ACTAATCTGG TACCGGTGGG TGAAGACCAG AAACAGCACC TGGAACTGAG CCGCGATATC
GCCCAGCGTT TCAACGCGCT GTATGGCGAT ATCTTTAAAG TGCCGGAGCC GTTTATTCCG
AAATCCGGCG CGCGCGTAAT GTCGCTGCTG GAGCCGACCA AAAAGATGTC CAAGTCTGAC
GATAACCGCA ATAACGTTAT CGGCCTGCTG GAAGATCCGA AATCGGTAGT GAAGAAAATC
AAACGTGCGG TCACTGACTC CGACGAGCCG CCGGTAGTTC GCTACGATGT GCAGAACAAA
GCGGGCGTTT CCAACCTGCT GGATATCCTT TCTGCGGTAA CGGGCCAGAG CATCCCGGAA
CTGGAAAAAC AGTTCGAAGG CAAGATGTAT GGTCATCTCA AAGGCGAAGT GGCTGATGCC
GTTTCCGGTA TGCTGACTGA ATTGCAGGAA CGTTATCACC GTTTCCGCAA CGATGAAGCC
TTCCTGCAAC AGGTGATGAA AGACGGCGCG GAAAAAGCCA GTGCACACGC TTCTCGTACG
CTGAAAGCGG TGTACGAAGC GATTGGTTTT GTGGCGAAGC CGTAA
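
A GC content figure like the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C among all A/C/G/T bases; how the database itself computes or rounds the value is not stated here. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence, spacing left as printed.
first_blocks = "ATGACTAAGC CCATCGTTTT TAGTGGCGCA"
print(round(gc_content(first_blocks), 1))  # 46.7
```

The printed value covers only the first 30 bases. Passing the full 1005 bp sequence gives the gene-wide figure to compare with the GC content field.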
 
Protein sequence
MTKPIVFSGA QPSGELTIGN YMGALRQWVN MQDDYHCIYC IVDQHAITVR QDAQKLRKAT 
LDTLALYLAC GIDPEKSTIF VQSHVPEHAQ LGWALNCYTY FGELSRMTQF KDKSARYAEN
INAGLFDYPV LMAADILLYQ TNLVPVGEDQ KQHLELSRDI AQRFNALYGD IFKVPEPFIP
KSGARVMSLL EPTKKMSKSD DNRNNVIGLL EDPKSVVKKI KRAVTDSDEP PVVRYDVQNK
AGVSNLLDIL SAVTGQSIPE LEKQFEGKMY GHLKGEVADA VSGMLTELQE RYHRFRNDEA
FLQQVMKDGA EKASAHASRT LKAVYEAIGF VAKP
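
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a hedged illustration using only the Python standard library. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons. The `translate` helper is hypothetical and does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACTAAGC CCATCGTTTT TAGTGGCGCA"))  # MTKPIVFSGA
```

Applied to the full gene sequence, the same function can be used to check the result against the 334-residue protein sequence listed above.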