Gene EcSMS35_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1476 
SymbolpheS 
ID6145905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1459802 
End bp1460785 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content54% 
IMG OID641616354 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_001743534 
Protein GI170681078 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000802192 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACATC TCGCAGAACT GGTTGCCAGT GCAAAGGCGG CCATTAGCCA GGCGTCAGAT 
GTTGCCGCGT TAGACAATGT GCGCGTCGAA TATTTGGGTA AAAAAGGGCA CTTAACCCTT
CAGATGACGA CCCTACGTGA GCTGCCGCCA GAAGAGCGTC CGGCAGCTGG TGCGGTTATC
AATGAAGCGA AAGAGCAGGT TCAGCAGGCG CTGAATGCGC GTAAAGCGGA ACTGGAAAGC
GCTGCACTGA ATGCGCGTCT GGCGGCGGAA ACGATTGATG TCTCCCTGCC GGGTCGTCGC
ATTGAAAACG GCGGGCTGCA TCCGGTGACC CGTACCATCG ACCGTATCGA AAGTTTCTTC
GGTGAGCTTG GCTTTACCGT GGCAACCGGG CCGGAAATCG AAGACGATTA TCATAACTTC
GATGCTCTGA ACATTCCTGG TCACCACCCG GCGCGCGCTG ACCACGACAC TTTCTGGTTT
GACGCTACCC GCCTGCTGCG TACCCAGACC TCTGGCGTAC AGATCCGCAC CATGAAAGCT
CAGCAGCCAC CGATTCGTAT CATCGCGCCT GGCCGCGTTT ATCGTAACGA CTACGACCAG
ACTCACACGC CGATGTTCCA TCAGATGGAA GGTCTGATTG TTGATACCAA CATCAGCTTT
ACCAACCTGA AAGGCACGCT GCACGACTTC CTGCGTAACT TCTTTGAGGA AGATTTGCAG
ATCCGCTTCC GTCCTTCCTA CTTCCCGTTT ACCGAACCTT CTGCAGAAGT GGACGTCATG
GGTAAAAACG GTAAATGGCT GGAAGTACTG GGCTGCGGGA TGGTGCATCC GAACGTGCTG
CGTAATGTTG GCATCGACCC GGAAGTTTAC TCTGGTTTCG CCTTCGGAAT GGGGATGGAG
CGTCTGACCA TGTTGCGTTA CGGCGTCACC GACCTGCGTT CATTCTTCGA AAACGATCTG
CGTTTCCTCA AACAGTTTAA ATAA
 
Protein sequence
MSHLAELVAS AKAAISQASD VAALDNVRVE YLGKKGHLTL QMTTLRELPP EERPAAGAVI 
NEAKEQVQQA LNARKAELES AALNARLAAE TIDVSLPGRR IENGGLHPVT RTIDRIESFF
GELGFTVATG PEIEDDYHNF DALNIPGHHP ARADHDTFWF DATRLLRTQT SGVQIRTMKA
QQPPIRIIAP GRVYRNDYDQ THTPMFHQME GLIVDTNISF TNLKGTLHDF LRNFFEEDLQ
IRFRPSYFPF TEPSAEVDVM GKNGKWLEVL GCGMVHPNVL RNVGIDPEVY SGFAFGMGME
RLTMLRYGVT DLRSFFENDL RFLKQFK