Gene EcSMS35_4428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4428 
Symboltuf1 
ID6143658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4521516 
End bp4522700 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content53% 
IMG OID641619248 
Productelongation factor Tu 
Protein accessionYP_001746364 
Protein GI170681349 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0050] GTPases - translation elongation factors 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000375004 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00000974336 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAAAG AAAAGTTTGA ACGTACAAAA CCGCACGTTA ACGTCGGTAC TATCGGCCAC 
GTTGACCATG GTAAAACAAC GCTGACCGCT GCAATCACTA CCGTACTGGC TAAAACCTAC
GGCGGTGCTG CTCGCGCATT CGACCAGATC GATAACGCGC CGGAAGAAAA AGCTCGTGGT
ATCACCATCA ACACTTCTCA CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA
GACTGCCCGG GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC
GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC
CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA ATGCGACATG
GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG
TACGACTTCC CGGGCGACGA CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA
GGCGATGCAG AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTACATT
CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC
ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT CAAAGTTGGT
GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT CTACCTGTAC TGGCGTTGAA
ATGTTCCGCA AACTGCTGGA CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT
GGTATCAAAC GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG
CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT
ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA CGTGACTGGT
ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG GCGACAACAT CAAAATGGTT
GTTACCCTGA TCCACCCGAT CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC
GGCCGTACCG TTGGCGCGGG CGTTGTAGCA AAAGTTCTGA GCTAA
 
Protein sequence
MSKEKFERTK PHVNVGTIGH VDHGKTTLTA AITTVLAKTY GGAARAFDQI DNAPEEKARG 
ITINTSHVEY DTPTRHYAHV DCPGHADYVK NMITGAAQMD GAILVVAATD GPMPQTREHI
LLGRQVGVPY IIVFLNKCDM VDDEELLELV EMEVRELLSQ YDFPGDDTPI VRGSALKALE
GDAEWEAKIL ELAGFLDSYI PEPERAIDKP FLLPIEDVFS ISGRGTVVTG RVERGIIKVG
EEVEIVGIKE TQKSTCTGVE MFRKLLDEGR AGENVGVLLR GIKREEIERG QVLAKPGTIK
PHTKFESEVY ILSKDEGGRH TPFFKGYRPQ FYFRTTDVTG TIELPEGVEM VMPGDNIKMV
VTLIHPIAMD DGLRFAIREG GRTVGAGVVA KVLS