Gene EcSMS35_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3620 
Symboltuf2 
ID6147134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3682274 
End bp3683458 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content53% 
IMG OID641618447 
Productelongation factor Tu 
Protein accessionYP_001745587 
Protein GI170681311 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0050] GTPases - translation elongation factors 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000372822 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00440714 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC TATCGGCCAC 
GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA CCGTACTGGC TAAAACCTAC
GGCGGTGCTG CTCGTGCATT CGACCAGATC GATAACGCGC CGGAAGAAAA AGCTCGTGGT
ATCACCATCA ACACTTCTCA CGTTGAATAT GACACCCCGA CCCGTCACTA CGCACACGTA
GACTGCCCGG GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC
GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC
CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA ATGCGACATG
GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG
TACGATTTCC CGGGCGACGA CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA
GGCGACGCAG AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTACATT
CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC
ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT CAAAGTTGGT
GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT CTACCTGTAC TGGCGTTGAA
ATGTTCCGCA AACTGCTGGA CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT
GGTATCAAAC GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG
CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT
ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA CGTGACTGGT
ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG GCGACAACAT CAAAATGGTT
GTTACCCTGA TCCACCCGAT CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC
GGCCGTACCG TTGGCGCGGG CGTTGTTGCT AAAGTTCTGG GCTAA
 
Protein sequence
MSKEKFERTK PHVNVGTIGH VDHGKTTLTA AITTVLAKTY GGAARAFDQI DNAPEEKARG 
ITINTSHVEY DTPTRHYAHV DCPGHADYVK NMITGAAQMD GAILVVAATD GPMPQTREHI
LLGRQVGVPY IIVFLNKCDM VDDEELLELV EMEVRELLSQ YDFPGDDTPI VRGSALKALE
GDAEWEAKIL ELAGFLDSYI PEPERAIDKP FLLPIEDVFS ISGRGTVVTG RVERGIIKVG
EEVEIVGIKE TQKSTCTGVE MFRKLLDEGR AGENVGVLLR GIKREEIERG QVLAKPGTIK
PHTKFESEVY ILSKDEGGRH TPFFKGYRPQ FYFRTTDVTG TIELPEGVEM VMPGDNIKMV
VTLIHPIAMD DGLRFAIREG GRTVGAGVVA KVLG