Gene EcSMS35_4073 details


Gene Information

Locus tag: EcSMS35_4073
Symbol: trmE
ID: 6144536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4163496
End bp: 4164860
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 57%
IMG OID: 641618898
Product: tRNA modification GTPase TrmE
Protein accession: YP_001746036
Protein GI: 170681108
COG category: [R] General function prediction only
COG ID: [COG0486] Predicted GTPase
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00450] tRNA modification GTPase TrmE
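
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4163496, 4164860)
print(length, protein_length(length))  # prints: 1365 454
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.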


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.420944
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATA ATGACACTAT CGTAGCCCAG GCCACGCCTC CGGGACGTGG CGGCGTTGGC 
ATCCTGCGCA TCTCCGGCCT CAAAGCCCGT GAGGTTGCCG AAACCGTGCT GGGTAAACTG
CCAAAGCCGC GCTACGCCGA TTATCTTCCG TTTAAAGACG CCGACGGCAG CGTGCTCGAT
CAGGGGATTG CGCTATGGTT CCCTGGCCCG AACTCGTTCA CCGGCGAAGA TGTGCTGGAA
CTGCAAGGTC ATGGCGGTCC GGTGATCCTC GACCTGCTGT TAAAACGCAT TCTGACCATT
CCCGGCCTGC GGATTGCCCG CCCTGGTGAG TTTTCCGAAC GCGCATTTCT GAACGATAAA
CTCGACTTAG CCCAGGCCGA AGCGATTGCT GATCTTATCG ACGCCAGCTC GGAACAGGCG
GCCCGTTCGG CGCTAAACTC GCTGCAAGGC GCATTCTCCG CACGGGTCAA CCATCTGGTG
GAAGCCCTCA CCCACTTGCG CATTTACGTC GAAGCGGCAA TTGATTTCCC GGATGAAGAG
ATCGATTTCC TCTCCGACGG CAAAATTGAA GCCCAGCTCA ATGACGTTAT TGCCGATCTT
GATGCAGTGC GTGCTGAAGC ACGTCAGGGT AGTTTGTTGC GCGAAGGGAT GAAAGTGGTG
ATTGCCGGAC GTCCTAACGC CGGTAAATCG AGCCTGTTAA ACGCGCTGGC GGGCCGTGAA
GCGGCAATCG TAACCGATAT CGCCGGAACC ACGCGTGACG TGCTGCGTGA GCATATCCAC
ATTGACGGAA TGCCGCTGCA TATCATCGAT ACCGCCGGGC TACGTGAAGC CAGTGACGAA
GTGGAACGTA TTGGTATCGA GCGCGCGTGG CAGGAAATTG AACAGGCCGA CCGCGTGCTG
TTTATGGTCG ATGGCACCAC GACAGACGCC GTTGATCCGG CAGAGATCTG GCCGGAATTT
ATCGCCCGTC TGCCAGCGAA ACTGCCGATC ACCGTGGTGC GCAATAAAGC CGATATCACC
GGCGAAACGT TGGGGATGAG CGAAGTGAAC GGTCATGCGT TAATTCGTCT TTCGGCGCGG
ACTGGCGAAG GCGTGGATGT GCTGCGTAAC CATCTCAAAC AGAGCATGGG CTTTGACACC
AATATGGAAG GCGGCTTCCT GGCGCGTCGT CGTCACCTAC AGGCGCTGGA ACAGGCGGCA
GAGCATTTGC AGCAAGGTAA AGCGCAACTG TTGGGTGCCT GGGCGGGTGA ACTGCTGGCG
GAAGAGTTGC GCCTGGCGCA GCAGAACTTA AGCGAAATCA CCGGGGAGTT TACTTCAGAC
GACCTGCTTG GGCGGATTTT CTCCAGCTTC TGTATTGGTA AGTAA
 
Protein sequence
MSDNDTIVAQ ATPPGRGGVG ILRISGLKAR EVAETVLGKL PKPRYADYLP FKDADGSVLD 
QGIALWFPGP NSFTGEDVLE LQGHGGPVIL DLLLKRILTI PGLRIARPGE FSERAFLNDK
LDLAQAEAIA DLIDASSEQA ARSALNSLQG AFSARVNHLV EALTHLRIYV EAAIDFPDEE
IDFLSDGKIE AQLNDVIADL DAVRAEARQG SLLREGMKVV IAGRPNAGKS SLLNALAGRE
AAIVTDIAGT TRDVLREHIH IDGMPLHIID TAGLREASDE VERIGIERAW QEIEQADRVL
FMVDGTTTDA VDPAEIWPEF IARLPAKLPI TVVRNKADIT GETLGMSEVN GHALIRLSAR
TGEGVDVLRN HLKQSMGFDT NMEGGFLARR RHLQALEQAA EHLQQGKAQL LGAWAGELLA
EELRLAQQNL SEITGEFTSD DLLGRIFSSF CIGK
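
The sequences above are printed in blocks of ten characters. The following is a minimal Python sketch of how such a listing might be joined, checked for GC content, and translated. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code; it does not handle alternative start codons, and all names in it are illustrative rather than part of any database tool.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_row = clean("ATGAGCGATA ATGACACTAT CGTAGCCCAG")
print(translate(first_row))  # prints: MSDNDTIVAQ
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value to compare with the GC content field.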