Gene EcolC_4288 details

Gene Information

Locus tag: EcolC_4288
Symbol: trmE
ID: 6068121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4741803
End bp: 4743167
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 56%
IMG OID: 641603725
Product: tRNA modification GTPase TrmE
Protein accession: YP_001727211
Protein GI: 170022257
COG category: [R] General function prediction only
COG ID: [COG0486] Predicted GTPase
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00450] tRNA modification GTPase TrmE
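
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4741803
END_BP = 4743167


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1365 454
```

Under those assumptions the result agrees with the reported Gene Length (1365 bp) and Protein Length (454 aa).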


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATA ATGACACTAT CGTAGCCCAG GCCACGCCTC CGGGACGTGG CGGCGTTGGC 
ATCCTGCGCA TCTCCGGCTT CAAAGCCCGT GAAGTTGCCG AAACCGTGCT GGGTAAACTG
CCTAAGCCGC GCTACGCCGA TTATCTTCCG TTTAAAGACG CCGACGGCAG CGTGCTCGAT
CAGGGGATTG CGCTATGGTT CCCTGGCCCG AACTCGTTCA CCGGCGAAGA TGTGCTGGAA
CTGCAAGGTC ATGGCGGTCC GGTGATCCTC GACCTGCTGT TAAAACGCAT TCTGACCATT
CCCGGCCTGC GGATTGCTCG CCCTGGTGAG TTTTCCGAAC GCGCGTTTCT TAACGATAAA
CTTGACTTAG CCCAGGCCGA GGCGATTGCC GATCTTATCG ACGCCAGTTC GGAACAGGCG
GCCCGTTCGG CACTTAACTC GCTGCAAGGC GCATTCTCCG CACGGGTTAA TCATCTGGTA
GAAGCCCTCA CCCACTTGCG CATTTACGTC GAAGCGGCAA TTGATTTCCC CGATGAAGAG
ATCGATTTCC TCTCCGACGG AAAAATTGAA GCCCAGCTCA ATGACGTTAT TGCCGATCTT
GATGCAGTGC GTGCTGAAGC ACGTCAGGGT AGTTTGTTGC GCGAAGGGAT GAAAGTGGTG
ATTGCCGGAC GTCCTAACGC CGGTAAATCG AGCCTGTTAA ACGCGCTGGC GGGGCGTGAA
GCGGCAATCG TAACCGATAT CGCCGGAACT ACGCGTGACG TGCTGCGTGA GCATATCCAC
ATTGACGGAA TGCCGCTGCA TATCATCGAT ACCGCCGGGC TACGTGAAGC CAGTGACGAA
GTAGAACGTA TTGGTATCGA GCGCGCGTGG CAGGAAATTG AACAGGCCGA CCGCGTGCTG
TTTATGGTCG ATGGCACCAC AACAGACGCC GTGGATCCGG CAGAGATCTG GCCGGAATTT
ATTGCCCGTC TGCCAGCGAA ACTGCCGATC ACCGTGGTGC GCAATAAAGC CGATATCACC
GGCGAAACGC TGGGAATGAG TGAAGTGAAC GGTCACGCGT TAATTCGTCT CTCGGCAAGG
ACTGGTGAAG GCGTGGACGT GCTGCGTAAC CATCTCAAAC AGAGCATGGG CTTTGACACC
AACATGGAAG GCGGCTTCCT GGCGCGTCGT CGCCACCTAC AGGCGCTGGA ACAGGCAGCG
GAACATCTAC AACAGGGCAA AGCGCAACTG TTGGGAGCCT GGGCAGGTGA ACTGCTGGCG
GAAGAGTTGC GTCTGGCACA GCAGAACTTA AGCGAAATCA CCGGGGAATT TACTTCAGAC
GACCTGCTGG GGCGGATTTT CTCCAGCTTC TGTATTGGTA AGTAA
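
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage that could be compared against the GC content field; the names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is shown as input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence; paste all rows to analyse the full gene.
first_row = "ATGAGCGATA ATGACACTAT CGTAGCCCAG GCCACGCCTC CGGGACGTGG CGGCGTTGGC"
row = clean_sequence(first_row)
print(len(row), round(gc_percent(row), 1))
```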
 
Protein sequence
MSDNDTIVAQ ATPPGRGGVG ILRISGFKAR EVAETVLGKL PKPRYADYLP FKDADGSVLD 
QGIALWFPGP NSFTGEDVLE LQGHGGPVIL DLLLKRILTI PGLRIARPGE FSERAFLNDK
LDLAQAEAIA DLIDASSEQA ARSALNSLQG AFSARVNHLV EALTHLRIYV EAAIDFPDEE
IDFLSDGKIE AQLNDVIADL DAVRAEARQG SLLREGMKVV IAGRPNAGKS SLLNALAGRE
AAIVTDIAGT TRDVLREHIH IDGMPLHIID TAGLREASDE VERIGIERAW QEIEQADRVL
FMVDGTTTDA VDPAEIWPEF IARLPAKLPI TVVRNKADIT GETLGMSEVN GHALIRLSAR
TGEGVDVLRN HLKQSMGFDT NMEGGFLARR RHLQALEQAA EHLQQGKAQL LGAWAGELLA
EELRLAQQNL SEITGEFTSD DLLGRIFSSF CIGK
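
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which is not handled here since this gene begins with ATG). The function name `translate` is illustrative, and only the first and last codons of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGAGCGATA ATGACACTAT CGTAGCCCAG"))  # prints: MSDNDTIVAQ
print(translate("TGTATTGGTAAGTAA"))  # prints: CIGK
```

Both fragments match the corresponding ends of the listed protein sequence (MSDNDTIVAQ at the start, CIGK at the end).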