Gene EcHS_A3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3920 
SymboltrmE 
ID5592654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3913999 
End bp3915363 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID640923028 
ProducttRNA modification GTPase TrmE 
Protein accessionYP_001460505 
Protein GI157163187 
COG category[R] General function prediction only 
COG ID[COG0486] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00450] tRNA modification GTPase TrmE 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.189642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATA ATGACACTAT CGTAGCCCAG GCCACGCCTC CGGGACGTGG CGGCGTTGGC 
ATCCTGCGCA TCTCCGGCTT CAAAGCCCGT GAAGTTGCCG AAACCGTGCT GGGTAAACTG
CCTAAGCCGC GCTACGCCGA TTATCTTCCG TTTAAAGACG CCGACGGCAG CGTGCTCGAT
CAGGGGATTG CGCTATGGTT CCCTGGCCCG AACTCGTTCA CCGGCGAAGA TGTGCTGGAA
CTGCAAGGTC ATGGCGGTCC GGTGATCCTC GACCTGCTGT TAAAACGCAT TCTGACCATT
CCCGGCCTGC GGATTGCTCG CCCTGGTGAG TTTTCCGAAC GCGCGTTTCT TAACGATAAA
CTTGACTTAG CCCAGGCCGA GGCGATTGCC GATCTTATCG ACGCCAGTTC GGAACAGGCG
GCCCGTTCGG CACTTAACTC GCTGCAAGGC GCATTCTCCG CACGGGTTAA TCATCTGGTA
GAAGCCCTCA CCCACTTGCG CATTTACGTC GAAGCGGCAA TTGATTTCCC CGATGAAGAG
ATCGATTTCC TCTCCGACGG AAAAATTGAA GCCCAGCTTA ATGACGTGAT TGCCGATCTC
GATGCAGTGC GTGCTGAAGC ACGTCAGGGT AGTTTGTTGC GCGAAGGGAT GAAGGTGGTG
ATTGCCGGAC GTCCTAACGC CGGTAAATCG AGCCTGTTAA ACGCGCTGGC GGGCCGTGAA
GCAGCAATCG TAACTGATAT CGCCGGAACC ACACGTGACG TGCTGCGTGA GCATATCCAC
ATTGACGGAA TGCCGCTGCA TATCATCGAT ACCGCTGGGC TACGTGAAGC CAGTGACGAA
GTGGAACGTA TTGGTATCGA GCGCGCGTGG CAGGAAATTG AACAGGCCGA CCGCGTGCTG
TTTATGGTCG ATGGCACCAC AACAGACGCC GTGGATCCGG CAGAGATCTG GCCGGAATTT
ATCGCCCGTC TGCCAGCGAA ACTGCCGATC ACCGTGGTGC GCAATAAAGC CGATATCACC
GGCGAAACGC TGGGAATGAG TGAAGTGAAC GGTCACGCGT TAATTCGTCT CTCGGCAAGG
ACAGGTGAAG GCGTGGAGGT GCTGCGTAAC CATCTCAAAC AGAGCATGGG CTTTGACACC
AACATGGAAG GCGGCTTCCT GGCGCGTCGT CGCCACCTAC AGGCGCTGGA ACAGGCAGCG
GAACATCTAC AACAGGGCAA AGCGCAACTG TTGGGAGCCT GGGCAGGTGA ACTGCTGGCG
GAAGAGTTGC GTCTGGCACA GCAGAACTTA AGCGAAATCA CCGGGGAATT TACTTCAGAC
GACCTGCTGG GGCGGATTTT CTCCAGCTTC TGTATTGGTA AGTAA
 
Protein sequence
MSDNDTIVAQ ATPPGRGGVG ILRISGFKAR EVAETVLGKL PKPRYADYLP FKDADGSVLD 
QGIALWFPGP NSFTGEDVLE LQGHGGPVIL DLLLKRILTI PGLRIARPGE FSERAFLNDK
LDLAQAEAIA DLIDASSEQA ARSALNSLQG AFSARVNHLV EALTHLRIYV EAAIDFPDEE
IDFLSDGKIE AQLNDVIADL DAVRAEARQG SLLREGMKVV IAGRPNAGKS SLLNALAGRE
AAIVTDIAGT TRDVLREHIH IDGMPLHIID TAGLREASDE VERIGIERAW QEIEQADRVL
FMVDGTTTDA VDPAEIWPEF IARLPAKLPI TVVRNKADIT GETLGMSEVN GHALIRLSAR
TGEGVEVLRN HLKQSMGFDT NMEGGFLARR RHLQALEQAA EHLQQGKAQL LGAWAGELLA
EELRLAQQNL SEITGEFTSD DLLGRIFSSF CIGK