Gene EcSMS35_0438 details

Gene Information

Locus tag: EcSMS35_0438
Symbol: tgt
ID: 6146420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 450059
End bp: 451186
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 51%
IMG OID: 641615334
Product: queuine tRNA-ribosyltransferase
Protein accession: YP_001742541
Protein GI: 170680278
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0343] Queuine/archaeosine tRNA-ribosyltransferase
TIGRFAM ID: [TIGR00449] tRNA-guanine transglycosylases, various specificities
            [TIGR00430] tRNA-guanine transglycosylase, queuosine-34-forming
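
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 450059
end_bp = 451186
reported_gene_length = 1128   # bp
reported_protein_length = 375  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1128 0 375
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the coordinates, the gene length, and the protein length agree with one another.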


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTG AACTGGACAC TACCGACGGT CGCGCACGCC GTGGCCGCCT GGTCTTTGAT 
CGTGGCGTAG TGGAAACGCC TTGTTTTATG CCTGTTGGCA CTTACGGCAC CGTAAAAGGG
ATGACGCCGG AAGAAGTTGA AGCCACTGGC GCGCAAATTA TCCTCGGCAA CACCTTCCAC
CTGTGGCTGC GCCCGGGCCA GGAAATCATG AAACTGCATG GCGATCTGCA CGATTTTATG
CAGTGGAAAG GCCCGATTCT TACCGACTCC GGCGGCTTCC AGGTCTTCAG CCTTGGCGAT
ATTCGTAAAA TCACCGAACA GGGCGTTCAC TTCCGTAACC CGATCAACGG CGATCCGATT
TTCCTCGACC CGGAAAAGTC GATGGAGATT CAGTACGATC TTGGTTCGGA TATCGTCATG
ATCTTTGATG AGTGTACGCC GTATCCTGCT GACTGGGATT ACGCAAAACG CTCCATGGAG
ATGTCTCTGC GTTGGGCGAA GCGTAGCCGT GAGCGTTTTG ACAGTCTCGG AAACAAAAAT
GCACTGTTTG GTATCATTCA GGGCAGCGTT TACGAAGATT TACGTGATAT TTCTGTTAAA
GGTCTGGTAG ATATCGGTTT TGATGGCTAC GCTGTCGGCG GTCTGGCTGT GGGTGAGCCG
AAAGCAGATA TGCACCGCAT TCTGGAGCAT GTATGCCCGC AAATTCCGGC AGACAAACCG
CGTTACCTGA TGGGCGTTGG TAAACCAGAA GACCTGGTTG AAGGCGTGCG TCGCGGTATC
GATATGTTTG ACTGCGTAAT GCCAACCCGC AACGCCCGAA ATGGTCATTT GTTCGTGACC
GATGGCGTGG TGAAAATCCG CAATGCGAAG TATAAGAGCG ATACTGGCCC ACTCGATCCT
GAGTGTGATT GCTACACCTG TCGCAATTAT TCACGCGCTT ACTTGCATCA TCTCGACCGT
TGCAACGAAA TATTAGGCGC GCGACTCAAC ACCATTCATA ACCTTCGTTA CTACCAGCGT
TTGATGGCGG GTTTACGCAA GGCTATTGAA GAGGGTAAAT TAGAGAGCTT CGTAACTGAT
TTTTACCAGC GTCAGGGGCG AGAAGTACCA CCTTTGAACG TTGATTAA
 
Protein sequence
MKFELDTTDG RARRGRLVFD RGVVETPCFM PVGTYGTVKG MTPEEVEATG AQIILGNTFH 
LWLRPGQEIM KLHGDLHDFM QWKGPILTDS GGFQVFSLGD IRKITEQGVH FRNPINGDPI
FLDPEKSMEI QYDLGSDIVM IFDECTPYPA DWDYAKRSME MSLRWAKRSR ERFDSLGNKN
ALFGIIQGSV YEDLRDISVK GLVDIGFDGY AVGGLAVGEP KADMHRILEH VCPQIPADKP
RYLMGVGKPE DLVEGVRRGI DMFDCVMPTR NARNGHLFVT DGVVKIRNAK YKSDTGPLDP
ECDCYTCRNY SRAYLHHLDR CNEILGARLN TIHNLRYYQR LMAGLRKAIE EGKLESFVTD
FYQRQGREVP PLNVD
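
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a hedged sketch, not the tool used to produce this record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Codon -> amino acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above, pasted with its original spacing.
first_row = "ATGAAATTTG AACTGGACAC TACCGACGGT CGCGCACGCC GTGGCCGCCT GGTCTTTGAT"
print(translate(first_row))  # MKFELDTTDGRARRGRLVFD
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values to compare against the protein sequence and the GC content field.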