Gene EcHS_A4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4290 
SymboldusA 
ID5595435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4293407 
End bp4294450 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content53% 
IMG OID640923392 
ProducttRNA-dihydrouridine synthase A 
Protein accessionYP_001460837 
Protein GI157163519 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00742] tRNA dihydrouridine synthase A 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAATGC ACGGTAATTC TGAAATGCAA AAAATCAACC AAACCAGCGC AATGCCTGAA 
AAAACTGACG TTCACTGGAG TGGTCGGTTT AGCGTTGCAC CAATGCTCGA CTGGACGGAC
AGACATTGCC GCTATTTCTT GCGTCTGCTT TCCCGCAATA CGTTGCTTTA TACCGAAATG
GTGACCACAG GGGCGATTAT TCACGGTAAA GGTGATTATC TGGCGTACAG TGAAGAAGAA
CATCCGGTAG CGTTGCAACT GGGCGGTAGC GATCCGGCGG CGCTGGCGCA GTGTGCAAAG
CTGGCAGAAG CGCGCGGATA TGATGAGATC AACCTGAATG TCGGCTGCCC GTCTGACCGG
GTGCAGAACG GCATGTTTGG TGCGTGTCTG ATGGGTAATG CGCAGCTGGT TGCCGACTGC
GTGAAAGCGA TGCGCGATGT GGTGTCGATT CCGGTGACGG TGAAAACGCG TATTGGCATC
GACGACCAGG ACAGCTATGA ATTTCTCTGC GATTTCATCA ATACCGTTTC CGGCAAAGGC
GAGTGTGAGA TGTTCATCAT CCACGCACGT AAAGCCTGGC TTTCGGGGTT AAGCCCGAAA
GAAAACCGTG AAATCCCGCC GCTCGATTAT CCGCGTGTGT ATCAACTGAA GCGTGACTTT
CCGCATCTGA CAATGTCGAT TAACGGTGGT ATCAAGTCGC TGGAAGAGGC CAAAGCACAC
CTGCAACATA TGGATGGCGT GATGGTCGGG CGCGAGGCGT ATCAGAATCC GGGTATTCTG
GCGGCGGTAG ACCGGGAGAT CTTTGGTTCC TCGGATACCG ATGCCGATCC GGTGGCGGTA
GTGCGCGCCA TGTATCCGTA CATTGAGCGT GAACTCAGCC AGGGGACGTA TCTCGGCCAT
ATTACCCGGC ATATGCTGGG TTTGTTCCAG GGTATTCCTG GCGCGCGGCA GTGGCGGCGT
TATTTAAGTG AAAATGCCCA TAAAGCGGGT GCTGACATTA ACGTGCTGGA ACACGCGCTC
AAACTGGTGG CGGATAAGCG TTAA
 
Protein sequence
MKMHGNSEMQ KINQTSAMPE KTDVHWSGRF SVAPMLDWTD RHCRYFLRLL SRNTLLYTEM 
VTTGAIIHGK GDYLAYSEEE HPVALQLGGS DPAALAQCAK LAEARGYDEI NLNVGCPSDR
VQNGMFGACL MGNAQLVADC VKAMRDVVSI PVTVKTRIGI DDQDSYEFLC DFINTVSGKG
ECEMFIIHAR KAWLSGLSPK ENREIPPLDY PRVYQLKRDF PHLTMSINGG IKSLEEAKAH
LQHMDGVMVG REAYQNPGIL AAVDREIFGS SDTDADPVAV VRAMYPYIER ELSQGTYLGH
ITRHMLGLFQ GIPGARQWRR YLSENAHKAG ADINVLEHAL KLVADKR