Gene EcolC_3979 details


Gene Information

Locus tag: EcolC_3979
Symbol:
ID: 6064516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4371166
End bp: 4372203
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 53%
IMG OID: 641603392
Product: tRNA-dihydrouridine synthase A
Protein accession: YP_001726907
Protein GI: 170021953
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00742] tRNA dihydrouridine synthase A
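
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon and no splicing."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4371166, 4372203)
print(length)                  # 1038
print(protein_length(length))  # 345
```

Both results match the Gene Length (1038 bp) and Protein Length (345 aa) fields listed in the record.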


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.16691
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGTA ATTCTGAAAT GCAAAAAATC AACCAAACCA GCGCAATGCC TGAAAAAACT 
GACGTTCACT GGAGTGGTCG GTTTAGCGTT GCACCAATGC TCGACTGGAC GGACAGACAT
TGCCGCTATT TCTTGCGTCT GCTTTCCCGC AATACGTTGC TGTATACCGA AATGGTGACC
ACAGGGGCGA TTATTCACGG TAAAGGTGAT TACCTGGCGT ACAGTGAAGA AGAACATCCG
GTAGCGTTGC AACTCGGCGG TAGCGATCCG GCGGCGCTGG CACAGTGTGC GAAGCTGGCA
GAAGCGCGTG GATATGATGA GATCAACCTG AATGTCGGCT GCCCGTCTGA CCGGGTGCAG
AACGGCATGT TTGGTGCGTG TCTGATGGGT AATGCGCAGC TGGTTGCCGA CTGCGTGAAA
GCGATGCGCG ATGTGGTGTC GATTCCGGTG ACGGTGAAAA CGCGTATTGG CATCGACGAC
CAGGACAGCT ATGAATTTCT CTGCGATTTC ATCAACACCG TTTCCGGCAA AGGCGAGTGT
GAGATGTTCA TCATCCATGC ACGTAAAGCC TGGCTTTCGG GGTTAAGTCC GAAAGAAAAC
CGTGAGATCC CGCCGCTCGA TTATCCGCGT GTGTATCAAC TGAAGCGTGA CTTTCCGCAT
CTGACAATGT CGATTAACGG TGGTATCAAG TCGCTGGAAG AGGCCAAAGC ACACCTGCAA
CATATGGATG GCGTGATGGT CGGGCGCGAG GCGTATCAGA ATCCGGGTAT TCTGGCGGCG
GTAGACCGGG AGATCTTTGG TTCCTCGGAT ATCGATGCCG ATCCGGTGGC GGTAGTGCGC
GCCATGTATC CGTACATTGA GCGTGAACTC AGCCAGGGGA CGTATCTCGG TCATATTACC
CGGCATATGT TGGGCTTGTT CCAGGGTATT CCTGGCGCGC GGCAGTGGCG GCGTTATTTA
AGTGAAAATG CCCATAAAGC GGGTGCAGAC ATTAATGTGC TGGAACACGC GCTCAAACTG
GTGGCGGATA AGCGTTAA
 
Protein sequence
MHGNSEMQKI NQTSAMPEKT DVHWSGRFSV APMLDWTDRH CRYFLRLLSR NTLLYTEMVT 
TGAIIHGKGD YLAYSEEEHP VALQLGGSDP AALAQCAKLA EARGYDEINL NVGCPSDRVQ
NGMFGACLMG NAQLVADCVK AMRDVVSIPV TVKTRIGIDD QDSYEFLCDF INTVSGKGEC
EMFIIHARKA WLSGLSPKEN REIPPLDYPR VYQLKRDFPH LTMSINGGIK SLEEAKAHLQ
HMDGVMVGRE AYQNPGILAA VDREIFGSSD IDADPVAVVR AMYPYIEREL SQGTYLGHIT
RHMLGLFQGI PGARQWRRYL SENAHKAGAD INVLEHALKL VADKR
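
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC content. The following is a minimal Python sketch, not the database's own method. It assumes the sequence is pasted in as shown, with 10-base groups separated by whitespace. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, which does not matter here because this gene starts with ATG. The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid assignments (shared by translation table 11),
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence; uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGCACGGTA ATTCTGAAAT GCAAAAAATC"))  # MHGNSEMQKI
# Last line of the gene sequence, ending in the TAA stop codon.
print(translate("GTGGCGGATA AGCGTTAA"))               # VADKR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` on the same input can be compared with the GC content field.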