Gene EcolC_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0446 
Symbol 
ID6068193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp483697 
End bp484662 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content52% 
IMG OID641599852 
ProducttRNA-dihydrouridine synthase B 
Protein accessionYP_001723451 
Protein GI170018497 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000179057 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG GACAATATCA GCTCAGAAAT CGCCTGATCG CAGCGCCCAT GGCTGGCATT 
ACAGACAGAC CTTTTCGGAC GTTGTGCTAC GAGATGGGAG CCGGATTGAC AGTATCCGAG
ATGATGTCTT CTAACCCACA GGTTTGGGAA AGCGACAAAT CTCGTTTACG GATGGTGCAC
ATTGATGAAC CCGGTATTCG CACCGTGCAA ATTGCTGGTA GCGATCCGAA AGAAATGGCA
GATGCAGCAC GTATTAACGT GGAAAGCGGT GCCCAGATTA TTGATATCAA TATGGGTTGC
CCGGCTAAAA AAGTGAATCG CAAGCTCGCA GGTTCAGCCC TCTTGCAGTA CCCGGATGTC
GTTAAATCGA TCCTTACCGA GGTCGTCAAT GCAGTGGACG TTCCTGTTAC CCTGAAGATT
CGCACCGGCT GGGCACCGGA ACACCGTAAC TGCGAAGAGA TTGCCCAACT GGCTGAAGAC
TGTGGCATTC AGGCTCTGAC CATTCATGGC CGTACACGCG CCTGTTTGTT CAATGGAGAA
GCTGAGTACG ACAGTATTCG GGCAGTTAAG CAGAAAGTTT CCATTCCGGT TATCGCGAAT
GGCGACATTA CTGACCCGCT TAAAGCCAGA GCTGTGCTCG ACTATACAGG GGCGGATGCC
CTGATGATAG GCCGCGCAGC TCAGGGAAGA CCCTGGATCT TTCGGGAAAT CCAGCATTAT
CTGGACACTG GGGAGTTGCT GCCCCCGCTG CCTTTGGCAG AGGTAAAGCG CTTGCTTTGC
GCGCACGTTC GGGAACTGCA TGACTTTTAT GGTCCGGCAA AAGGGTACCG AATTGCACGT
AAACACGTTT CCTGGTATCT CCAGGAACAC GCTCCAAATG ACCAGTTTCG GCGCACATTC
AACGCCATTG AGGATGCCAG CGAACAGCTG GAGGCGTTGG AGGCATACTT CGAAAATTTT
GCGTAA
 
Protein sequence
MRIGQYQLRN RLIAAPMAGI TDRPFRTLCY EMGAGLTVSE MMSSNPQVWE SDKSRLRMVH 
IDEPGIRTVQ IAGSDPKEMA DAARINVESG AQIIDINMGC PAKKVNRKLA GSALLQYPDV
VKSILTEVVN AVDVPVTLKI RTGWAPEHRN CEEIAQLAED CGIQALTIHG RTRACLFNGE
AEYDSIRAVK QKVSIPVIAN GDITDPLKAR AVLDYTGADA LMIGRAAQGR PWIFREIQHY
LDTGELLPPL PLAEVKRLLC AHVRELHDFY GPAKGYRIAR KHVSWYLQEH APNDQFRRTF
NAIEDASEQL EALEAYFENF A