Gene EcE24377A_3198 details


Gene Information

Locus tag: EcE24377A_3198
Symbol: hyuA
ID: 5587404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3213670
End bp: 3215055
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 53%
IMG OID: 640926838
Product: phenylhydantoinase
Protein accession: YP_001464210
Protein GI: 157158964
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR02033] D-hydantoinase
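
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of the source database, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon at the end."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(3213670, 3215055)
print(gene_length)                           # 1386
print(expected_protein_length(gene_length))  # 461
```

Both results agree with the Gene Length and Protein Length fields of the record.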


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGTAT TGATCAAAAA CGGCACTGTC GTTAACGCAG ATGGACAAGC CAAACAGGAT 
TTGCTGATTG AAAGCGGGAT TGTTCGCCAG TTGGGCAACA ATATTTCGCC GCAGCTCCCG
TATGAAGAAA TTGATGCCAC TGGCTGTTAC GTTTTCCCTG GCGGCGTGGA TGTCCATACG
CATTTCAATA TTGATGTCGG CATCGCGCGC AGTTGTGATG ATTTTTTTAC CGGTACCCGC
GCAGCTGCGT GTGGCGGTAC AACAACCATT ATTGACCATA TGGGATTTGG CCCAAACGGC
TGTCGGTTAC GCCATCAACT GGAGGTTTAT CGTGGTTATG CCGCCCATAA AGCGGTCATC
GATTACAGCT TTCACGGTGT GATCCAGCAC ATTAATCACG CAATCCTCGA CGAAATCCCG
ATGATGGTCG AGGAAGGACT GAGCAGTTTT AAACTCTATT TAACCTATCA ATACAAACTC
AACGATGACG AGGTTTTGCA GGCATTACGC CGTCTGCATG AATCCGGCGC GCTGACCACC
GTGCACCCGG AAAATGATGC GGCTATCGCC AGCAAGCGGG CGGAATTTAT CGCCGCAGGG
TTAACCGCGC CGCGCTATCA TGCCTTGAGT CGCCCTCTGG AATGCGAAGC GGAAGCCATC
GCCCGCATGA TTAACCTGGC ACAAATTGCC GGTAACGCCC CGCTCTATAT CGTGCACCTG
TCTAACGGCT TAGGTCTGGA TTATCTGCGT CTTGCCCGTG CGAATCACCA GCCAGTCTGG
GTTGAAACCT GCCCACAATA TCTCCTGTTG GACGAACGCA GTTACGATAC AGAAGACGGC
ATGAAGTTCA TTCTTAGCCC ACCGCTGCGT AACGTACGCG AGCAGGACAA ACTGTGGTGT
GGCATCAGCG ATGGTGCGAT TGACGTGGTG GCGACCGATC ACTGCACCTT CTCAATGGCT
CAACGCCTGC AAATTTCTAA AGGCGATTTC AGTCGCTGCC CAAATGGCTT ACCCGGTGTG
GAAAACCGCA TGCAGTTACT GTTTTCCAGT GGCGTGATGA CGGGACGTAT AACACCGGAA
CGCTTTGTTG AATTAACCAG CGCAATGCCC GCCAGGCTGT TTGGCCTGTG GCCGCAAAAA
GGATTATTAG CGCCCGGTTC CGATGGCGAC GTGGTGATTA TCGACCCACG TCAGAGCCAA
CAAATTCAGC ATCGCCATCT CCACGACAAC GCCGACTACT CGCCATGGGA GGGTTTTACC
TGTCAGGGCG CGATTGTCAG AACCTTATCC CGTGGTGAAA CGATTTTCTG TGACGGCACC
TTTACAGGCA AAGCCGGGCG AGGTCGTTTC CTGCGACGCA AACCGTTTGT CCCTCCCGTG
CTCTAA
 
Protein sequence
MRVLIKNGTV VNADGQAKQD LLIESGIVRQ LGNNISPQLP YEEIDATGCY VFPGGVDVHT 
HFNIDVGIAR SCDDFFTGTR AAACGGTTTI IDHMGFGPNG CRLRHQLEVY RGYAAHKAVI
DYSFHGVIQH INHAILDEIP MMVEEGLSSF KLYLTYQYKL NDDEVLQALR RLHESGALTT
VHPENDAAIA SKRAEFIAAG LTAPRYHALS RPLECEAEAI ARMINLAQIA GNAPLYIVHL
SNGLGLDYLR LARANHQPVW VETCPQYLLL DERSYDTEDG MKFILSPPLR NVREQDKLWC
GISDGAIDVV ATDHCTFSMA QRLQISKGDF SRCPNGLPGV ENRMQLLFSS GVMTGRITPE
RFVELTSAMP ARLFGLWPQK GLLAPGSDGD VVIIDPRQSQ QIQHRHLHDN ADYSPWEGFT
CQGAIVRTLS RGETIFCDGT FTGKAGRGRF LRRKPFVPPV L
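
The protein sequence corresponds to the gene sequence read codon by codon. A minimal sketch of that translation follows, with some assumptions stated up front: `translate` and the table constants are hypothetical names, not a database API. The amino acid assignments are those of the standard code, which translation table 11 shares. The sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCGTAT TGATCAAAAA CGGCACTGTC"))  # MRVLIKNGTV
```

The output matches the first ten residues of the protein sequence. The final bases of the gene, CCCGTG CTCTAA, translate to PVL followed by the TAA stop codon, matching the end of the protein.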