Gene EcolC_0835 details

Gene Information

Locus tag: EcolC_0835
Symbol:
ID: 6065012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 901330
End bp: 902715
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 53%
IMG OID: 641600240
Product: phenylhydantoinase
Protein accession: YP_001723834
Protein GI: 170018880
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR02033] D-hydantoinase
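
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 901330
END_BP = 902715


def gene_length(start_bp, end_bp):
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1386
print(protein_length(length_bp))  # 461
```

Both results match the Gene Length and Protein Length fields of the record.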


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTAT TGATCAAAAA CGGCACTGTC GTTAACGCAG ATGGACAAGC CAAACAGGAT 
TTGCTGATTG AAAGCGGGAT TGTTCGCCAG TTGGGCAACA ATATTTCGCC GCAGCTCCCG
TATGAAGAAA TTGATGCCAC TGGCTGTTAC GTTTTCCCTG GCGGCGTGGA TGTCCATACG
CATTTCAATA TTGATGTCGG CATCGCGCGC AGTTGTGATG ATTTTTTTAC CGGTACCCGC
GCAGCTGCGT GTGGCGGTAC AACAACCATT ATTGACCATA TGGGATTTGG CCCAAACGGC
TGTCGGTTAC GCCATCAACT GGAGGTTTAT CGTGGTTATG CCGCCCATAA AGCGGTCATC
GATTACAGCT TTCACGGTGT GATCCAGCAC ATTAATCACG CAATCCTCGA CGAAATCCCG
ATGATGGTCG AGGAAGGACT GAGCAGTTTT AAACTCTATT TAACCTATCA ATACAAACTC
AACGATGACG AGGTTTTGCA GGCATTACGC CGTCTGCATG AATCCGGCGC GCTGACCACC
GTGCACCCGG AAAATGATGC GGCTATCGCC AGCAAGCGGG CGGAGTTTAT CGCCGCAGGG
TTAACCGCGC CGCGCTATCA CGCCTTGAGT CGCCCTCTGG AATGCGAAGC GGAAGCCATC
GCCCGCATGA TTAACCTGGC ACAAATTGCC GGTAACGCCC CGCTCTATAT CGTGCACCTG
TCTAACGGCT TAGGTCTGGA TTATCTGCGT CTTGCCCGTG CGAATCACCA GCCAGTCTGG
GTTGAAACCT GCCCACAATA TCTCCTGTTG GACGAACGCA GTTACGATAC AGAAGATGGC
ATGAAGTTCA TTCTTAGCCC ACCGCTGCGT AACGTACGCG AGCAGGACAA ACTGTGGTGT
GGCATCAGCG ATGGTGCGAT TGACGTGGTG GCAACCGATC ACTGCACCTT CTCGATGGCT
CAACGCCTGC AAATTTCTAA AGGCGATTTC AGTCGCTGCC CAAATGGCTT ACCCGGTGTG
GAAAACCGCA TGCAGTTACT GTTTTCCAGT GGCGTGATGA CGGGACGTAT AACACCGGAA
CGCTTTGTTG AATTAACCAG CGCAATGCCC GCCAGGCTGT TTGGCCTGTG GCCGCAAAAA
GGATTATTAG CGCCCGGTTC CGACGGCGAC GTGGTGATTA TCGACCCACG TCAGAGCCAA
CAAATTCAGC ATCGCCATCT CCACGACAAC GCCGACTACT CGCCATGGGA GGGTTTTACC
TGTCAGGGCG CGATTGTCAG AACCTTATCC CGTGGTGAAA CGATTTTCTG TGACGGCACC
TTTACAGGCA AAGCCGGGCG AGGTCGTTTC CTGCGACGCA AACCGTTTGT CCCTCCCGTG
CTCTAA
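
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in the same blocked layout (groups of 10 bases separated by spaces and line breaks). The function name `gc_percent` is hypothetical.

```python
def gc_percent(blocked_dna):
    """Percentage of G and C bases in a sequence.

    Whitespace (spaces and newlines from the blocked layout) is ignored.
    """
    dna = "".join(blocked_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Small illustrative input, not taken from the record:
print(gc_percent("ATGC GGCC"))  # 75.0
```

Applied to the full 1386 bp gene sequence, the rounded result can be compared with the GC content reported in the record.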
 
Protein sequence
MRVLIKNGTV VNADGQAKQD LLIESGIVRQ LGNNISPQLP YEEIDATGCY VFPGGVDVHT 
HFNIDVGIAR SCDDFFTGTR AAACGGTTTI IDHMGFGPNG CRLRHQLEVY RGYAAHKAVI
DYSFHGVIQH INHAILDEIP MMVEEGLSSF KLYLTYQYKL NDDEVLQALR RLHESGALTT
VHPENDAAIA SKRAEFIAAG LTAPRYHALS RPLECEAEAI ARMINLAQIA GNAPLYIVHL
SNGLGLDYLR LARANHQPVW VETCPQYLLL DERSYDTEDG MKFILSPPLR NVREQDKLWC
GISDGAIDVV ATDHCTFSMA QRLQISKGDF SRCPNGLPGV ENRMQLLFSS GVMTGRITPE
RFVELTSAMP ARLFGLWPQK GLLAPGSDGD VVIIDPRQSQ QIQHRHLHDN ADYSPWEGFT
CQGAIVRTLS RGETIFCDGT FTGKAGRGRF LRRKPFVPPV L
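
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of NCBI translation table 11, which are the same as the standard code for amino acids (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace from the blocked layout is ignored. Any trailing partial
    codon is skipped.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record:
print(translate("ATGCGCGTAT TGATCAAAAA CGGCACTGTC"))  # MRVLIKNGTV
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (CTCTAA) translate to the terminal L followed by a stop codon.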