Gene EcHS_A0590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0590 
SymbolallC 
ID5592412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp603634 
End bp604869 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content52% 
IMG OID640919774 
Productallantoate amidohydrolase 
Protein accessionYP_001457357 
Protein GI157160039 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family
[TIGR03176] allantoate amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACAC ATTTTCGTCA AGCCATAGAA GAAGCGCTGC CCTGGCTTTC CTCTTTTGGC 
GCTGACCCAA CGGGTGGGAT GACCCGTTTA CTTTATTCGC CGGAATGGCT GGAAACCCAG
CAGCAATTTA AAAAAAGAAT GGCAGCAAGT GGGCTGGAAA CACGTTTCGA TGAAGTAGGG
AATTTATACG GTCGCCTGAA TGGTACCGAA TATCCACAGG AAGTGATTCT GAGCGGTTCG
CATATCGATA CCGTGGTTAA CGGCGGTAAC CTCGACGGGC AATTCGGCGC GCTGGCGGCG
TGGCTGGCAA TTGACTGGCT GAAAACGCAA TACGGCGCAC CGTTACGTAC GGTCGAAGTG
GTGGCGATGG CAGAAGAAGA AGGCAGCCGC TTCCCGTATG TCTTCTGGGG CAGTAAAAAT
ATCTTTGGGC TGGCGAATCC TGACGACGTG CGGAATATCT GTGATGCCAA AGGAAATAGT
TTTGTCGATG CGATGAAGGC TTGCGGATTT ACTCTGCCGG ACGCCCCGCT AACTCCGCGT
CAGGATATTA AAGCCTTTGT CGAACTGCAT ATCGAACAGG GCTGTGTGCT GGAAAGTAAT
GGGCAATCAA TTGGCGTGGT GAATGCAATT GTCGGGCAAC GTCGCTATAC GGTGACGCTG
AACGGCGAAT CAAACCATGC AGGCACCACG CCGATGGGTT ATCGTCGTGA TACGGTTTAC
GCTTTCAGTC GCATTTGCCA TCAGTCGATC GAAAAAGCGA AAAAGATGGG CGATCCGCTG
GTTCTGACCT TTGGGAAAGT AGAGCCGCGC CCGAATACGG TGAATGTGGT GCCGGGTAAA
ACCACGTTCA CCATTGATTG TCGTCATACC GACGCCGCCG TGCTGCGTGA TTTCACCCAA
CAGTTAGAAA ACGACATGCG GGCGATTTGC GATGAAATGG ACATTGGTAT TGATATCGAT
TTATGGATGG ACGAAGAACC CGTGCCGATG AATAAGGACC TGGTCGCCAC CCTGACAGAA
TTGTGTGAAA GTGAAAAACT GAATTACCGG GTGATGCACA GTGGTGCCGG GCACGACGCG
CAAATTTTCG CGCCTCGCGT GCCGACCTGC ATGATTTTCA TTCCCAGCAT CAACGGGATC
AGCCATAACC CGGCGGAACG CACCAATATT ACCGACCTTG CCGAAGGGGT CAAAACGTTG
GCACTCATGC TTTATCAACT TGCCTGGCAG AAATAA
 
Protein sequence
MITHFRQAIE EALPWLSSFG ADPTGGMTRL LYSPEWLETQ QQFKKRMAAS GLETRFDEVG 
NLYGRLNGTE YPQEVILSGS HIDTVVNGGN LDGQFGALAA WLAIDWLKTQ YGAPLRTVEV
VAMAEEEGSR FPYVFWGSKN IFGLANPDDV RNICDAKGNS FVDAMKACGF TLPDAPLTPR
QDIKAFVELH IEQGCVLESN GQSIGVVNAI VGQRRYTVTL NGESNHAGTT PMGYRRDTVY
AFSRICHQSI EKAKKMGDPL VLTFGKVEPR PNTVNVVPGK TTFTIDCRHT DAAVLRDFTQ
QLENDMRAIC DEMDIGIDID LWMDEEPVPM NKDLVATLTE LCESEKLNYR VMHSGAGHDA
QIFAPRVPTC MIFIPSINGI SHNPAERTNI TDLAEGVKTL ALMLYQLAWQ K