Gene EcSMS35_0561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0561 
SymbolallC 
ID6146302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp567925 
End bp569160 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content52% 
IMG OID641615453 
Productallantoate amidohydrolase 
Protein accessionYP_001742660 
Protein GI170681093 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family
[TIGR03176] allantoate amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACAC ATTTCCGTCA AGCTATAGAA GAAACGCTGC CCTGGCTTTC CTCTTTTGGC 
TCTGACCCAA CGGGTGGAAT GACCCGTTTA CTTTATTCGC CGGAATGGTT GGAAACCCAG
CAGCAATTTA AAAAAAGAAT GGCAGCAAGC GGGCTGGAAA CACGTTTCGA TGAAGTGGGG
AATTTATACG GTCGCCTGAG TGGCACCGAA TATCCACAGG AAGTGGTTCT GAGCGGTTCG
CATATCGATA CCGTGGTTAA CGGCGGTAAC CTCGACGGAC AATTCGGCGC ACTGGCGGCG
TGGCTGGCAA TTGACTGGTT GAAAACGCAA TACGGCGCGC CGTTACGTAC GGTCGAAGTG
GTGGCGATGG CAGAAGAAGA AGGCAGCCGC TTCCCGTATG TCTTCTGGGG CAGTAAAAAT
ATCTTTGGGC TGGCGAATCC TGACGACGTG CGGAATATCT GTGATGCCAA AGGAAATAGT
TTTGTCGATG CGATGAAGGC TTGCGGATTT ACTCTGCCGA ACGCCCCACT AACTCCGCGT
CAGGATATTA AAGCCTTTGT CGAACTGCAT ATCGAACAGG GCTGTGTGCT GGAAAGTAAT
GGGCAATCAA TTGGCGTGGT GAATGCAATT GTCGGGCAAC GTCGCTATAC GGTGACGCTG
AATGGCGAAT CAAACCATGC AGGCACCACG CCGATGGGTT ATCGTCGTGA TACGGTTTAC
GCTTTCAGTC GCATTTGCCA TCAGTCGATC GAAAAAGCGA AAAAGATGGG CGACCCGCTG
GTGCTGACCT TTGGCAAGGT GGAACCGCGC CCGAATACGG TAAATGTGGT GCCGGGGAAA
ACCACGTTCA CCATTGATTG TCGTCATACC GACGCCGCCG TGCTGCGCAA TTTCACCCAA
CAGTTAGAAA ACGACATGCG GGCGATTTGC GATGAAATGG ACATTGGTAT TGATATCGAT
TTATGGATGG ACGAAGAACC GGTGCCGATG AATAAGGAGC TGGTCGCCAC CCTGACAGAA
TTGTGTGAAA GCGAAAAACT GAATTACCGG GTAATGCACA GTGGTGCCGG GCACGACGCA
CAAATTTTCG CGCCTCGCGT GCCGACCTGC ATGATTTTCA TTCCCAGCAT CAACGGGATC
AGCCATAACC CGGCGGAACG CACCAATATT ACCGACCTTG CCGAAGGGGT CAAAACGTTG
GCACTCATGC TTTATCAACT TGCCTGGCAG AAATAA
 
Protein sequence
MITHFRQAIE ETLPWLSSFG SDPTGGMTRL LYSPEWLETQ QQFKKRMAAS GLETRFDEVG 
NLYGRLSGTE YPQEVVLSGS HIDTVVNGGN LDGQFGALAA WLAIDWLKTQ YGAPLRTVEV
VAMAEEEGSR FPYVFWGSKN IFGLANPDDV RNICDAKGNS FVDAMKACGF TLPNAPLTPR
QDIKAFVELH IEQGCVLESN GQSIGVVNAI VGQRRYTVTL NGESNHAGTT PMGYRRDTVY
AFSRICHQSI EKAKKMGDPL VLTFGKVEPR PNTVNVVPGK TTFTIDCRHT DAAVLRNFTQ
QLENDMRAIC DEMDIGIDID LWMDEEPVPM NKELVATLTE LCESEKLNYR VMHSGAGHDA
QIFAPRVPTC MIFIPSINGI SHNPAERTNI TDLAEGVKTL ALMLYQLAWQ K