Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4411 |
Symbol | |
ID | 5587563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4402560 |
End bp | 4403501 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640928028 |
Product | thioesterase/acetyltransferase domain-containing protein |
Protein accession | YP_001465372 |
Protein GI | 157156308 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR02447] thioesterase domain, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCACC TTCGGGTTCC ACAAACAGAA GAAGAATTAG AGCGTTACTA TCAGTTTCGC TGGGAAATGT TGCGTAAGCC CCTGCATCAA CCAAAAGGTT CGGAACGCGA CGCGTGGGAT GCGATGGCGC ATCACCAGAT GGTCGTCGAC GAGCAGGGTA ATCTGGTGGC GGTAGGCCGA TTGTATATTA ATGCCGACAA TGAAGCGTCC ATTCGCTTTA TGGCCGTTCA CCCGGACGTG CAGGACAAAG GGTTAGGCAC GCTGATGGCG ATGACCCTGG AGTCGGTGGC GCGTCAGGAA GGCGTTAAGC GCGTGACCTG TAGCGCCCGT GAAGACGCGG TGGAGTTTTT CGCCAAGCTG GGATTCATTA ATCAGGGGGA GATCACCACC CCAACTACCA CGCCGATTCG CCATTTTTTG ATGATTAAAC CCGTTGCCAC TCTGGATGAT ATTTTGCATC GCGGCGACTG GTGCGCGCAG CTGCAACAGG CGTGGTACGA ACATATCCCG CTTAGTGAAA AAATGGGCGT GCGTATTCAA CAATATACCG GGCAAAAATT TATCACCACC ATGCCGGAAA CCGGTAATCA GAATCCGCAC CATACGCTGT TTGCCGGGAG TTTATTCTCA CTGGCAACGC TCACCGGTTG GGGGCTTATC TGGCTGATGC TGCGCGAACG CCATCTCGGC GGAACGATTA TTCTGGCGGA TGCGCATATC CGCTACAGCA AGCCGATTAG CGGTAAACCT CATGCGGTAG CCGACCTTGG TGCCTTAAGC GGCGATCTCG ACCGTCTGGC GCGCGGACGA AAAGCACGGG TGCAGATGCA AGTTGAAATC TTTGGCGACG AGACGCCGGG TGCAGTGTTT GAAGGCACGT ATATCGTTCT GCCCGCGAAG CCATTTGGCC CGTATGAAGA GGGCGGGAAC GAAGAAGAGT AG
|
Protein sequence | MYHLRVPQTE EELERYYQFR WEMLRKPLHQ PKGSERDAWD AMAHHQMVVD EQGNLVAVGR LYINADNEAS IRFMAVHPDV QDKGLGTLMA MTLESVARQE GVKRVTCSAR EDAVEFFAKL GFINQGEITT PTTTPIRHFL MIKPVATLDD ILHRGDWCAQ LQQAWYEHIP LSEKMGVRIQ QYTGQKFITT MPETGNQNPH HTLFAGSLFS LATLTGWGLI WLMLRERHLG GTIILADAHI RYSKPISGKP HAVADLGALS GDLDRLARGR KARVQMQVEI FGDETPGAVF EGTYIVLPAK PFGPYEEGGN EEE
|
| |