Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4273 |
Symbol | |
ID | 6144309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4372105 |
End bp | 4373046 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619094 |
Product | acetyltransferase |
Protein accession | YP_001746218 |
Protein GI | 170680652 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR02447] thioesterase domain, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCACC TTCGGGTTCC ACAAACAGAA GAAGAATTAG AGCGTTACTA TCAGTTTCGC TGGGAAATGT TGCGTAAGCC CCTGCATCAA CCGAAAGGTT CGGAACGCGA CGCGTGGGAT GCGATGGCGC ATCACCAGAT GGTCGTCGAC GAGCAGGGTA ATCTGGTGGC GGTAGGCCGA CTGTATATTA ATGCCGACAA TGAAGCGTCC ATTCGCTTTA TGGCCGTTCA TCCCGACGTG CAGGACAAAG GGTTAGGAAC GCTGATGGCG ATGACCCTTG AGTCGGTGGC GCGTCAGGAA GGCGTTAAGC GCGTGACCTG TAGTGCCCGT GAAGACGCGG TGGAGTTTTT TGCCAAACTG GGGTTTGTCA ATCAAGGGGA GATCACTACC CCAACCACCA CGCCGATTCG CCATTTTTTG ATGATTAAAC CCGTTGCCAC TCTGGATGAT ATTTTGCATC GCGGCGACTG GTGCGCGCAG CTGCAACAGG CGTGGTACGA ACATATCCCG CTAAGCGAAA AAATGGGCGT GCGCATTCAG CAATATACCG GGCAAAAATT TATCACCACC ATGCCGGAAA CCGGTAATCA GAATCCGCAC CATACGCTGT TTGCCGGGAG TTTATTCTCA CTGGCAACGC TCACCGGTTG GGGACTTATC TGGCTGATGC TGCGTGAACG CCATCTGGGC GGAACGATTA TTCTGGCGGA TGCGCATATT CGCTACAGCA AGCCGATTAG CGGTAAACCT CATGCGGTAG CCGACCTCGG TGCCTTAAGC GGCGATCTCG ACCGTCTGGC GCGCGGGCGC AAAGCGCGGG TACAAATGCA GGTCGAAATC TTTGGCGACG AGACGCCGGG TGCAGTGTTT GAAGGCACGT ATATCGTTCT GCCCGCGAAG CCATTTGGCC CCTATGAAGA GGGCGGGAAC GAAGAAGAGT AG
|
Protein sequence | MYHLRVPQTE EELERYYQFR WEMLRKPLHQ PKGSERDAWD AMAHHQMVVD EQGNLVAVGR LYINADNEAS IRFMAVHPDV QDKGLGTLMA MTLESVARQE GVKRVTCSAR EDAVEFFAKL GFVNQGEITT PTTTPIRHFL MIKPVATLDD ILHRGDWCAQ LQQAWYEHIP LSEKMGVRIQ QYTGQKFITT MPETGNQNPH HTLFAGSLFS LATLTGWGLI WLMLRERHLG GTIILADAHI RYSKPISGKP HAVADLGALS GDLDRLARGR KARVQMQVEI FGDETPGAVF EGTYIVLPAK PFGPYEEGGN EEE
|
| |