Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4366 |
Symbol | |
ID | 6273091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4076949 |
End bp | 4077890 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728170 |
Product | acetyltransferase, GNAT family |
Protein accession | YP_001882583 |
Protein GI | 187731115 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR02447] thioesterase domain, putative |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCACC TTCGGGTTCC ACAAACAGAA GAAGAGTTAG AGCGTTACTA TCAGTTTCGC TGGGAAATGT TGCGTAAGCC CCTGCATCAA CCAAAAGGTT CGGAACGCGA CGCGTGGGAT GCGATGGCGC ATCATCAGAT GGTCGTCGAC GAGCAGGGGA ATCTGGTGGC GGTAGGCCGA CTGTATATTA ATGCCGACAA TGAAGCGTCC ATTCGCTTTA TGGCCGTTCA TCCCGACGTG CAGGACAAAG GGCTGGGGAC GTTGATGGCG ATGACCCTGG AGTCGGTGGC GCGTCAGGAA GGCGTTAAGC GCGTGACCTG TAGCGCCCGT GAAGACGCGG TGGAGTTTTT CGCCAAGCTG GGGTTTGTTA ATCAGGGAGA AATCACCACA CCAACCACCA CGCCGATTCG CCATTTTTTG ATGATTAAAC CTGTTGCCAC TCTGGATGAT ATTTTGCATC GCGGCGACTG GTGCGCGCAG CTGCAACAGG CGTGGTACGA ACACATCCCG CTTAGTGAAA AAATGGGCGT GCGCATTCAG CAATATACCG GGCAAAAATT TATCACCACC ATGCCGGAAA CCGGTAATCA GAATCTGCAC CATACGCTGT TTGCCGGGAG TTTATTCTCG CTGGCAACGC TCACCGGTTG GGGGCTTATC TGGCTGATGC TGCGCGAACG CCACCTCGGC GGAACGATTA TTCTGGCCGA TGCGCATATC CGCTACAGCA AGCCGATTAG CGGTAAACCT CATGCGGTAG CCGACCTTGG TGCCTTAAGC GGCGATCTCG ACCGTCTGGC GCGCGGACGA AAAGCACGGG TGCAGATGCA AGTTGAAATC TTTGGCGACG AGACGCCGGG TGCAGTGTTT GAAGGCACGT ATATCGTTCT GCCCGCGAAG CCATTTGGCC CGTATGAAGA GGGCGGGAAC GAAGAAGAGT AG
|
Protein sequence | MYHLRVPQTE EELERYYQFR WEMLRKPLHQ PKGSERDAWD AMAHHQMVVD EQGNLVAVGR LYINADNEAS IRFMAVHPDV QDKGLGTLMA MTLESVARQE GVKRVTCSAR EDAVEFFAKL GFVNQGEITT PTTTPIRHFL MIKPVATLDD ILHRGDWCAQ LQQAWYEHIP LSEKMGVRIQ QYTGQKFITT MPETGNQNLH HTLFAGSLFS LATLTGWGLI WLMLRERHLG GTIILADAHI RYSKPISGKP HAVADLGALS GDLDRLARGR KARVQMQVEI FGDETPGAVF EGTYIVLPAK PFGPYEEGGN EEE
|
| |