Gene Information |
Locus tag | SbBS512_E1211 |
Symbol | hisC |
ID | 6269071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1118005 |
End bp | 1119075 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725342 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001879856 |
Protein GI | 187733922 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
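The coordinate and length fields above are related by simple arithmetic, and a short sketch can check that they agree. The function names below are hypothetical and not part of the source database. The sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1118005, 1119075))  # 1071
print(protein_length(1071))           # 356
```

Both results match the listed Gene Length (1071 bp) and Protein Length (356 aa).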
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT CAGTCGGCGC GTCGTCTGGG CGGTAATGGC GACGTCTGGC TGAACGCCAA CGAATATCCC ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGTCAGCCG AAAGCGGTGA TTGAAAATTA CGCGCAATAT GCAGGCGTAA AACCGGAACA GGTGCTGGTC AGCCGTGGCG CGGACGAAGG TATTGAACTG CTGATCCGCG CTTTTTGCGA ACCGGGTAAA GACTCCATCC TCTACTGCCC ACCAACGTAC GGCATGTACA GCGTCAGCGC CGAAACGATT GGCGTCGAGT GCCGCACAGT GCCGACGCTG GACAACTGGC AACTGGACTT ACAGGGCATT TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCGACCGGG CAACTGATCA ATCCGCAGGA TTTTCGCACC CTGCTGGAGT TAACGCGCGG TAAGGCGATT GTGGTTGCCG ATGAAGCCTA TATAGAGTTT TGCCCGCAGG CATCGCTGGC TGGCTGGCTG ACGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATT GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG GGGATCGTCG CCATGCGTGA GCGGGTGGCG CAAATTATTG CAGAACGCGA ATACCTGATT GCCGCACTGA AAGAGATCCC CTGCGTGGAG CAGGTTTTCG ACTCTGAAAC CAACTACATT CTGGCGCGCT TTAAAGCCTC CAGTGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTG A
|
Protein sequence | MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP KAVIENYAQY AGVKPEQVLV SRGADEGIEL LIRAFCEPGK DSILYCPPTY GMYSVSAETI GVECRTVPTL DNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI VVADEAYIEF CPQASLAGWL TEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI APYPLSTPVA DIAAQALSPQ GIVAMRERVA QIIAEREYLI AALKEIPCVE QVFDSETNYI LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV
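The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. The helper names `translate` and `gc_fraction` are hypothetical. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied even though the strand is listed as "-"), and it uses the codon assignments of NCBI translation table 11, which for sense codons are the same as the standard code.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 21 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGAGCACCG TGACTATTAC C"))  # MSTVTIT
print(translate("GCGGAGCAAGTTTGA"))          # AEQV
```

The two fragments reproduce the start (MSTVTIT) and end (AEQV) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be compared in the same way.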