Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1039 |
Symbol | hisC |
ID | 6142706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1059632 |
End bp | 1060702 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615926 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001743118 |
Protein GI | 170683323 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.191971 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAGAACG TCCGCAACCT GACGCCGTAT CAGTCGGCGC GTCGTCTGGG CGGTAACGGC GACGTCTGGC TGAACGCCAA CGAATACCCC ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGCCAGCCG AAAGCGGTGA TTGAAAATTA CGCGCAGTAT GCAGGCGTGA AGGCGGAGCA GGTGCTGGTC AGCCGTGGCG CGGACGAAGG TATTGAACTG CTGATTCGCG CTTTTTGCGA ACCGGGTAAA GACGCCATCC TCTACTGCCC GCCAACGTAC GGCATGTACA GCGTCAGCGC CGAAACCATT GGCGTCGAGT GCCGCACAGT GCCGACGCTG AAAAACTGGC AACTGGACTT GCAGGGCATT TCCGACAAGC TGGACGGCGT AAAAGTGGTT TATGTTTGCA GCCCCAACAA CCCGACCGGG CAACTGATCA ATCCACAGGA TTTTCGCACC CTGCTGGAGT TAACGCGCGG TAAAGCGATT GTGGTTGCCG ATGAGGCCTA TATCGAGTTT TGCCCGCAGG CATCGCTGGC TGGCTGGCTG GCGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCCTTCGC TCTGGCGGGC CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATC GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCAG CCCAGGCGTT AAGCCCGCAG GGGATCGTCG CCATGCGCGA ACGAGTGGCG CAAATTATTG CTGAACGCGA ATACCTGATG GCCGCACTGA AAGAGATCCC CTGCGTGGAG CAGGTTTTCG ACTCCGAAAC CAACTACATT CTGGCGCGCT TTAAAGCCTC CAGCGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTG A
|
Protein sequence | MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP KAVIENYAQY AGVKAEQVLV SRGADEGIEL LIRAFCEPGK DAILYCPPTY GMYSVSAETI GVECRTVPTL KNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI VVADEAYIEF CPQASLAGWL AEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI APYPLSTPVA DIAAQALSPQ GIVAMRERVA QIIAEREYLM AALKEIPCVE QVFDSETNYI LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV
|
| |