Gene Information |
Locus tag | B21_01909 |
Symbol | hisC |
ID | 8116336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1987514 |
End bp | 1988584 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644848124 |
Product | hypothetical protein |
Protein accession | YP_002999697 |
Protein GI | 251785393 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
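The coordinate and length fields above are consistent with one another. The sketch below shows the arithmetic, assuming the Start bp and End bp values are 1-based inclusive positions and that the gene length includes the stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1987514, 1988584)
print(length)                  # 1071
print(protein_length(length))  # 356
```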
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT CAGTCGGCGC GTCGTCTGGG CGGTAACGGC GACGTCTGGC TGAACGCCAA CGAATACCCC ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGTCAGCCG AAAGCGGTGA TTGACAATTA CGCGCAGTAT GCAGGCATAA AACCGGAGCA GGTGCTGGTC AGCCGTGGCG CGGACGAAGG TATTGAACTA CTGATTCGCG CTTTTTGCGA ACCGGGTAAA GACGCCATCC TCTACTGCCC GCCAACGTAC GGCATGTACA GCGTCAGCGC TGAAACCATT GGCGTCGAGT GCCGCACAGT GCCGACGCTG GACAACTGGC AACTGGACTT GCAGGGCATT TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCGACAGGG CAACTGATCA ATCCGCAGGA TTTTCGCACC CTGCTGGAGT TAACCCGCGG TAAGGCGATT GTGGTTGCCG ATGAAGCCTA TATCGAGTTT TGCCCACAGG CATCGCTGGC TGGCTGGCTG ACGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATC GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG GGAATCGTCG CCATGCGTGA ACGGGTAGTG CAAATTATTG CTGAACGCGA ATACCTGATT GCCGCATTGA AAGAAATCCC CTGCGTGGAG CAGGTTTTCG ACTCCGAAAC CAACTACATT CTGGCGCGCT TTAAAGCCTC CAGCGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTA A
|
Protein sequence | MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP KAVIDNYAQY AGIKPEQVLV SRGADEGIEL LIRAFCEPGK DAILYCPPTY GMYSVSAETI GVECRTVPTL DNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI VVADEAYIEF CPQASLAGWL TEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI APYPLSTPVA DIAAQALSPQ GIVAMRERVV QIIAEREYLI AALKEIPCVE QVFDSETNYI LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV
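The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, checked for GC content, and translated follows. It assumes only the bases A, C, G and T occur, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard codon assignments in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G or T.
    """
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAGCACCG TGACTATTAC CGATTTAGCG"))  # MSTVTITDLA
```

Passing the full Gene sequence field to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence fields of this record.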