Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01907 |
Symbol | hisG |
ID | 8116342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1985308 |
End bp | 1986207 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644848122 |
Product | hypothetical protein |
Protein accession | YP_002999695 |
Protein GI | 251785391 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0040] ATP phosphoribosyltransferase |
TIGRFAM ID | [TIGR00070] ATP phosphoribosyltransferase [TIGR03455] ATP phosphoribosyltransferase, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACA ACTCTCGTTT ACGCATAGCT ATGCAGAAAT CCGGCCGTTT AAGTGATGAC TCACGCGAAT TACTGGCGCG CTGTGGCATT AAAATCAATC TTCACACCCA GCGCCTGATT GCGATGGCAG AAAACATGCC GATTGATATT CTGCGCGTGC GTGACGATGA CATTCCCGGT CTGGTAATGG ACGGCGTGGT AGACCTTGGG ATTATCGGCG AAAACGTGCT GGAAGAAGAG CTGCTTAACC GCCGCGCCCA GGGTGAAGAT CCGCGCTACT TTACCCTGCG TCGTCTGGAT TTCGGCGGCT GCCGCCTTTC GCTGGCAACG CCGGTTGATG AAGCCTGGGA CGGTCCGCTC TCCTTAAACG GTAAACGTAT CGCCACCTCT TATCCTCACC TGCTCAAGCG TTATCTCGAC CAGAAAGGTA TCTCTTTTAA ATCCTGCTTA CTGAACGGTT CTGTCGAAGT TGCGCCGCGC GCCGGACTGG CGGATGCGAT TTGCGATCTG GTTTCCACCG GTGCCACGCT GGAAGCTAAC GGCCTGCGCG AAGTCGAAGT TATCTACCGC TCGAAAGCCT GCCTGATCCA GCGCGATGGC AAAATGGAAG AATCTAAACA GCAACTGATC GACAAGCTGC TGACCCGTAT TCAGGGTGTG ATCCAGGCGC GCGAATCAAA ATACATCATG ATGCACGCAC CGACCGAACG CCTGGATGAA GTCATCGCCC TGCTGCCAGG TGCCGAACGC CCAACTATTC TGCCGCTGGC GGGTGATCAA CAGCGCGTAG CGATGCACAT GGTCAGTAGC GAAACCCTGT TCTGGGAAAC GATGGAAAAA CTGAAAGCGC TGGGTGCCAG TTCAATCCTG GTCCTGCCGA TTGAGAAGAT GATGGAGTGA
|
Protein sequence | MTDNSRLRIA MQKSGRLSDD SRELLARCGI KINLHTQRLI AMAENMPIDI LRVRDDDIPG LVMDGVVDLG IIGENVLEEE LLNRRAQGED PRYFTLRRLD FGGCRLSLAT PVDEAWDGPL SLNGKRIATS YPHLLKRYLD QKGISFKSCL LNGSVEVAPR AGLADAICDL VSTGATLEAN GLREVEVIYR SKACLIQRDG KMEESKQQLI DKLLTRIQGV IQARESKYIM MHAPTERLDE VIALLPGAER PTILPLAGDQ QRVAMHMVSS ETLFWETMEK LKALGASSIL VLPIEKMME
|
| |