Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02306 |
Symbol | eutG |
ID | 8116596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2432023 |
End bp | 2433210 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644848510 |
Product | hypothetical protein |
Protein accession | YP_003000083 |
Protein GI | 251785779 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATG AATTGCAGAC CGCGCTCTTT CAGGCGTTCG ATACCCTGAA TCTGCAACGG GTAAAAACAT TTAGCGTTCC ACCGGTGACG CTTTGCGGTC CGGGCGCGGT GAGCAGTTGC GGGCAGCAAG CGCAAACGCG TGGGCTGAAA CATCTGTTCG TGATGGCAGA CAGCTTTTTG CATCAGGCGG GGATGACCGC CGGGCTGACG CGCAGCCTGG CTGTTAAAGG CATCGCCATG ACGCTCTGGC CATGTCCGGT GGGCGAACCG TGCATTACCG ACGTGTGTGC AGCCGTGGCG CAGTTGCGTG AGTCAGGCTG TGATGGGGTG ATCGCATTTG GCGGCGGCTC GGTGCTGGAT GCGGCGAAAG CCGTGGCGTT GCTGGTGACG AACCCCGATA GCACGCTGGC AGAGATGTCA GAAACCAGCG TTCTGCAACC GCGCTTGCCG CTGATTGCCA TTCCAACGAC CGCCGGAACC GGCTCTGAAA CCACCAATGT AACGGTGATT ATCGACGCGG TGAGCGGGCG CAAGCAGGTG TTAGCCCATG CCTCGCTGAT GCCGGATGTG GCGATCCTCG ACGCCGCATT GACCGAAGGT GTGCCGTCGC ATGTCACGGC GATGACCGGC ATTGATGCGT TAACCCATGC CATTGAAGCA TACAGCGCCC TGAACGCTAC ACCGTTTACC GACAGCCTGG CGATTGGTGC CATTGCGATG ATTGGCAAAT CGCTGCCGAA AGCGGTGGGC TACGGTCACG ACCTTGCCGC GCGCGAGAGC ATGTTACTGG CTTCATGTAT GGCGGGAATG GCGTTTTCCA GTGCGGGTCT TGGGTTGTGC CACGCGATGG CGCATCAGCC GGGCGCGGCG CTGCATATTC CGCACGGTCT CGCGAACGCC ATGTTGCTGC CAACGGTGAT GGAATTTAAC CGGATGGTTT GTCGTGAACG CTTTAGTCAG ATTGGTCGGG CACTGCGAAC TAAAAAATCC GACGATCGTG ACGCTATTAA CGCGGTAAGT GAGCTGATTG CGGAAGTTGG GATTGGTAAA CGACTGGGCG ATGTTGGTGC GACATCTGCG CATTACGGCG CATGGGCGCA GGCCGCGCTG GAAGATATTT GTCTGCGCAG TAACCCGCGT ACCGCCAGCC TGGAGCAGAT TGTCGGCCTG TACGCAGCGG CGCAATAA
|
Protein sequence | MQNELQTALF QAFDTLNLQR VKTFSVPPVT LCGPGAVSSC GQQAQTRGLK HLFVMADSFL HQAGMTAGLT RSLAVKGIAM TLWPCPVGEP CITDVCAAVA QLRESGCDGV IAFGGGSVLD AAKAVALLVT NPDSTLAEMS ETSVLQPRLP LIAIPTTAGT GSETTNVTVI IDAVSGRKQV LAHASLMPDV AILDAALTEG VPSHVTAMTG IDALTHAIEA YSALNATPFT DSLAIGAIAM IGKSLPKAVG YGHDLAARES MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGLANA MLLPTVMEFN RMVCRERFSQ IGRALRTKKS DDRDAINAVS ELIAEVGIGK RLGDVGATSA HYGAWAQAAL EDICLRSNPR TASLEQIVGL YAAAQ
|
| |