Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01831 |
Symbol | unknown |
ID | 8113743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1899811 |
End bp | 1900839 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848051 |
Product | hypothetical protein |
Protein accession | YP_002999624 |
Protein GI | 251785320 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCACCAGG ATACTAATCG TCTGATTAAA GCATGGCAGA AACCGGAGAT GATCGTCGTT TCTGAATGCT ACTGGACCGC AGCTGCTAAA CATGCAGATA TCGTATTACC GATCACCACA TCGTTTGAGC GCAATGACTT GACGATGACC GGTGATTACA GTAACCAGCA TATTGTGCCG ATGAAGCAGG CTGTCGCTCC GCAATTTGAA GCGCGAAACG ATTTTGATGT GTTTGCCGAT CTTGCGGAAT TACTGAAACC TGGCGGAAAA GAGATCTATA CCGAAGGTAA AGATGAAATG GCGTGGCTGA AATTTTTCTA TGATGCCGCT CAGAAAGGTG CCCGTGCGCA ACGCGTGACT ATGCCGATGT TTAATGCCTT CTGGCAGCAA AATAAACTGA TCGAAATGCG CCGCAGCGAG AAGAACGAAC AGTACGTTCG TTATGGTGAT TTCCGCGCCG ATCCGGTGAA AAATGCGCTG GGTACGCCAA GCGGCAAAAT TGAGATTTAC TCCAAAACGC TGGAAAAATT TGGCTATAAG GATTGCCCGG CACACCCAAC CTGGCTTGCG CCTGATGAGT GGAAGGGTAC CGCCGACGAG AAGCAGTTGC AGCTTCTGAC CGCACATCCG GCACACCGTT TACATAGTCA GCTTAACTAT GCGGAACTGC GTAAAAAATA TGCGGTTGCA GATCGTGAAC CAATCACTAT TCACACCGAA GATGCTGCTC GCTTTGGTAT TGCGAATGGT GATCTGGTGC GTGTGTGGAA CAAACGTGGT CAGATTCTGA CAGGCGCGGT GGTGACTGAC GGGATCAAAA AAGGCGTGGT ATGCGTGCAT GAAGGTGCAT GGCCAGATCT GGAAAATGGC TTGTGTAAAA ACGGCAGTGC GAACGTGTTA ACGGCGGATA TCCCCAGCTC GCAGCTGGCA AATGCCTGTG CCGGTAACTC TGCGCTGGTG TATATCGAAA AATATACGGG CAATGCGCCG AAGTTAACGG CGTTTGATCA GCCAGCTATT CAGGCATAA
|
Protein sequence | HHQDTNRLIK AWQKPEMIVV SECYWTAAAK HADIVLPITT SFERNDLTMT GDYSNQHIVP MKQAVAPQFE ARNDFDVFAD LAELLKPGGK EIYTEGKDEM AWLKFFYDAA QKGARAQRVT MPMFNAFWQQ NKLIEMRRSE KNEQYVRYGD FRADPVKNAL GTPSGKIEIY SKTLEKFGYK DCPAHPTWLA PDEWKGTADE KQLQLLTAHP AHRLHSQLNY AELRKKYAVA DREPITIHTE DAARFGIANG DLVRVWNKRG QILTGAVVTD GIKKGVVCVH EGAWPDLENG LCKNGSANVL TADIPSSQLA NACAGNSALV YIEKYTGNAP KLTAFDQPAI QA
|
| |