Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00246 |
Symbol | yagR |
ID | 8114752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 268916 |
End bp | 271114 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644846536 |
Product | hypothetical protein |
Protein accession | YP_002998109 |
Protein GI | 251783805 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTG ATAAACCCGC AGGGGAAAAC CCGATCGATC AGCTGAAGGT TGTCGGTCGT CCCTATGACC GCATCGACGG ACCGCTGAAA ACTACCGGCA CGGCACGCTA CGCCTACGAA TGGCATGAAG AATCCCCCAA CGCCGCCTAT GGCTATATCG TCGGTTCCGC CATGGCCAAA GGACGCCTCA CCGCCCTTGA TACGGACGCC GCGCAAAAAG CGCCGGGCGT ACTGGCTGTC ATTACCGCCA GTAACGCCGG GGCACTCGGC AAAGGCGACA AAAACACCGC CAGGCTGTTA GGCGGTCCCA CTATTGAGCA CTATCATCAG GCCATTGCGC TGGTAGTGGC CGAGACCTTC GAACAGGCGC GAGCGGCGGC CTCGCTGGTG CAGGCGCACT ATCGCCGTAA TAAAGGAGCT TACTCCCTGG CGGACGAAAA ACAGGCCGTC AGTCAGCCGC CGGAAGACAC GCCCGACAAA AACGTCGGTG ACTTTGACGG AGCTTTCACC TCCGCTGCGG TGAAGATTGA TGCTACCTAC ACGACCCCGG ACCAGAGCCA TATGGCGATG GAGCCGCATG CCTCGATGGC CGTCTGGGAT GGAAATAAGC TTACTCTCTG GACCTCAAAT CAGATGATTG ACTGGTGCCG CACCGATCTG GCAAAAACGC TAAAAGTGCC CGTGGAGAAT GTGCGTATTA TCTCCCCGTA TATCGGCGGA GGGTTTGGCG GCAAGCTGTT CCTGAGAAGC GATGCGCTGC TGGCGGCCCT CGCCGCCCGA GCGGTGAAAC GTCCGGTTAA AGTGATGCTC CCCCGCCCCA CTATTCCCAA TAACACCACG CACCGCCCCG CCACCCTTCA GCACCTGCGT ATCGGTGCCG ACCAGAGCGG GAAAATCACA GCTATCTCAC ATGAAAGCTG GTCCGGAAAC CTGCCCGGCG GCACGCCGGA AACGGCGGTA CAGCAAAGCG AATTACTCTA CGCCGGGGCA AACCGTCATA CCGGCCTGCG GCTCGCCACG CTTGATTTGC CGGAAGGGAA CGCCATGCGT GCGCCCGGCG AAGCCCCCGG TCTGATGGCG CTCGAAATCG CGATCGACGA ACTGGCGGAA AAAGCGGGCA TCGATCCCGT CGAGTTTCGC ATCCTGAATG ACACTCAGGT TGACCCCGCC GACCCGACGC GCCGCTTCTC TCGCCGTCAG CTTATCGAGT GCTTGCGCAC CGGAGCGGAT AAATTTGGCT GGAAGCAGCG CAACGCCACA CCCGGACAGG TGCGCGACGG GGAGTGGCTA GTCGGCCACG GCGTCGCGGC GGGCTTTCGC AATAATCTGC TGGAAAAATC GGGGGCTCGG GTTCACCTCG AACCAAACGG CACCGTTACC GTGGAAACGG ACATGACCGA CATTGGCACC GGCAGCTACA CCATTCTGGC CCAGACGGCA GCGGAAATGC TTGGCGTACC GCTGGAGCAG GTTGCGGTTC ACCTCGGCGA TTCCAGTTTC CCGGTTTCTG CGGGTTCTGG TGGACAATGG GGCGCGAATA CCTCCACCTC CGGCGTTTAC GCCGCCTGTG TGAAGCTTCG CGAAATGATT GCCTCGGCAG TCGGGTTTGA TCCTGAGCAG TCGCAGTTTG CCGACGGCAA GATTACCAAC GGTACCCGAA GCGCCATGCT ACATGAGGCC ACCGCAGGCG GCAGACTGAT AGCGGAAGAG AGCATTGAAT TCGGAACACT GAGCAAGGAG TACCAGCAGT CGACCTTTGC CGGGCATTTT GTGGAGGTCG GCGTGCATAG CGCGACGGGA GAAGTTCGGG TCCGGCGTAT GCTCGCTGTG TGTGCTGCAG GACGCATCCT GAATCCGAAA ACTGCACGCA GCCAGGTCAT TGGCGCAATG ACTATGGGCA TGGGCGCGGC ACTGATGGAG GAGCTGGCGG TGGATGACCG TTTGGGCTAC TTCGTTAATC ACGATATGGC GGGGTATGAG GTGCCGGTTC ATGCGGATAT CCCAAAACAG GAGGTGATTT TCCTGGATGA TACCGACCCC ATCTCCTCCC CGATGAAGGC CAAAGGTGTC GGTGAGCTGG GCCTGTGCGG CGTGAGCGCG GCTATCGCCA ACGCGGTGTA TAACGCCACC GGTATTCGGG TACGGGATTA TCCCATCACT CTGGATAAGC TGCTCGATAA ACTGCCGGAT GTGGTTTAA
|
Protein sequence | MKFDKPAGEN PIDQLKVVGR PYDRIDGPLK TTGTARYAYE WHEESPNAAY GYIVGSAMAK GRLTALDTDA AQKAPGVLAV ITASNAGALG KGDKNTARLL GGPTIEHYHQ AIALVVAETF EQARAAASLV QAHYRRNKGA YSLADEKQAV SQPPEDTPDK NVGDFDGAFT SAAVKIDATY TTPDQSHMAM EPHASMAVWD GNKLTLWTSN QMIDWCRTDL AKTLKVPVEN VRIISPYIGG GFGGKLFLRS DALLAALAAR AVKRPVKVML PRPTIPNNTT HRPATLQHLR IGADQSGKIT AISHESWSGN LPGGTPETAV QQSELLYAGA NRHTGLRLAT LDLPEGNAMR APGEAPGLMA LEIAIDELAE KAGIDPVEFR ILNDTQVDPA DPTRRFSRRQ LIECLRTGAD KFGWKQRNAT PGQVRDGEWL VGHGVAAGFR NNLLEKSGAR VHLEPNGTVT VETDMTDIGT GSYTILAQTA AEMLGVPLEQ VAVHLGDSSF PVSAGSGGQW GANTSTSGVY AACVKLREMI ASAVGFDPEQ SQFADGKITN GTRSAMLHEA TAGGRLIAEE SIEFGTLSKE YQQSTFAGHF VEVGVHSATG EVRVRRMLAV CAAGRILNPK TARSQVIGAM TMGMGAALME ELAVDDRLGY FVNHDMAGYE VPVHADIPKQ EVIFLDDTDP ISSPMKAKGV GELGLCGVSA AIANAVYNAT GIRVRDYPIT LDKLLDKLPD VV
|
| |