Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03722 |
Symbol | yiiD |
ID | 8114563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3982115 |
End bp | 3983104 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644849883 |
Product | hypothetical protein |
Protein accession | YP_003001456 |
Protein GI | 251787152 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR02447] thioesterase domain, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.645417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC TTCCAGGGTT GTCACGGGAA ACAAGAGAGA GTATCGCTAT GTATCACCTT CGGGTTCCAC AAACAGAAGA AGAATTAGAG CGTTACTATC AGTTTCGCTG GGAAATGTTG CGTAAGCCCC TGCATCAACC AAAAGGTTCG GAACGCGACG CGTGGGATGC GATGGCGCAT CACCAGATGG TCGTCGACGA GCAGGGTAAT CTGGTGGCGG TAGGCCGACT GTATATTAAT GCCGACAATG AAGCGTCCAT TCGCTTTATG GCCGTTCATC CCGACGTGCA GGACAAAGGG TTAGGCACGC TGATGGCGAT GACCCTGGAG TCGGTGGCGC GTCAGGAAGG CGTTAAGCGC GTGACCTGTA GCGCCCGTGA AGACGCGGTG GAGTTTTTCG CCAAGCTGGG GTTTGTTAAT CAGGGAGAAA TCACCACGCC AACCACCACG CCGATTCGCC ATTTTTTGAT GATTAAGCCC GTCGCCACTC TGGATGACAT TCTGCATCGC GGCGACTGGT GCGCGCAGCT GCAACAGGCG TGGTACGAAC ATATCCCGCT TAGTGAAAAA ATGGGCGTGC GCATTCAGCA ATATACCGGG CAAAAATTTA TCACTACCAT GCCAGAAACC GGCAATCAGA ATCCGCACCA TACGCTGTTT GCCGGGAGTT TATTCTCACT GGCGACGCTC ACCGGTTGGG GACTTATCTG GCTGATGCTG CGTGAACGCC ACCTCGGCGG AACGATTATT CTTGCGGATG CGCATATCCG CTACAGCAAG CCGATTAGCG GTAAACCTCA TGCGGTAGCC GACCTCGGTG CCTTAAGCGG CGATCTCGAC CGTCTGGCGC GCGGACGAAA AGCACGGGTG CAGATGCAGG TCGAAATCTT TGGCGACGAG ACGCCGGGTG CAGTGTTTGA AGGCACGTAT ATCGTTCTGC CCGCGAAGCC ATTTGGCCCG TATGAAGAGG GCGGGAACGA AGAAGAGTAG
|
Protein sequence | MSQLPGLSRE TRESIAMYHL RVPQTEEELE RYYQFRWEML RKPLHQPKGS ERDAWDAMAH HQMVVDEQGN LVAVGRLYIN ADNEASIRFM AVHPDVQDKG LGTLMAMTLE SVARQEGVKR VTCSAREDAV EFFAKLGFVN QGEITTPTTT PIRHFLMIKP VATLDDILHR GDWCAQLQQA WYEHIPLSEK MGVRIQQYTG QKFITTMPET GNQNPHHTLF AGSLFSLATL TGWGLIWLML RERHLGGTII LADAHIRYSK PISGKPHAVA DLGALSGDLD RLARGRKARV QMQVEIFGDE TPGAVFEGTY IVLPAKPFGP YEEGGNEEE
|
| |