Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04061 |
Symbol | ytfR |
ID | 8116683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4360764 |
End bp | 4362266 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644850209 |
Product | hypothetical protein |
Protein accession | YP_003001782 |
Protein GI | 251787478 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTA ACCAACACCA GGAGATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGCC GCGGCGAAAT CATGGCGCTG CTCGGTGAAA ACGGGGCAGG AAAATCAACG CTAATCAAAG CATTAACTGG TGTATACCAC GCCGATCGCG GCACCATCTG GCTGGAAGGC CAGGCTATCT CACCGAAAAA TACCGCCCAT GCACAACAAC TCGGTATCGG CACCGTCTAT CAGGAAGTAA ACCTGCTACC CAATATGTCG GTCGCTGATA ATCTATTTAT AGGCCGCGAA CCCAAACGCT TCGGCCTTCT ACGCCGCAAA GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC GAACCGCTCA ACCGCTTTTC AGTCGCGATG CAGCAAATCG TCGCTATTTG CCGGGCTATC GATCTCTCTG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGTCTCGA CACCCAGGAA GTGGAGTTAC TGTTTGACCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATTTTT GTCACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC GGCAGTTTCG TAGGCTGTCG GGAAACGTGC GAGCTACCGC AGATCGAACT GGTAAAAATG ATGCTGGGGC GCGAGCTGGA TACCCACGCG CTACAGCGTG CCGGGCGAAC ATTGTTGAGC GACAAACCCG TTGCCGCGTT CAAAAATTAC GGCAAAAAAG GAACGATCGC ACCGTTTGAT CTCGAAGTAC GCCCCGGCGA GATCGTCGGT CTGGCTGGAT TGCTGGGATC AGGACGTACC GAAACCGCCG AAGTGATCTT CGGTATCAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA GGCAAACCGC AAAACCTGCG ATCGCCACAT CAGGCTTCGG TACAGGGCAT TGGCTTCTGC CCGGAAGACA GGAAAACCGA TGGCATCATC GCTGCCGCCT CGGTGCGGGA AAATATCATC CTCGCTCTCC AGGCCCAGCG CGGCTGGCTA CGTCCCATTT CCCGCAAAGA ACAGCAAGAG ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACAC CTTCAACTGA ACAACCGATT GAATTTCTCT CTGGCGGCAA TCAGCAAAAA GTGTTGCTTT CACGTTGGCT ACTGACCCGA CCGCAATTTC TGATCCTTGA TGAGCCAACT CGCGGCATTG ATGTTGGTGC CCACGCCGAG ATCATCCGCC TGATTGAAAC GCTATGCGCC GATGGTCTGG CGCTGCTGGT GATCTCCTCC GAACTGGAAG AACTGGTGGG CTATGCCGAC CGGGTGATCA TCATGCGCGA TCGCAAACAG GTGGCGGAGA TCCCGCTGGC AGAGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG TAA
|
Protein sequence | MTTNQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH ADRGTIWLEG QAISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE VELLFDLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETC ELPQIELVKM MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGTIAPFD LEVRPGEIVG LAGLLGSGRT ETAEVIFGIK PADSGTALIK GKPQNLRSPH QASVQGIGFC PEDRKTDGII AAASVRENII LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ VAEIPLAELS VPAIMNAIAA
|
| |