Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01990 |
Symbol | yegT |
ID | 8115412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2075928 |
End bp | 2077205 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848203 |
Product | hypothetical protein |
Protein accession | YP_002999776 |
Protein GI | 251785472 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00889] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAA CAGCAAAGCT GTCGTTCATG ATGTTTGTTG AATGGTTTAT CTGGGGCGCG TGGTTTGTGC CATTGTGGTT GTGGTTAAGT AAAAGCGGTT TTAGTGCCGG AGAAATTGGC TGGTCGTATG CCTGTACCGC CATTGCGGCG ATCCTGTCGC CAATTCTGGT TGGCTCCATC ACTGACCGCT TTTTCTCGGC GCAAAAAGTG CTGGCGGTAT TGATGTTCGC AGGCGCGCTA CTGATGTATT TCGCTGCGCA ACAGACCACT TTTGCCGGGT TCTTTCCGTT ACTGCTGGCC TACTCTCTAA CCTATATGCC GACCATTGCG CTGACTAACA GCATCGCTTT TGCCAACGTG CCGGATGTGG AGCGTGATTT CCCACGCATT CGTGTGATGG GCACTATCGG CTGGATTGCC TCTGGTCTGG CATGTGGTTT CTTGCCGCAA ATGCTGGGGT ATGCCGATAT CTCACCGACT AACATCCCGC TGCTGATTAC CGCCGGAAGT TCTGCTCTGC TCGGTGTGTT TGCGTTTTTC CTGCCCGACA CGCCGCCAAA AAGCACCGGC AAAATGGACA TTAAAGTCAT GCTCGGCCTG GATGCGCTGA TCCTGCTGCG CGATAAAAAC TTCCTCGTCT TTTTCTTCTG TTCATTCCTG TTTGCGATGC CACTGGCGTT CTATTACATC TTTGCCAACG GTTATCTGAC CGAAGTTGGC ATGAAAAACG CCACCGGCTG GATGACGCTC GGCCAGTTCT CTGAAATCTT CTTTATGCTG GCATTGCCGT TTTTCACTAA ACGCTTTGGT ATCAAAAAGG TATTATTGCT TGGTCTGGTC ACCGCTGCGA TCCGCTATGG CTTCTTTATT TACGGTAGTG CGGATGAATA TTTCACCTAC GCGTTACTGT TCCTCGGTAT TTTGCTTCAC GGCGTAAGTT ACGATTTTTA CTACGTTACC GCTTACATCT ATGTCGATAA AAAAGCCCCC GTGCATATGC GTACCGCTGC GCAGGGGCTG ATCACGCTCT GCTGCCAGGG CTTCGGCAGT TTGCTCGGCT ATCGTCTTGG CGGTGTGATG ATGGAAAGGA TGTTCGCTTA TCAGGAACCG GTAAACGGAC TGACTTTCAA CTGGTCCGGG ATGTGGACTT TCGGCGCGGT GATGATTGCC ATTATCGCCG TGCTGTTCAT GATTTTTTTC CGCGAATCCG ACAACGAAAT TACGGCTATC AAGGTCGATG ATCGCGATAT TGCGTTGACA CAAGGGGAAG TTAAATGA
|
Protein sequence | MKTTAKLSFM MFVEWFIWGA WFVPLWLWLS KSGFSAGEIG WSYACTAIAA ILSPILVGSI TDRFFSAQKV LAVLMFAGAL LMYFAAQQTT FAGFFPLLLA YSLTYMPTIA LTNSIAFANV PDVERDFPRI RVMGTIGWIA SGLACGFLPQ MLGYADISPT NIPLLITAGS SALLGVFAFF LPDTPPKSTG KMDIKVMLGL DALILLRDKN FLVFFFCSFL FAMPLAFYYI FANGYLTEVG MKNATGWMTL GQFSEIFFML ALPFFTKRFG IKKVLLLGLV TAAIRYGFFI YGSADEYFTY ALLFLGILLH GVSYDFYYVT AYIYVDKKAP VHMRTAAQGL ITLCCQGFGS LLGYRLGGVM MERMFAYQEP VNGLTFNWSG MWTFGAVMIA IIAVLFMIFF RESDNEITAI KVDDRDIALT QGEVK
|
| |