Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02842 |
Symbol | ygiS |
ID | 8114299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3032752 |
End bp | 3034359 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644849030 |
Product | hypothetical protein |
Protein accession | YP_003000603 |
Protein GI | 251786299 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.81173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATACGC GAAATTTATT ATGGCTGGTC AGCCTGGTAA GTGCGGCTCC TCTCTACGCT GCTGACGTTC CCGCCAACAC ACCGCTCGCC CCGCAACAAG TCTTTCGTTA CAACAATCAT AGCGACCCAG GTACGCTCGA CCCGCAAAAG GTGGAGGAGA ATACTGCCGC GCAGATTGTG CTGGATCTGT TTGAAGGTCT GGTATGGATG GACGGTGAAG GCCAGGTGCA GCCCGCTCAG GCTGAACGCT GGGAGATACT GGACGGCGGC AAGCGCTATA TTTTCCATCT GCGTAGCGGT TTGCAGTGGT CCGACGGTCA GCCTCTGACG GCAGAGGATT TTGTCCTCGG CTGGCAGCGC GCGGTTGACC CGAAAACGGC AAGCCCTTTT GCTGGCTATC TGGCACAGGC GCACATTAAC AATGCCGCAG CTATTGTTGC GGGTAAAGCA GATGTTACAT CGCTGGGTGT CAAAGCGACG GATGATCGTA CTCTTGAAGT TACGCTTGAG CAGCCGGTTC CTTGGTTCAC GACGATGCTC GCCTGGCCGA CGCTGTTCCC GGTTCCTCAT CATGTCATCG CTAAACATGG CGATAGCTGG AGTAAGCCAG AGAACATGGT TTACAACGGT GCCTTTGTGC TTGATCAGTG GGTAGTTAAC GAAAAGATTA CTGCACGCAA AAATCCAAAG TACCGCGATG CGCAACATAC AGTATTGCAA CAGGTTGAGT ATCTGGCGCT AGATAATTCG GTCACCGGCT ATAACCGCTA TCGCGCGGGA GAGGTCGATC TCACCTGGGT TCCGGCGCAG CAAATTCCCG CCATTGAAAA ATCACTGCCT GGCGAGCTAC GAATTATTCC GCGTCTGAAC AGCGAATATT ACAACTTCAA CCTTGAGAAA CCGCCATTTA ACGATGTGCG AGTGCGTCGG GCGCTATATC TTACGGTTGA TCGACAGCTT ATTGCGCAAA AGGTACTGGG GTTGAGAACG CCCGCAACCA CGCTGACGCC GCCAGAGGTA AAAGGCTTTA GCGCGACGAC GTTCGATGAA CTGCAAAAGC CAATGAGTGA GCGCGTCGCG ATGGCAAAAG CCTTGCTGAA ACAGGCGGGA TACGACGCCT CTCATCCGCT ACGCTTTGAG CTGTTCTACA ACAAGTACGA TCTGCATGAA AAGACCGCGA TAGCGTTGTC TTCCGAATGG AAAAAATGGC TGGGTGCACA GGTGACGCTG CGCACAATGG AGTGGAAAAC CTATCTTGAT GCCCGACGAG CCGGTGATTT CATGCTGTCT CGGCAGTCGT GGGATGCGAC GTACAATGAT GCTTCCAGCT TCCTGAACAC GCTCAAAAGC GATAGTGAAG AAAACGTCGG TCACTGGAAA AATGCGCAGT ATGACGCCTT ACTAAACCAG GCCACGCAGA TCACTGATGC GACAAAGCGT AATGCGTTGT ATCAGCAGGC AGAAGTGATC ATCAACCAAC AGGCACCGCT GATTCCTATC TACTATCAGC CGTTAATCAA ACTGCTTAAA CCCTACGTTG GCGGTTTTCC GCTGCATAAT CCCCAGGATT ATGTCTACAG CAAAGAGTTG TATATCAAGG CACATTGA
|
Protein sequence | MYTRNLLWLV SLVSAAPLYA ADVPANTPLA PQQVFRYNNH SDPGTLDPQK VEENTAAQIV LDLFEGLVWM DGEGQVQPAQ AERWEILDGG KRYIFHLRSG LQWSDGQPLT AEDFVLGWQR AVDPKTASPF AGYLAQAHIN NAAAIVAGKA DVTSLGVKAT DDRTLEVTLE QPVPWFTTML AWPTLFPVPH HVIAKHGDSW SKPENMVYNG AFVLDQWVVN EKITARKNPK YRDAQHTVLQ QVEYLALDNS VTGYNRYRAG EVDLTWVPAQ QIPAIEKSLP GELRIIPRLN SEYYNFNLEK PPFNDVRVRR ALYLTVDRQL IAQKVLGLRT PATTLTPPEV KGFSATTFDE LQKPMSERVA MAKALLKQAG YDASHPLRFE LFYNKYDLHE KTAIALSSEW KKWLGAQVTL RTMEWKTYLD ARRAGDFMLS RQSWDATYND ASSFLNTLKS DSEENVGHWK NAQYDALLNQ ATQITDATKR NALYQQAEVI INQQAPLIPI YYQPLIKLLK PYVGGFPLHN PQDYVYSKEL YIKAH
|
| |