Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00813 |
Symbol | yliA |
ID | 8113244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 850884 |
End bp | 852722 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644847079 |
Product | hypothetical protein |
Protein accession | YP_002998652 |
Protein GI | 251784348 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.991571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGGCGG TTGAAAATCT GAATATTGCC TTTATGCAGG ACCAGCAGAA AATAGCTGCG GTCCGCAATC TCTCTTTTAG TCTGCAACGC GGTGAGACGC TGGCAATTGT TGGCGAATCC GGCTCCGGTA AGTCAGTGAC TGCGTTGGCA TTGATGCGCC TGTTGGAACA GGCGGGCGGT TTAGTACAGT GCGATAAAAT GCTGTTGCAG CGGCGCAGTC GCGAAGTGAT TGAACTTAGC GAGCAGAACG CTGCACAAAT GCGCCATGTT CGCGGTGCGG ATATGGCGAT GATATTTCAG GAGCCGATGA CATCGCTGAA CCCGGTATTT ACTGTGGGTG AACAGATTGC CGAATCAATT CGTCTGCATC AGAACGCCAG TCGTGAAGAA GCGATGGTCG AGGCGAAGCG GATGCTGGAT CAGGTACGCA TTCCTGAGGC ACAAACCATT CTTTCACGTT ATCCGCATCA ACTCTCTGGC GGGATGCGCC AGCGAGTGAT GATTGCGATG GCGCTGTCAT GCCGCCCGGC GGTGCTGATT GCCGATGAGC CAACCACCGC GCTGGATGTC ACTATTCAGG CGCAGATCCT GCAATTAATC AAAGTATTGC AAAAAGAGAT GTCGATGGGC GTTATCTTTA TCACTCACGA TATGGGCGTG GTGGCAGAGA TTGCCGATCG GGTACTGGTG ATGTATCAGG GCGAGGCGGT GGAAACGGGT ACCGTCGAAC AGATTTTTCA TGCACCGCAA CATCCTTACA CCCGTGCGCT GTTAGCTGCT GTTCCGCAAC TTGGTGCGAT GAAAGGGTTA GATTATCCCC GACGTTTCCC GTTGATATCG CTTGAACATC CAGCGAAACA GGCCCCCCCC ATCGAGCAGA AAACGGTGGT GGATGGCGAA CCTGTTTTAC GAGTGCGTAA TCTTGTCACC CGTTTCCCTT TGCGCAGCGG TTTGTTGAAT CGCGTAACGC GGGAAGTGCA TGCCGTTGAG AAAGTCAGTT TTGATCTCTG GCCTGGCGAA ACGCTATCGC TGGTGGGCGA GTCTGGCAGC GGTAAATCCA CTACCGGGCG GGCGTTGCTG CGCCTGGTCG AATCGCAGGG CGGCGAAATT ATCTTTAACG GTCAGCGAAT CGATACCTTG TCACCCGGCA AACTTCAGGC ATTACGCCGG GATATTCAGT TTATTTTTCA GGACCCTTAC GCTTCGCTGG ACCCACGTCA GACCATCGGT GATTCGATTA TCGAACCGCT GCGTGTACAC GGTTTATTGC CAGGTAAAGA CGCGGCTGCA CGCGTTGCGT GGTTGCTGGA GCGCGTGGGC CTGTTACCTG AACATGCCTG GCGTTACCCG CATGAGTTTT CCGGCGGTCA GCGCCAGCGC ATCTGCATTG CTCGCGCGTT GGCATTGAAT CCAAAAGTGA TCATTGCCGA CGAAGCCGTT TCGGCGCTGG ATGTTTCTAT TCGCGGGCAG ATTATCAACT TGTTGCTCGA TCTCCAGCGT GATTTCGGCA TTGCGTATCT GTTTATCTCC CACGATATGG CGGTGGTAGA GCGGATTAGT CATCGTGTGG CGGTGATGTA TCTCGGGCAA ATTGTTGAAA TTGGTCCACG GCGCGCGGTC TTCGAAAACC CGCAGCATCC TTATACGCGT AAATTACTGG CGGCAGTTCC GGTCGCTGAA CCGTCCCGAC AACGACCGCA GCGTGTACTG CTGTCGGACG ATCTTCCCAG CAATATTCAT CTGCGTGGCG AAGAGGTGGC AGCCGTCTCG TTGCAATGCG TCGGGCCGGG GCATTACGTC GCACAACCAC AATCAGAATA CGCATTCATG CGTAGATAA
|
Protein sequence | MLAVENLNIA FMQDQQKIAA VRNLSFSLQR GETLAIVGES GSGKSVTALA LMRLLEQAGG LVQCDKMLLQ RRSREVIELS EQNAAQMRHV RGADMAMIFQ EPMTSLNPVF TVGEQIAESI RLHQNASREE AMVEAKRMLD QVRIPEAQTI LSRYPHQLSG GMRQRVMIAM ALSCRPAVLI ADEPTTALDV TIQAQILQLI KVLQKEMSMG VIFITHDMGV VAEIADRVLV MYQGEAVETG TVEQIFHAPQ HPYTRALLAA VPQLGAMKGL DYPRRFPLIS LEHPAKQAPP IEQKTVVDGE PVLRVRNLVT RFPLRSGLLN RVTREVHAVE KVSFDLWPGE TLSLVGESGS GKSTTGRALL RLVESQGGEI IFNGQRIDTL SPGKLQALRR DIQFIFQDPY ASLDPRQTIG DSIIEPLRVH GLLPGKDAAA RVAWLLERVG LLPEHAWRYP HEFSGGQRQR ICIARALALN PKVIIADEAV SALDVSIRGQ IINLLLDLQR DFGIAYLFIS HDMAVVERIS HRVAVMYLGQ IVEIGPRRAV FENPQHPYTR KLLAAVPVAE PSRQRPQRVL LSDDLPSNIH LRGEEVAAVS LQCVGPGHYV AQPQSEYAFM RR
|
| |