Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01469 |
Symbol | ydeN |
ID | 8113759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1535733 |
End bp | 1537415 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644847707 |
Product | hypothetical protein |
Protein accession | YP_002999280 |
Protein GI | 251784976 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCTG CATTAAAGAA AAGTGTCGTA AGTACCTCGA TATCTTTGAT ACTGGCATCT GGTATGGCTG CATTTGCTGC TCATGCGGCA GATGATGTAA AGCTGAAAGC AACCAAAACA AACGTTGCTT TCTCAGACTT TACGCCGACT GAATACAGTA CCAAAGGAAA GCCAAATATT ATCGTACTAA CCATGGATGA TCTTGGTTAT GGACAACTTC CTTTTGATAA GGGATCTTTT GACCCAAAAA CAATGAAAAA TCGTGAAGTT GTCGATACCT ACAAAATAGG GATAGATAAA GCCATTGAAG CTGCACAAAA ATCAACGCCG ACGCTCCTTT CATTAATGGA TGAAGGCGTA CGGTTTACTA ACGGCTATGT GGCACACGGT GTTTCCGGCC CCTCCCGTGC CGCAATAATG ACCGGTCGAG CTCCCGCCCG CTTTGGTGTC TATTCCAATA CCGATGCTCA GGATGGTATT CCGCTAACAG AAACTTTCTT GCCTGAATTA TTCCAGAATC ATGGTTATTA CACTGCAGCA GTAGGTAAAT GGCACTTGTC AAAAATCAGT AATGTGCCGG TACCGGAAGA TAAACAAACA CGTGACTATC ATGACAACTT CACCACATTT TCTGCGGAAG AATGGCAACC TCAAAACCGT GGCTTTGATT ACTTTATGGG ATTCCACGCT GCAGGAACGG CATATTACAA CTCCCCTTCA CTGTTCCAAA ATCGTGAACG TGTCCCCGCA AAAGGTTATA TCAGCGATCA GTTAACCGAT GAGGCAATTG GCGTTGTTGA TCGTGCCAAA ACACTTGACC AGCCTTTTAT GCTTTACCTG GCTTATAATG CTCCGCACCT GCCAAATGAT AATCCTGCAC CGGAGCAATA TCAGAAGCAA TTTAATACCG GTAGTCAAAC GGCAGATAAC TACTACGCTT CCGTTTATTC TGTTGATCAG GGTGTAAAAC GCATTCTCGA ACAACTGAAG AAAAACGGAC AGTATGACAA TACAATTATT CTCTTTACCT CCGATAATGG TGCGGTTATC GATGGTCCTC TGCCGCTGAA CGGGGCGCAA AAAGGCTATA AGAGTCAGAC CTATCCTGGC GGTACTCACA CCCCAATGTT TATGTGGTGG AAAGGAAAAC TTCAACCCGG TAATTATGAC AAGCTGATTT CCGCAATGGA TTTCTACCCG ACAGCTCTTG ATGCAGCCGA TATCAGCATT CCAAAAGACC TTAAGCTGGA TGGCGTTTCC TTGCTGCCCT GGTTGCAAGA TAAGAAACAA GGCGAGCCAC ATAAAAATCT GACCTGGATA ACCTCTTATT CTCACTGGTT TGATGAGGAA AATATTCCAT TCTGGGATAA TTACCACAAA TTTGTCCGTC ATCAGTCAGA CGATTACCCG CATAACCCCA ACACTGAGGA CTTAAGCCAA TTCTCTTATA CGGTGAGAAA TAACGATTAT TCGCTTGTCT ATACAGTAGA AAACAATCAG TTAGGTCTAT ACAAACTGAC GGATCTACAG CAAAAAGATA ACCTTGCCGC CGCCAATCCG CAGGTCGTTA AAGAGATGCA AGGCGTGGTA AGAGAGTTTA TCGACAGCAG CCAGCCACCG CTTAGCGAGG TAAATCAGGA GAAGTTTAAC AATATCAAGA AAGCACTAAG CGAAGCGAAA TAA
|
Protein sequence | MKSALKKSVV STSISLILAS GMAAFAAHAA DDVKLKATKT NVAFSDFTPT EYSTKGKPNI IVLTMDDLGY GQLPFDKGSF DPKTMKNREV VDTYKIGIDK AIEAAQKSTP TLLSLMDEGV RFTNGYVAHG VSGPSRAAIM TGRAPARFGV YSNTDAQDGI PLTETFLPEL FQNHGYYTAA VGKWHLSKIS NVPVPEDKQT RDYHDNFTTF SAEEWQPQNR GFDYFMGFHA AGTAYYNSPS LFQNRERVPA KGYISDQLTD EAIGVVDRAK TLDQPFMLYL AYNAPHLPND NPAPEQYQKQ FNTGSQTADN YYASVYSVDQ GVKRILEQLK KNGQYDNTII LFTSDNGAVI DGPLPLNGAQ KGYKSQTYPG GTHTPMFMWW KGKLQPGNYD KLISAMDFYP TALDAADISI PKDLKLDGVS LLPWLQDKKQ GEPHKNLTWI TSYSHWFDEE NIPFWDNYHK FVRHQSDDYP HNPNTEDLSQ FSYTVRNNDY SLVYTVENNQ LGLYKLTDLQ QKDNLAAANP QVVKEMQGVV REFIDSSQPP LSEVNQEKFN NIKKALSEAK
|
| |