Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_10054 |
Symbol | J |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 783980 |
End bp | 787378 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | tail:host specificity protein |
Protein accession | ACT42636 |
Protein GI | 253976966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAG GAAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC ACGCAGTTGC TGAGTGTGAT CGATGCCATC AGCGAAGGGC CGATTGAAGG TCCGGTGGAT GGCTTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACACTGAGGG GAATACCAAC ATATCCGGTG TCACGGTGGT GTTCCGGGCT GGTGAGCAGG AGCAGACTCC GCCGGAGGGA TTTGAATCCT CCGGCTCCGA GACGGTGCTG GGTACGGAAG TGAAATATGA CACGCCGATC ACCCGCACCA TTACGTCTGC AAACATCGAC CGTCTGCGCT TTACCTTCGG TGTACAGGCA CTGGTGGAAA CCACCTCAAA GGGTGACAGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG CAAAACCACC TCGCAGTATC TGGCCTCGGT GGTGATGGGT AACCTGCCGC CGCGCCCGTT TAATATCCGG ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG TCATACACTG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTG CAGGTGGACT CGGAGCAGTT CGGCAGCCAG CAGGTGAGCC GTAATTATCA TCTGCGCGGG CGTATTCTGC AGGTGCCGTC GAACTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG GACGGAACGT TTAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG ACCCATCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG CCGCGCATCA CCTGTAATGC GTACCTGACC ACACAGCGTA AGGCGTGGGA TGTGCTCAGC GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG CAGGACCGAC CGTCGGATAA GACGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT GATGGCGCGC CGTTCCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCATAA TGCCGTTGAG GTGAACTGGA TTGACCCGAA CAACGGCTGG GAGACGGCGA CAGAGCTTGT TGAAGATACG CAGGCCATTG CCCGTTACGG TCGTAATGTT ACGAAGATGG ATGCCTTTGG CTGTACCAGC CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CAGAACTGCT GGAAACGCAG ACCGTGGATT TCAGCGTCGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTTATTGAA ATCTGCGATG ATGACTATGC CGGTATCAGC ACCGGTGGTC GTGTGCTGGC GGTGAACAGC CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCGCGCTG ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GACGGTGTTG CTGAATACAG CGTATGGGAG CTGAAGCTGC CGACGCTGCG CCAGCGACTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC GACGGCACGT ATGCCATCAC CGCCGTGCAG CATGTGCCGG AAAAAGAGGC CATCGTGGAT AACGGGGCGC ACTTTGACGG CGAACAGAGT GGCACGGTGA ATGGTGTCAC GCCGCCAGCG GTGCAGCACC TGACCGCAGA AGTCACTGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA TGGGACACAC CGAAGGTGGT GAAGGGCGTG AGTTTCCTGC TCCGTCTGAC CGTAACAGCG GACGACGGCA GTGAGCGGCT GGTCAGCACG GCCCGGACGA CGGAAACCAC ATACCGCTTC ACGCAACTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGGCAG CAGGGCGATC CGGCGTCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCACC GTCGAGGATT GAGCTGACGC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCCGT TTATGACCCG ACGGTACAGT TTGAGTTCTG GTTCTCGGAA AAGCAGATTG CGGATATCAG ACAGGTTGAA ACCAGCACGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ACCTGGATTT TTTCAAAGGC AAGATAACCG AATCCCATCT CGGCAAGGAG CTGCTGGAAA AAGTCGAGCT GACGGAGGAT AACGCCAGCA GACTGGAGGA GTTTTCGAAA GAGTGGAAGG ATGCCAGTGA TAAGTGGAAT GCCATGTGGG CTGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAACTGAGCC AGTTTCTGGT TGCCGCCAAT CGTATCGCAT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC GGCGGCAATC CTCCGGCCTT TTCCCTGACA CCGGACGGAA AGCTGACCGC TAAAAATGCG GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA GTAATGTGAC GATAGCTGAA AACTGTACGA TAAACGGTAC GCTGAGGGCG GAAAAAATCG TCGGGGACAT TGTAAAGGCG GCGAGCGCGG CTTTTCCGCG CCAGCGTGAA AGCAGTGTGG ACTGGCCGTC AGGTACCCGT ACTGTCACCG TGACCGATGA CCATCCTTTT GATCGCCAGA TAGTGGTGCT TCCGCTGACG TTTCGCGGAA GTAAGCGTAC TGTCAGCGGC AGGACAACGT ATTCGATGTG TTATCTGAAA GTACTGATGA ACGGTGCGGT GATTTATGAT GGCGCGGCGA ACGAGGCGGT ACAGGTGTTC TCCCGTATTG TTGACATGCC AGCGGGTCGG GGAAACGTGA TCCTGACGTT CACGCTTACG TCCACACGGC ATTCGGCAGA TATTCCGCCG TATACGTTTG CCAGCGATGT GCAGGTTATG GTGATTAAGA AACAGGCGCT GGGCATCAGC GTGGTCTGA
|
Protein sequence | MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPIEGPVD GLKSVLLNST PVLDTEGNTN ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVMG NLPPRPFNIR MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG RILQVPSNYN PQTRQYSGIW DGTFKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDRPSDKTWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWIDPNNGW ETATELVEDT QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE ICDDDYAGIS TGGRVLAVNS QTRTLTLDRE ITLPSSGTAL ISLVDGSGNP VSVEVQSVTD GVKVKVSRVP DGVAEYSVWE LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEAIVD NGAHFDGEQS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFLLRLTVTA DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSRI ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE KQIADIRQVE TSTRYLGTAL YWIAASINIK PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVELTED NASRLEEFSK EWKDASDKWN AMWAVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGKLTAKNA DISGSVNANS GTLSNVTIAE NCTINGTLRA EKIVGDIVKA ASAAFPRQRE SSVDWPSGTR TVTVTDDHPF DRQIVVLPLT FRGSKRTVSG RTTYSMCYLK VLMNGAVIYD GAANEAVQVF SRIVDMPAGR GNVILTFTLT STRHSADIPP YTFASDVQVM VIKKQALGIS VV
|
| |