Gene Information |
Locus tag | B21_01755 |
Symbol | insL-4 |
ID | 8113706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1823579 |
End bp | 1824697 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644847976 |
Product | hypothetical protein |
Protein accession | YP_002999549 |
Protein GI | 251785245 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.495752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGATGA ATTACTCTCA CGATAACTGG TCAGCAATTC TGGCCCATAT TGGTAAGCCC GAAGAACTGG ATACTTCGGC ACGTAATGCC GGGGCTCTAA CCCGCCGCCG CGAAATTCGT GATGCTGCAA CTCTGCTACG TCTGGGGCTG GCTTACGGCC CCGGGGGGAT GTCATTACGT GAAGTCACTG CATGGGCTCA GCTCCATGAC GTTGCAACAT TATCTGACGT GGCTCTCCTG AAGCGGCTGC GGAATGCCGC CGACTGGTTT GGCATACTTG CCGCACAAAC ACTTGCTGTA CGCGCCGCAG TTACGGGTTG TACAAGCGGA AAGAGATTGC GTCTTGTCGA TGGAACAGCA ATCAGTGCGC CCGGGGGCGG CAGCGCTGAA TGGCGACTAC ATATGGGATA TGATCCTCAT ACCTGTCAGT TCACTGATTT TGAGCTAACC GACAGCAGAG ACGCTGAACG GCTGGACCGA TTTGCGCAAA CGGCAGACGA GATACGCATT GCTGACCGGG GATTCGGTTC GCGTCCCGAA TGTATCCGCT CACTTGCTTT TGGAGAAGCT GATTATATCG TCCGGGTTCA CTGGCGAGGA TTGCGCTGGT TAACTGCAGA AGGAATGCGC TTTGACATGA TGGGTTTTCT GCGCGGGCTG GATTGCGGTA AGAACGGTGA AACCACTGTA ATGATAGGCA ATTCAGGTAA TAAAAAAGCC GGAGCTCCCT TTCCGGCACG TCTCATTGCC GTATCACTTC CTCCCGAAAA AGCATTAATC AGTAAAACCC GACTGCTCAG CGAGAATCGT CGAAAAGGAC GAGTAGTTCA GGCGGAAACG CTGGAAGCAG CGGGCCATGT GCTATTGCTA ACATCATTAC CGGAAGATGA ATATTCAGCA GAGCAAGTGG CTGATTGTTA CCGTCTGCGA TGGCAAATTG AACTGGCTTT TAAGCGGCTC AAAAGTTTGC TGCACCTGGA TGCTTTGCGT GCAAAGGAAC CTGAACTCGC GAAAGCGTGG ATATTTGCTA ATCTACTCGC CGCATTTTTA ATTGACGACA TAATCCAGCC ATCGCTGGAT TTCCCCCCCA GAAGTGCCGG ATCCGAAAAG AAGAACTAA
|
Protein sequence | MPMNYSHDNW SAILAHIGKP EELDTSARNA GALTRRREIR DAATLLRLGL AYGPGGMSLR EVTAWAQLHD VATLSDVALL KRLRNAADWF GILAAQTLAV RAAVTGCTSG KRLRLVDGTA ISAPGGGSAE WRLHMGYDPH TCQFTDFELT DSRDAERLDR FAQTADEIRI ADRGFGSRPE CIRSLAFGEA DYIVRVHWRG LRWLTAEGMR FDMMGFLRGL DCGKNGETTV MIGNSGNKKA GAPFPARLIA VSLPPEKALI SKTRLLSENR RKGRVVQAET LEAAGHVLLL TSLPEDEYSA EQVADCYRLR WQIELAFKRL KSLLHLDALR AKEPELAKAW IFANLLAAFL IDDIIQPSLD FPPRSAGSEK KN
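The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene sequence includes a terminal stop codon (the sequence above ends in TAA). The function names are hypothetical and chosen only for illustration. Note that the gene begins with GTG, an alternative start codon under translation table 11, which is read as methionine when it initiates translation.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    length = gene_length(1823579, 1824697)
    print("Gene length (bp):", length)
    print("Expected protein length (aa):", expected_protein_length(length))
```

With the values from this record, the computed gene length matches the listed 1119 bp and the expected protein length matches the listed 372 aa. Passing the full gene sequence to `gc_percent` gives a value to compare against the listed GC content.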