Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00731 |
Symbol | A |
ID | 8113016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 768595 |
End bp | 770520 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644846998 |
Product | hypothetical protein |
Protein accession | YP_002998571 |
Protein GI | 251784267 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.634997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATATAT CGAACAGTCA GGTTAACAGG CTGCGGCATT TTGTCCGCGC CGGGCTTCGC TCACTGTTCA GGCCGGAGCC ACAGACCGCC GTTGAATGGG CGGATGCTAA TTACTATCTC CCGAAAGAAT CCGCATACCA GGAAGGGCGC TGGGAAACAC TGCCCTTTCA GCGGGCCATC ATGAATGCGA TGGGCAGCGA CTACATCCGT GAGGTGAATG TGGTGAAGTC TGCCCGTGTC GGTTATTCCA AAATGCTGCT GGGTGTTTAT GCCTACTTTA TAGAGCATAA GCAGCGCAAC ACCCTTATCT GGTTGCCGAC GGATGGTGAT GCCGAGAACT TTATGAAAAC CCACGTTGAG CCGACTATTC GTGATATTCC GTCGCTGCTG GCGCTGGCCC CGTGGTATGG CAAAAAGCAC CGGGATAACA CGCTCACCAT GAAGCGTTTC ACTAATGGGC GTGGCTTCTG GTGCCTGGGC GGTAAAGCGG CAAAAAACTA CCGTGAAAAG TCGGTGGATG TGGCGGGTTA TGATGAACTT GCTGCTTTTG ATGATGATAT TGAACAGGAA GGCTCTCCGA CGTTCCTGGG TGACAAGCGT ATTGAAGGCT CGGTCTGGCC AAAGTCCATC CGTGGCTCCA CGCCAAAAGT GAGAGGCACC TGTCAGATTG AGCGTGCAGC CAGTGAATCC CCGCATTTTA TGCGTTTTCA TGTTGCCTGC CCGCATTGCG GGGAGGAGCA GTATCTTAAA TTTGGCGACA AAGAGACGCC GTTTGGCCTC AAATGGACGC CGGATGACCC CTCCAGCGTG TTTTATCTCT GCGAGCATAA TGCCTGCGTC ATCCGCCAGC AGGAGCTGGA CTTTACTGAT GCCCGTTATA TCTGCGAAAA GACCGGGATC TGGACCCGTG ATGGCATTCT CTGGTTTTCG TCATCCGGTG AAGAGATTGA GCCACCTGAC AGTGTGACCT TTCACATCTG GACAGCGTAC AGCCCGTTCA CCACCTGGGT GCAGATTGTC AAAGACTGGA TGAAAACGAA AGGGGATACG GGAAAACGTA AAACCTTCGT AAACACCACG CTCGGTGAGA CGTGGGAGGC GAAAATTGGC GAACGTCCGG ATGCTGAAGT GATGGCAGAG CGGAAAGAGC ATTATTCAGC GCCCGTTCCT GACCGTGTGG CTTACCTGAC CGCCGGTATC GACTCCCAGC TGGACCGCTA CGAAATGCGC GTATGGGGAT GGGGGCCGGG TGAGGAAAGC TGGCTGATTG ACCGGCAGAT TATTATGGGC CGCCACGACG ATGAACAGAC GCTGCTGCGT GTGGATGAGG CCATCAATAA AACCTATACC CGCCGGAATG GTGCAGAAAT GTCGATATCC CGTATCTGCT GGGATACTGG CGGGATTGAC CCGACCATTG TGTATGAACG CTCGAAAAAA CATGGGCTGT TCCGGGTGAT CCCCATTAAA GGGGCATCCG TCTACGGAAA GCCGGTGGCC AGCATGCCAC GTAAGCGAAA CAAAAACGGG GTTTACCTTA CCGAAATCGG TACGGATACC GCGAAAGAGC AGATTTATAA CCGCTTCACA CTGACGCCGG AAGGGGATGA ACCGCTTCCC GGTGCCGTTC ACTTCCCGAA TAACCCGGAT ATTTTTGATC TGACCGAAGC GCAGCAGCTG ACTGCTGAAG AGCAGGTCGA AAAATGGGTG GATGGCAGGA AAAAAATACT GTGGGACAGC AAAAAGCGAC GCAATGAGGC ACTCGACTGC TTCGTTTATG CGCTGGCGGC GCTGCGCATC AGTATTTCCC GCTGGCAGCT GGATCTCAGT GCGCTGCTGG CGAGCCTGCA GGAAGAGGAT GGTGCAGCAA CCAACAAGAA AACACTGGCA GATTACGCCC GTGCCTTATC CGGAGAGGAT GAATGA
|
Protein sequence | MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF TNGRGFWCLG GKAAKNYREK SVDVAGYDEL AAFDDDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHFMRFHVAC PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES WLIDRQIIMG RHDDEQTLLR VDEAINKTYT RRNGAEMSIS RICWDTGGID PTIVYERSKK HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI SISRWQLDLS ALLASLQEED GAATNKKTLA DYARALSGED E
|
| |