Gene Information |
Locus tag | B21_01867 |
Symbol | unknown |
ID | 8114414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1937329 |
End bp | 1938516 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848086 |
Product | hypothetical protein |
Protein accession | YP_002999659 |
Protein GI | 251785355 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
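The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1937329, 1938516)
print(length)                  # 1188
print(protein_length(length))  # 395
```

Under these assumptions both values agree with the Gene Length and Protein Length fields. The strand field does not affect the length calculation.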
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000955333 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTGCCAGCC AGAATCTGGA GTTTGAAGAA GCGGCACGTA TTCGCGACCA AATTCAGGCG GTGCGACGCG TCACCGAAAA ACAATTCGTT TCCAATACCG GCGACGACCT CGACGTTATT GGTGTGGCGT TCGATGCGGG CATGGCTTGT GTCCACGTAT TGTTCATTCG TCAGGGCAAA GTGCTCGGCA GCCGCAGCTA TTTCCCGAAA GTGCCTGGCG GTACGGAACT GAGCGAGGTG GTAGAAACCT TCGTAGGTCA GTTCTATTTA CAAGGCAGCC AGATGCGCAC CTTACCGGGT GAGATCCTGC TCGATTTTAA TCTTAGCGAT AAAACGCTGC TCGCCGATTC CCTTTCAGAA CTGGCGGGAC GCAAGATTAA TGTTCAAACC AAACCTCGCG GCGATCGGGC GCGTTATCTG AAACTCGCGC GCACCAATGC GGCGACGGCC TTAACCAGCA AACTTTCGCA GCAATCTACC GTTCACCAGC GGCTTACAGC ACTTGCCAGT GTGTTGAAAT TGCCGGAAGT GAAGCGGATG GAGTGCTTTG ACATCAGCCA TACCATGGGT GAACAAACCG TCGCTTCCTG TGTGGTGTTT GATGCTAACG GCCCGCTGCG TGCGGAGTAT CGGCGCTATA ACATTACTGG CATCACGCCG GGCGATGATT ATGCGGCGAT GAATCAGGTG CTGCGTCGGC GTTATGGTAA AGCCATCGAC GACAGTAAGA TCCCGGATGT GATACTTATC GACGGCGGCA AAGGCCAGCT TGCGCAGGCG AAAAATGTCT TCGCCGAACT GGATGTCTCA TGGGATAAAA ATCATCCGCT GCTACTTGGC GTTGCCAAAG GAGTAGATCG TAAGGCTGGG CTGGAAACGC TGTTCTTTGA GCCGGAAGGT GAGGGATTCA GTTTGCCGCC AGATTCTCCC GCGCTGCATG TTATCCAGCA TATTCGCGAT GAATCACACG ATCACGCGAT TGGCGGGCAC CGTAAAAAAC GGGCGAAGGT CAAAAATACC AGTTCCCTGG AAACCATTGA AGGCGTCGGG CCAAAACGCC GGCAAATGTT GTTGAAATAT ATGGGCGGTT TGCAAGGTTT ACGTAACGCC AGCGTCGAGG AAATTGCAAA AGTGCCGGGT ATTTCGCAAG GTCTGGCAGA AAAGATCTTC TGGTCGTTGA AACATTGA
Protein sequence | TASQNLEFEE AARIRDQIQA VRRVTEKQFV SNTGDDLDVI GVAFDAGMAC VHVLFIRQGK VLGSRSYFPK VPGGTELSEV VETFVGQFYL QGSQMRTLPG EILLDFNLSD KTLLADSLSE LAGRKINVQT KPRGDRARYL KLARTNAATA LTSKLSQQST VHQRLTALAS VLKLPEVKRM ECFDISHTMG EQTVASCVVF DANGPLRAEY RRYNITGITP GDDYAAMNQV LRRRYGKAID DSKIPDVILI DGGKGQLAQA KNVFAELDVS WDKNHPLLLG VAKGVDRKAG LETLFFEPEG EGFSLPPDSP ALHVIQHIRD ESHDHAIGGH RKKRAKVKNT SSLETIEGVG PKRRQMLLKY MGGLQGLRNA SVEEIAKVPG ISQGLAEKIF WSLKH
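The relationship between the two sequence fields can be sketched in code. The example below strips the 10-character grouping, computes GC content, and translates codon by codon. It assumes the gene sequence is already shown in coding orientation and uses the standard codon-to-amino-acid assignments, which translation table 11 shares; start-codon handling is not modelled. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ACTGCCAGCC AGAATCTGGA GTTTGAAGAA"))  # TASQNLEFEE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields.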