Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B1042 |
Symbol | |
ID | 3752807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 1168060 |
End bp | 1170057 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637765891 |
Product | Phage terminase GpA |
Protein accession | YP_371800 |
Protein GI | 78061892 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0571086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.208633 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCGA TCGCCCCGCC TGAAGCGCGG CGATATAGCT CCGGATATGA CGCGTTGCGC CGCGGGCTAC TGGATGCGCG CCGGCGCAAT ATCCAACCGC CGCCCAAGCT GACGCTGAGC GAGTGGGCTG ATAAGTACGC GGTTCTGTCT CGCGAGACCA GCGCGCAAAC TGGTCGCTTC CGTGCGTTTC CGTATCAGAA CGGGATCATG GACGCGGTGA CCGATCCGAC GGTCGAGACC ATTACGGTCC AGAAATCGGC GCGAGTCGGC TATACGAAGA TCCTCGACCA CGTCGCGGGG TACTTCATCC ATCAGGATCC GTCGCCGATG CTGGTGGTGC AGCCTCGCGT CGAGGATGCC GAGGACTACA GCACCACTGA GATCGAGCCG ATGCTCCGGG ACACCCCGGT GCTTGCGGAG ATCGTCGGCG ATCTGAAGAG GAAGGACGCG AAGCAAAAGA TCCTGAAGCG CGTCTACCGA AACGGTGCAT CGACGTCGTT CGTCGGAGCG AATAGTCCGG GCGGCTTCCG CCGGATCACT GCACGGATCG TTGCGTTCGA TGAGGTCGAC GGCTACCCGG TCCAGGGGGC CGGGAAGGAA GGCGACCAGA TCAAGTTGGG GATCAAGCGA ACTGAGTCGT TTTGGAACCG GAAAATCATT CTCGGCAGTA CGCCGACGGT GAAGGGTTTC AGCCGAATCG AGAAGAGTTT CGAGAATAGC GACCAGCGGC GGTATCACGT GCCATGTCCG CACTGCGGCG AATTCCAGGT GCTTGAGTGG GGTGGCCCGG ATACGCCTCA CGGGATGAAG TGGGACAAGG ATGCCGACGG CGTTGGCTTG CCTGAGACCG CGTACTACGT CTGCCGCCAC AACGGCTGCA TCATCCATGA CGTCGACAAG CCGGATATGG TCGAGCGTGG CGAATGGCGC GCGGCGAAGC CGTTTGCCGG TCACGCGGGT TTTCATATCT GGGCCGGCTA CAGCCTGTTC CCGAATGCGT CGTGGCGCAA TCTGGTCAAG GAATGGTTGG AGGTAAAGGA CGACCCGCTG GCGCGGCAGA CGTTCATCAA CCTGGTGCTT GGCGAGACGT ACGAGGATCG CAGCGACCGC GCGCTGAAGG AAGATCGACT TGCGGCTCGC GCCGAGGTGT GGTCGGCCGA AGTTCCTGAC GGCGTCGCCG TGCTCACGGC GGGTGTCGAC ACGCAAGGTG ATCGCTTCGA GGTCGAAGTG ATCGGTTGGG GGCGCAACGA GGAGAGCTGG TCGGTCGCCT ATGAGGTGAT CGAGGGCGAT ATGGAAACCC CCGATCCGTG GGACCGACTC GATGGGTTCC TGCAGCGGAT CTGGCATCGC GCGGATGGTC GCGGCTTCGA GATCATGGCG GTTTGCCATG ACTCGGGCGG CAATCACACG CAGAAGGTAT ACGAGTTCTC GAAGGCGCGT TTGGGGCGCC GTATCTGGGC AATCAAGGGC GAGTCGGCTG TAGCCGGAAA GCGCAATCCG GTGTGGCCGA CGAAGAAACC GACGCGCAAG ACGCGCGCGT CGTTTCGCCC GGTAATTATC GGCGTGAACG CGGCGAAAGA CACGATTCGG AATCGACTTC ATGTCGATGC GCCGGGGCCC GGGTATATGC ACTTCCCGAC AGATCGAGAC ATCAACTACT ACGCGCAACT CACCGCTGAA CGTCAAGTTC GCAAAGTCTC GGGCGGCCAG GTCTACAAGG TATGGGAGCT ACCGCGCGGC CGTACGAACG AGGCGCTCGA TTGCCGTGTG TACGGATACG CAGCTCTATG CGGCCTGCTG CACCTTGGGT TGCGCCTGAA CGTGACAGCC GACGAAGTTC GTGCCGCGTA CACGCCTTTG CCTTACATAG CACCTGAACC GACGAAGGAA GAGCCAGTCA TGGAGGCTCC AATTCCTGTA CAGGATCGTG GGCCGATGGT GAAGAAGGTG GGCGGGTCAA ACAGCGGAGG CAAGTCGCGC GCGAGCCGCC TCGCATAA
|
Protein sequence | MESIAPPEAR RYSSGYDALR RGLLDARRRN IQPPPKLTLS EWADKYAVLS RETSAQTGRF RAFPYQNGIM DAVTDPTVET ITVQKSARVG YTKILDHVAG YFIHQDPSPM LVVQPRVEDA EDYSTTEIEP MLRDTPVLAE IVGDLKRKDA KQKILKRVYR NGASTSFVGA NSPGGFRRIT ARIVAFDEVD GYPVQGAGKE GDQIKLGIKR TESFWNRKII LGSTPTVKGF SRIEKSFENS DQRRYHVPCP HCGEFQVLEW GGPDTPHGMK WDKDADGVGL PETAYYVCRH NGCIIHDVDK PDMVERGEWR AAKPFAGHAG FHIWAGYSLF PNASWRNLVK EWLEVKDDPL ARQTFINLVL GETYEDRSDR ALKEDRLAAR AEVWSAEVPD GVAVLTAGVD TQGDRFEVEV IGWGRNEESW SVAYEVIEGD METPDPWDRL DGFLQRIWHR ADGRGFEIMA VCHDSGGNHT QKVYEFSKAR LGRRIWAIKG ESAVAGKRNP VWPTKKPTRK TRASFRPVII GVNAAKDTIR NRLHVDAPGP GYMHFPTDRD INYYAQLTAE RQVRKVSGGQ VYKVWELPRG RTNEALDCRV YGYAALCGLL HLGLRLNVTA DEVRAAYTPL PYIAPEPTKE EPVMEAPIPV QDRGPMVKKV GGSNSGGKSR ASRLA
|
| |