Gene Bcep18194_B1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1042 
Symbol 
ID3752807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp1168060 
End bp1170057 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content62% 
IMG OID637765891 
ProductPhage terminase GpA 
Protein accessionYP_371800 
Protein GI78061892 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0571086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.208633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCGA TCGCCCCGCC TGAAGCGCGG CGATATAGCT CCGGATATGA CGCGTTGCGC 
CGCGGGCTAC TGGATGCGCG CCGGCGCAAT ATCCAACCGC CGCCCAAGCT GACGCTGAGC
GAGTGGGCTG ATAAGTACGC GGTTCTGTCT CGCGAGACCA GCGCGCAAAC TGGTCGCTTC
CGTGCGTTTC CGTATCAGAA CGGGATCATG GACGCGGTGA CCGATCCGAC GGTCGAGACC
ATTACGGTCC AGAAATCGGC GCGAGTCGGC TATACGAAGA TCCTCGACCA CGTCGCGGGG
TACTTCATCC ATCAGGATCC GTCGCCGATG CTGGTGGTGC AGCCTCGCGT CGAGGATGCC
GAGGACTACA GCACCACTGA GATCGAGCCG ATGCTCCGGG ACACCCCGGT GCTTGCGGAG
ATCGTCGGCG ATCTGAAGAG GAAGGACGCG AAGCAAAAGA TCCTGAAGCG CGTCTACCGA
AACGGTGCAT CGACGTCGTT CGTCGGAGCG AATAGTCCGG GCGGCTTCCG CCGGATCACT
GCACGGATCG TTGCGTTCGA TGAGGTCGAC GGCTACCCGG TCCAGGGGGC CGGGAAGGAA
GGCGACCAGA TCAAGTTGGG GATCAAGCGA ACTGAGTCGT TTTGGAACCG GAAAATCATT
CTCGGCAGTA CGCCGACGGT GAAGGGTTTC AGCCGAATCG AGAAGAGTTT CGAGAATAGC
GACCAGCGGC GGTATCACGT GCCATGTCCG CACTGCGGCG AATTCCAGGT GCTTGAGTGG
GGTGGCCCGG ATACGCCTCA CGGGATGAAG TGGGACAAGG ATGCCGACGG CGTTGGCTTG
CCTGAGACCG CGTACTACGT CTGCCGCCAC AACGGCTGCA TCATCCATGA CGTCGACAAG
CCGGATATGG TCGAGCGTGG CGAATGGCGC GCGGCGAAGC CGTTTGCCGG TCACGCGGGT
TTTCATATCT GGGCCGGCTA CAGCCTGTTC CCGAATGCGT CGTGGCGCAA TCTGGTCAAG
GAATGGTTGG AGGTAAAGGA CGACCCGCTG GCGCGGCAGA CGTTCATCAA CCTGGTGCTT
GGCGAGACGT ACGAGGATCG CAGCGACCGC GCGCTGAAGG AAGATCGACT TGCGGCTCGC
GCCGAGGTGT GGTCGGCCGA AGTTCCTGAC GGCGTCGCCG TGCTCACGGC GGGTGTCGAC
ACGCAAGGTG ATCGCTTCGA GGTCGAAGTG ATCGGTTGGG GGCGCAACGA GGAGAGCTGG
TCGGTCGCCT ATGAGGTGAT CGAGGGCGAT ATGGAAACCC CCGATCCGTG GGACCGACTC
GATGGGTTCC TGCAGCGGAT CTGGCATCGC GCGGATGGTC GCGGCTTCGA GATCATGGCG
GTTTGCCATG ACTCGGGCGG CAATCACACG CAGAAGGTAT ACGAGTTCTC GAAGGCGCGT
TTGGGGCGCC GTATCTGGGC AATCAAGGGC GAGTCGGCTG TAGCCGGAAA GCGCAATCCG
GTGTGGCCGA CGAAGAAACC GACGCGCAAG ACGCGCGCGT CGTTTCGCCC GGTAATTATC
GGCGTGAACG CGGCGAAAGA CACGATTCGG AATCGACTTC ATGTCGATGC GCCGGGGCCC
GGGTATATGC ACTTCCCGAC AGATCGAGAC ATCAACTACT ACGCGCAACT CACCGCTGAA
CGTCAAGTTC GCAAAGTCTC GGGCGGCCAG GTCTACAAGG TATGGGAGCT ACCGCGCGGC
CGTACGAACG AGGCGCTCGA TTGCCGTGTG TACGGATACG CAGCTCTATG CGGCCTGCTG
CACCTTGGGT TGCGCCTGAA CGTGACAGCC GACGAAGTTC GTGCCGCGTA CACGCCTTTG
CCTTACATAG CACCTGAACC GACGAAGGAA GAGCCAGTCA TGGAGGCTCC AATTCCTGTA
CAGGATCGTG GGCCGATGGT GAAGAAGGTG GGCGGGTCAA ACAGCGGAGG CAAGTCGCGC
GCGAGCCGCC TCGCATAA
 
Protein sequence
MESIAPPEAR RYSSGYDALR RGLLDARRRN IQPPPKLTLS EWADKYAVLS RETSAQTGRF 
RAFPYQNGIM DAVTDPTVET ITVQKSARVG YTKILDHVAG YFIHQDPSPM LVVQPRVEDA
EDYSTTEIEP MLRDTPVLAE IVGDLKRKDA KQKILKRVYR NGASTSFVGA NSPGGFRRIT
ARIVAFDEVD GYPVQGAGKE GDQIKLGIKR TESFWNRKII LGSTPTVKGF SRIEKSFENS
DQRRYHVPCP HCGEFQVLEW GGPDTPHGMK WDKDADGVGL PETAYYVCRH NGCIIHDVDK
PDMVERGEWR AAKPFAGHAG FHIWAGYSLF PNASWRNLVK EWLEVKDDPL ARQTFINLVL
GETYEDRSDR ALKEDRLAAR AEVWSAEVPD GVAVLTAGVD TQGDRFEVEV IGWGRNEESW
SVAYEVIEGD METPDPWDRL DGFLQRIWHR ADGRGFEIMA VCHDSGGNHT QKVYEFSKAR
LGRRIWAIKG ESAVAGKRNP VWPTKKPTRK TRASFRPVII GVNAAKDTIR NRLHVDAPGP
GYMHFPTDRD INYYAQLTAE RQVRKVSGGQ VYKVWELPRG RTNEALDCRV YGYAALCGLL
HLGLRLNVTA DEVRAAYTPL PYIAPEPTKE EPVMEAPIPV QDRGPMVKKV GGSNSGGKSR
ASRLA