Gene Bxe_A3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBxe_A3047 
Symbol 
ID4002154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia xenovorans LB400 
KingdomBacteria 
Replicon accessionNC_007951 
Strand
Start bp1542280 
End bp1543440 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID637946594 
Productputative bacteriophage tail protein GP47,Mu-like 
Protein accessionYP_557984 
Protein GI91782778 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.965464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000022489 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCATATG CACGGAAAAC ACTCGCGCAG ATCCGGTCTG ACGCGATGGC GGACATTGCG 
GCCGCGCTGC AGGGCTCGGA TCCTCTCCTG CGGTTCGCGG CGCTAAAAAT CATCGGGGTT
GTGCTCGCGG GCATGACCAA CGAGGAATAC GGGTACCTCG ACTGGATTGC GAAACAGACG
AACCCTTTCA CGGCCGACGA CGAGTACCTC GAGGCGTGGG GGGCACTGAA GAAGGTCTAT
CGCAAGGATG CGAGCGCCGC GAGCCTGTCA GCGACGTTCA CGGGCGTCGC CGGCAAGCTG
CTCGACGACG GTACGCCGGT GGTGCGAAGC GACGGAGCAA CCTACACCAC GTCGGGCACG
CAGACCGTCG TCGGGACTTC GGTCACAGTG ACGATCGTGG CCGACGTCGC GGGGGCTGCC
GGCAATGCGG ATCCCGGTAC CGTCGTCGCT CTCGACATCG CAGTCGACGG CATTCAGTCA
ACCGGCGCAG TCATTGGCAC AGTTTCGTCG GGCGCCGATA TCGAGGACCA GGAGGATTAT
CGCGCGCGCG TATTGGCGAA GTATCAGCAG CCTCCGCAGG GCGGCGCGGC GCCGGATTAT
GTGGAATGGG CGACTGACGT CGCTGGCGTC ACACGCGCAT GGTGCGCGCC CAACGGCTTC
GGCGCTGGAA CTGTGGTCGT GTACGTCATG CTCGATGACG CGCAGGCAGC GCATGGTGGC
TTTCCGCAAG GCACCGACGG GGTGTCGCAA CACGATCAGG GGCCCGGTGG TCTGCCGCGT
GGAACGGTAG CGACCGGTGA TCAGTTAGTT GTCGCCGATG CAATCGTCAC GCTCCAGCCG
GGTACGGCGC TCGTATGGAT TTGTTCACCT GTCGAGAACG TACTGTCGTT CGAACTGACC
GGGTCGGCAG GATGGTCGAC GGCGATCCGG AACGCGGTCA AGGCGCAGAT TTCTGATGTC
TTCTTTCGCA ACGGCGATCC GCGCGGCGGC ACGATCGACA GATCGGATAT CAATTCGGCG
ATCGCTGCAG TGCCAGGAAC CGCTGGTTTC GTCATTACTT CCATCACCGG CGTGATATCC
GGCACGCCGA CCACATACCC CGCAAATATC ACCGGCAGTT TCGGCTCGCT GCCCGTGCTC
GGGGAAGTCA CTTTCGGCTG A
 
Protein sequence
MPYARKTLAQ IRSDAMADIA AALQGSDPLL RFAALKIIGV VLAGMTNEEY GYLDWIAKQT 
NPFTADDEYL EAWGALKKVY RKDASAASLS ATFTGVAGKL LDDGTPVVRS DGATYTTSGT
QTVVGTSVTV TIVADVAGAA GNADPGTVVA LDIAVDGIQS TGAVIGTVSS GADIEDQEDY
RARVLAKYQQ PPQGGAAPDY VEWATDVAGV TRAWCAPNGF GAGTVVVYVM LDDAQAAHGG
FPQGTDGVSQ HDQGPGGLPR GTVATGDQLV VADAIVTLQP GTALVWICSP VENVLSFELT
GSAGWSTAIR NAVKAQISDV FFRNGDPRGG TIDRSDINSA IAAVPGTAGF VITSITGVIS
GTPTTYPANI TGSFGSLPVL GEVTFG