Gene BURPS668_A0803 details


Gene Information

Locus tag: BURPS668_A0803
Symbol:
ID: 4885792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 777233
End bp: 778594
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 70%
IMG OID: 640130743
Product: hypothetical protein
Protein accession: YP_001061802
Protein GI: 126444399
COG category: [S] Function unknown
COG ID: [COG3522] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03353] type VI secretion protein, VC_A0114 family
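
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 777233
END_BP = 778594


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1362 453
```

Under these assumptions the result agrees with the listed Gene Length (1362 bp) and Protein Length (453 aa).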


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.533915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGTC TGCCGGTAGG ACCGGTCGCG TGGAGCGACG GCATGCTGAT CGAGACGCAG 
CACTTCCAGC AGCTCGAGCG GCATCTCGCG CATCAGGCCT CGCTGCGGCT CGGTCAGACG
TCGAATCACG GCTGGGGCTT CACGCTGCTC GATCTCGACC AGGACGGCCT GGGGCTCGGC
CGGCTCGGGC TGCGCCACGC GCGCGGCGTG TTCCAGGACG GCACCGCGTT CTCGCTGCCG
TCGGACGATC CGCTGCCGCC GCCGCTGGAA ACCGAGCTCG CGCAGGCGGG CGACATCGCG
TGCCTCGCGC TGCAAGCCGC GCGCACGGGC GGCCCGGAGA TGGCGTTCGG CGACGTCGAG
CTGGCGTCGC GCTATCGCGC GGTGTCGACC GAGGTGCCGG ATCTCGCGGT CGGGCTCGAC
GCGCCCGGCA CGCCGCGGCG CCTGACGATC GAGACGGGCC AGCTCGTCAC GCGCCTGTGC
TGGAAGTCGC AGCTGCGCTC GGACGAGGTC GCGCTGCCGA TCGCGCGCGT GGCGGGGCGC
AACGCGAGCC GCACGGTGTC GCTGGATCCG CGCTTCATTC CGCCGCTGCT CGACACGCGC
GCGCACCTGG TGCTGCGCTC GCTGATCGAC GAGCTGCAGA GCACGCTGCG CGTGCGGCTC
GCGAGCACGT CCGCGCAGCG CGTGCTGTCG ACGGGCGGGG GCGTGGCCGA TCTGATCGAG
CTGCTGCTGC GCCAGGCGAT CGCCGAGTAC CGGATGCGCT TGGCGAACCT CGACGCGTTC
GATCCGCTGC CGCCGGCGAT GCTGTATCAC GAACTGGTCG GCCTGCTCGG GCGGCTGAGC
GTGCTGCCGG GCGTCGACGA GGAACTGGCC GACCGCGAGC TCGGCTACGA CCACGACGAT
CTGCAGACGA GCTTCGAGCC GCTCGCGATG ATGCTGCGCC AGGCGCTCGC GCGCGTGATC
GAGACACCGG TGCTGCCGCT GCGCTTCGAG GATCGCGGCG ATCAGGTGCA CATCTGCATC
GTCGACAAGC AGTGGAACCT GAAGAAACTG ATTTTTGCGT TTTCGGCCGC GATGCCGGCG
GAGAAGCTGC GGCAACTGTT GCCGCAGCAG ACGAAGCTGG GCGCCGTCGA GCAGATCCAG
AAGCTCGTGG ACCTGCAACT GCCGGGCGCG CGGCTGAACG CGCTGCCCAA TCCCCCGCGC
CAGATTCCCT ACTACGCCCA AAGCACGTAC TTCGAAGTGG AATCGACCGA TCCGTTCTGG
AAGCAGACCC TCGCCGGCTC GGCGATGGCG CTGCGCATCG TCGGCGATTT CCCCGATCTT
CGCTTCGAAG CCTGGGGGCT GAGAGACGGC AAGGTGGCGT GA
 
Protein sequence
MSSLPVGPVA WSDGMLIETQ HFQQLERHLA HQASLRLGQT SNHGWGFTLL DLDQDGLGLG 
RLGLRHARGV FQDGTAFSLP SDDPLPPPLE TELAQAGDIA CLALQAARTG GPEMAFGDVE
LASRYRAVST EVPDLAVGLD APGTPRRLTI ETGQLVTRLC WKSQLRSDEV ALPIARVAGR
NASRTVSLDP RFIPPLLDTR AHLVLRSLID ELQSTLRVRL ASTSAQRVLS TGGGVADLIE
LLLRQAIAEY RMRLANLDAF DPLPPAMLYH ELVGLLGRLS VLPGVDEELA DRELGYDHDD
LQTSFEPLAM MLRQALARVI ETPVLPLRFE DRGDQVHICI VDKQWNLKKL IFAFSAAMPA
EKLRQLLPQQ TKLGAVEQIQ KLVDLQLPGA RLNALPNPPR QIPYYAQSTY FEVESTDPFW
KQTLAGSAMA LRIVGDFPDL RFEAWGLRDG KVA
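
The sequence blocks above are laid out in groups of ten characters, so they need to be cleaned before any computation. The following is a rough sketch of how the gene sequence could be normalised and sanity-checked. The helper names are hypothetical, the start-codon set is an assumption covering only the most common bacterial starts (translation table 11 permits others), and the short sequence at the end is a toy input, not the gene above.

```python
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence has a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGAGT AGTCTG\nTGA")
print(toy, round(gc_content(toy), 2), looks_like_complete_cds(toy))
```

Pasting the full Gene sequence block into `clean_sequence` would allow the listed GC content and gene length to be recomputed in the same way.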