Gene BURPS1106A_0789 details

Gene Information

Locus tag: BURPS1106A_0789
Symbol:
ID: 4900988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 774231
End bp: 776048
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 72%
IMG OID: 640134019
Product: hypothetical protein
Protein accession: YP_001065071
Protein GI: 126453725
COG category: [S] Function unknown
COG ID: [COG3519] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03359] type VI secretion protein, VC_A0110 family
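
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 774231
end_bp = 776048


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Amino acid count for an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1818
print(protein_length(length_bp))  # 605
```

Both results agree with the Gene Length and Protein Length fields listed in the record.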


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.975565
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACC TCCTTCCCCA CTACGAACGC GAGATCGCGC TGCTGCGGCG CTCGATGCGC 
GAGTTCGCGC GCCGCCACCC GAAAATCGCG ACGCGCCTGG GCATCGCCGA CGGCCAGGCC
GACGACATGC ACGTGGAACG CCTGATCCAG TCGTTCGCGC TCGTCGGCGC GCAGATCGGC
GCGGCGCTCG ACGACGAATA TCCGGAGTTC ACCGAGGCGC TCGTCGAAAC CGTGTGCCCC
GACTATCTGC GCCCGTTTCC CGCGTGCTCG ATCGCGCAGT TCGACGCGCC GCCGCTGTTC
GGCCAGCCGG CGGAGGGCGT GACGATCGCG CGCGGCACCA TGCTCGAGAG CCGGATCGGC
GCGAATCGGT TCCGCACCGC CTACGACGTC ACGCTCGCGC CCGTGTCGAT CGTCGACGCG
CGCTTTCGGC CGGCGTCGGC CGCGCCGTCC GCCGCGCGGA TGCCGCCGCA ATCGACGGGC
ATCGTATCGA TCACCTTCGA TGCGCTGACC GCGCAGCCGG TGCTCGCGGC GCTGCGCGCG
CGCGGCCTGC GCGTGCATCT GCACGGCGAC GCGCCGCTCG TCGCCGCGCT CGCCGACACG
CTCGCGATGC GCGCGCCGGC CGCGTTCATC GAGCTCGACG GCGACGGGCG CTGGAAACCG
CTGTCGAAGG TGCCGCTCGC GCGCGTCGGC TTCGCCGATG CGCACGCGCT GCTCGATCCC
GCGCCCGGCG CGGCGCCGTT CCGGCTGCTG ATGGAATATT TCGCATTCCC GGCGAAATTC
GATTTCGTCG ACATCGACGT CGCGCGCATC GCGCGCGCGG CGGGCCCGTG CCGGCGCCTG
TCGCTGCACT TGCCCGTCGT CGACGTCGCG ATGGCGTCGC AGCACGCGCG CGCGCTCGAC
ACGCTCGGCG CGTCGAACCT GCGGCTGTTC TGCACGCCGA TCGTCAATCT GTTCAGCCAG
GACGCGATGC CGATCTCGCT GCGCGACGCC GAGGCCGTGT ATCCCGTCAC GCCGCAGGCG
CTGAAATCGA ACGGCATCGA GGTGAGGTCG ATCGACGCGG TGCGCATCGC GCGCGAGGGC
GAGCCCGGCG CGAGCCTCGA CGTGACGCCG TACCGCTCGC TGCTGCACGG CCGGCACGGC
GGCCGCGACG CGCCCGTCTA CTGGGTCGCG CAGCGCGAGC GCTTCGCGCC CGGCTCGCCA
CCCGCGGCGC TGCGGCTCGT CGACGCCGAC GGCGCGAACG CGCGGCTGGG CGCCGATCAG
CTGAACGTCG AGCTGACCTG CACGAACGGC GACTATCCGC AGACGCTGCC GATCGGCGAG
CCGGACGGCG ATCTGCTCAA CGAGAAGGAC AATCTGCCGG GCCGGATCGC GCTCCTGCGC
CGGCCGACGC CCGCCCGCCG GTTCGCGCGC GAGCACGGCG CGCTGTGGCG CATGATCGCG
GCGATGACGC CGCACGCGCT GCTGTTGCAG CCGTCGGGGC TCGGCGCGCT GAAGGCGCTG
CTCGTTCAGC ACGCCGCGCG CGCGTCGAGC GCGGCGCCGC AGATCGACGC GATCGCGAAT
CTCGACCACA AGGTCGCCGT GCGCTGGATG GCGGTGAAGC CGATGCCGAC CTTCGTGCGC
GGCATCGAAA TCGCGCTGAC GCTGAACGAG GCCGCATTCG TCACGAGCGG CCTGAAGACG
TTCATCGACG TGATGGACGG TTTCTTCGCG CTGCATGCGC CGCCGAACGG CTTCGTGCAG
CTCGTCGCGT ACTCGCGCGA CACGGGCCGC GAGCTGCATC GGTGCGCGCC GCGCGGCGCG
ATCGCGCAGC TCGTGTAG
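
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be stripped before it can be analysed. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACGACC TCCTTCCCCA CTACGAACGC GAGATCGCGC TGCTGCGGCG CTCGATGCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Passing the full sequence text to the same two functions gives a value to compare with the GC content field.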
 
Protein sequence
MNDLLPHYER EIALLRRSMR EFARRHPKIA TRLGIADGQA DDMHVERLIQ SFALVGAQIG 
AALDDEYPEF TEALVETVCP DYLRPFPACS IAQFDAPPLF GQPAEGVTIA RGTMLESRIG
ANRFRTAYDV TLAPVSIVDA RFRPASAAPS AARMPPQSTG IVSITFDALT AQPVLAALRA
RGLRVHLHGD APLVAALADT LAMRAPAAFI ELDGDGRWKP LSKVPLARVG FADAHALLDP
APGAAPFRLL MEYFAFPAKF DFVDIDVARI ARAAGPCRRL SLHLPVVDVA MASQHARALD
TLGASNLRLF CTPIVNLFSQ DAMPISLRDA EAVYPVTPQA LKSNGIEVRS IDAVRIAREG
EPGASLDVTP YRSLLHGRHG GRDAPVYWVA QRERFAPGSP PAALRLVDAD GANARLGADQ
LNVELTCTNG DYPQTLPIGE PDGDLLNEKD NLPGRIALLR RPTPARRFAR EHGALWRMIA
AMTPHALLLQ PSGLGALKAL LVQHAARASS AAPQIDAIAN LDHKVAVRWM AVKPMPTFVR
GIEIALTLNE AAFVTSGLKT FIDVMDGFFA LHAPPNGFVQ LVAYSRDTGR ELHRCAPRGA
IAQLV
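
The protein sequence can be checked against the gene sequence by translating the coding sequence. The sketch below builds a codon table in the usual NCBI base order (T, C, A, G). To the best of my knowledge, translation table 11 assigns codons to amino acids in the same way as the standard code and differs only in the permitted start codons, which this sketch does not handle. The `translate` function is illustrative only.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons are NOT converted to
    methionine, which is a simplification relative to table 11.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAACGACC TCCTTCCCCA CTACGAACGC"))  # MNDLLPHYER
```

The output matches the first ten residues of the protein sequence listed above.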