Gene BURPS1106A_A2029 details


Gene Information

Locus tag: BURPS1106A_A2029
Symbol:
ID: 4903342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1996159
End bp: 1997901
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 73%
IMG OID: 640145134
Product: hypothetical protein
Protein accession: YP_001076062
Protein GI: 126456422
COG category: [S] Function unknown
COG ID: [COG3519] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03359] type VI secretion protein, VC_A0110 family
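
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1996159, 1997901)
print(length)                     # 1743
print(protein_length_aa(length))  # 580
```

Both values match the Gene Length and Protein Length fields of the record.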


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACCA CCACGCCCAA TCGCTATTAC GAGGACGAGC TCGTTCGCCT GCGCGAGCTC 
GCCGCCGAGT TCGCCCGCGC GCACCCGCTG CTCGCGCCGA TGCTCGGCGC GCCGTCGGGC
GACCCGGACG TCGAGCGGCT GCTCGAAGGC GTCGCGTTCC TGACGGGGCT CGCGCGGCAA
AAGCTCGACG AAGGCCTGCC GGAGCTCGTG CAGGCGCTCG CGAACCTGCT GTTTCCGCAC
TCGCTGCGCC CGGTGCCGGC GGCCACGCTG ATCGCGTTCG AGCCGCGCGG CGCGCTGCGC
GAGCGCGCGG TGATCGCGGC CGGCACCGAG ATCGAATCGG TGCCCGTCGA CGGCACCGCC
TGCCGGTTTC GCACGTGCGG CGAGCTCGAC ATCGAGCCGA TCGCGCTCGC CGGCTGCCGG
TTCGTGCCGC CCGCGCACGG CGGGCCCGCG CTGCGGCTCG ACTTCGAGAT GCTCGGCCTC
GACGCGAGCG AATGGGACGC GACGCGCATT CGGCTGTTCA TCGGCGGCGA GCGGCTGCAC
GCAAGCCGCC TGTTCGCGCT GCTGATGCAG CACGTCGTAG CGGTCGAGAT CGCAGGCGGC
CCGCCCGAGC TGCCCGGCCC GCGCTGCGCG CTCGGCGCCC GCGCGCTGCG CCCGGCGGGC
TTCGACGACG CGCTGCTGCC CTGCCCCGAG CGGGCGTTCC CCGGCTTCCG GCTGCTGCAC
GAATATTTCG CGTTCGCCGA GAAATTCCTG TTCGTCGAGC TGGGCGGCCT CGAGCGCTGG
CGCGCGGCGC GCGCGGGCGC GCAGTTCAGC GTATGGCTCG CGCTCGACAG CGCGCCCGAC
TGGCTGCCCG GCATCGATCG CGACAGCTTC CGGCTGAACG TCGCCGCCGC GCTGAACCTG
TTCGCGCACG AAGCGGTGCC GATCCAGCAC GAGCATCGCG CGACCGATTA CCGGCTGCAG
CCCGAAGGCG ACACGTCGGG CCACTACCGG ATCTACTCGG TCGACCGCGT GATCGGCTAC
CGCCCCGGCC ACGCGGTCGA CCGCCATTAC GTGCCGTTCG GCGTCGCGGG CGACGACGCG
AACGCCGCGA GCTACCGGCT GATTCGCCGC GCCGCGCTCG ACGGCCACGG GCAGGATCTC
CATCTGGCGC TCGCCTACCC GCCCGGCGAG GCGCTCGCCA CGGAAACGCT GTCGATCGGC
CTGTCGTGCA CGAACGGCGC GTTGCCCGCG CGCCTGAAGA TCGGCGACGT GTGCCGCGCG
ACCGACAGCT CGCCCGAGCG CTTCACGTTC GCCAACATCG CGCCCGTGAG CCCGCCGCTC
GATCCGCCGC TCGGCGAACC GCTGCTGTGG CGCACCATCA GCCATCTCGC GCTGAACTTC
CTGTCGCTCG GCGACGCCGA TCATCTGAAG CGCATGCTCG CGCTCCACGC GTTCGGCGAA
CGCGGCGACG ACGCGCGCGC GCAAGCCGAC CGCCGCCGCA TCGACGGCAT CGAATCGGTC
GACGTGCGGG CCGAGACGCG GATCATCGGC GAGCGGATGC TGAGCGGCCA GCGCGTGGCG
CTGCGCTGCA GCGCCCATGC GTTCGGCGGC GCGGGCGAGC TGTATCTGTT CGGCTGCGTG
CTCGAGCGCT TTCTGGCCGA ATACGCGGCG ATCAACACCT ACACGCGCGT CGAGATCGAC
GCGTCGCCCG ACGGCGTGCG CTTCGCGTGG CCGCCGCGAA TGGGGGCGCA ATGCCTGCTC
TAG
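
The GC content field of the record can in principle be recomputed from the gene sequence above. The following is a rough sketch, assuming the sequence is pasted as plain text in blocks of ten bases separated by spaces, as it is displayed here; `gc_content` is a hypothetical helper, and the database may round or compute the value differently.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full sequence to compare against the GC content field.
print(gc_content("ATGGCCACCA CCACGCCCAA"))
```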
 
Protein sequence
MATTTPNRYY EDELVRLREL AAEFARAHPL LAPMLGAPSG DPDVERLLEG VAFLTGLARQ 
KLDEGLPELV QALANLLFPH SLRPVPAATL IAFEPRGALR ERAVIAAGTE IESVPVDGTA
CRFRTCGELD IEPIALAGCR FVPPAHGGPA LRLDFEMLGL DASEWDATRI RLFIGGERLH
ASRLFALLMQ HVVAVEIAGG PPELPGPRCA LGARALRPAG FDDALLPCPE RAFPGFRLLH
EYFAFAEKFL FVELGGLERW RAARAGAQFS VWLALDSAPD WLPGIDRDSF RLNVAAALNL
FAHEAVPIQH EHRATDYRLQ PEGDTSGHYR IYSVDRVIGY RPGHAVDRHY VPFGVAGDDA
NAASYRLIRR AALDGHGQDL HLALAYPPGE ALATETLSIG LSCTNGALPA RLKIGDVCRA
TDSSPERFTF ANIAPVSPPL DPPLGEPLLW RTISHLALNF LSLGDADHLK RMLALHAFGE
RGDDARAQAD RRRIDGIESV DVRAETRIIG ERMLSGQRVA LRCSAHAFGG AGELYLFGCV
LERFLAEYAA INTYTRVEID ASPDGVRFAW PPRMGAQCLL