Gene BURPS668_A1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1996 
Symbol 
ID4886527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1932464 
End bp1933522 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID640131934 
Producttype III secretion system protein HrcU 
Protein accessionYP_001062991 
Protein GI126442638 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1377] Flagellar biosynthesis pathway, component FlhB 
TIGRFAM ID[TIGR01404] type III secretion protein, YscU/HrpY family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.920837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG AAAAGACCGA GGAACCCACC GACAAGAAGC TGCGTGACGC GCGTCGAGAC 
GGTGAGGTAT CCCGTAGCAC GGATCTCTCC GACGCCGTAT CCATGTCCGC TGCGATTCTG
TTGCTGGTTG CGGCCGCCGA TCATTTCGGC GATGCAATGC GAGCGTTGGT CAACGGCGCG
CTAGCGTTCG TTTCCGCCGA TCATTCTCTC GTCGAGATGA CCGCGCGGCT GTACCAGTTC
GGCGGCATCG CGCTATCGGC GGTCATGCCG CTGCTGTTCG TCGCGGCGCT CGCGGGTATC
GGCGGATCGG TCCTCCAGGT CGGGCTGCAG ATATCGCTGA AACCGGTCAT GCCGAATCTC
GGCGCACTCA ATCCGGCTGA AGGTCTGAAG AAGCTTTTTT CGCCGCGTAG CGCGATCGAG
TCCATCAAGA TGATCGTCAA GGCCGTCATC GTGTTCTGCG TGGCGTGGAA AACGATCGTA
TGGCTGTTCC CGCTCATCGC CGGCGCGCTG TATCAATCGC CGCCCGAACT GTCACGCATA
TTCCGGGAGA TCCTGGCGAA GTGGCTGATG GTGGTGGCCG GTCTATGCCT TCTGATGGGG
GCGGCCGACG TGAAACTCCA GCGCTTCATG TTCATGCAGA AGATGAAGAT GACGAAGGAC
GAGGTGAAGC GCGAATCCAA AAACGACGAA GGCGATCCGC TGCTCAAGGG CGAGCGCAAG
CGGCTTGCGC GCGAACTGGC GGCCGCGCCG CCACAGCATC AGGTCGCGCA CGCGAATTTC
GTCGTCGTCA ACCCCACCCA CTACGCGGTC GCGGTTCGTT ACGCGCCCGA CGAGCATCCG
CTCCCCCGCG TGGTCGCGAA GGGCCTCGAC GAAGCGGCCA TCGCACTGCG GCGGGCCGCG
CAAGACGCGA ACATCCCGAT CATCGGCAAT CCCCCTGTCG CGCGCGCGTT GTTCCGAATT
GGCGTCGAGG AGCCGGTGCC CGAAGAACTG TTCGAGATCG TTGCCGCGAT CCTGCGCTGG
ATCGACGCGA TCGGCCCGCG CCGAAACGAA CGGGCCTGA
 
Protein sequence
MSDEKTEEPT DKKLRDARRD GEVSRSTDLS DAVSMSAAIL LLVAAADHFG DAMRALVNGA 
LAFVSADHSL VEMTARLYQF GGIALSAVMP LLFVAALAGI GGSVLQVGLQ ISLKPVMPNL
GALNPAEGLK KLFSPRSAIE SIKMIVKAVI VFCVAWKTIV WLFPLIAGAL YQSPPELSRI
FREILAKWLM VVAGLCLLMG AADVKLQRFM FMQKMKMTKD EVKRESKNDE GDPLLKGERK
RLARELAAAP PQHQVAHANF VVVNPTHYAV AVRYAPDEHP LPRVVAKGLD EAAIALRRAA
QDANIPIIGN PPVARALFRI GVEEPVPEEL FEIVAAILRW IDAIGPRRNE RA