Gene BURPS1710b_A0528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0528 
Symbol 
ID3693131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp713890 
End bp715026 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content70% 
IMG OID637730782 
ProductHep_Hag family protein 
Protein accessionYP_335687 
Protein GI76818786 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGCGG TCATCCGAAA CGCCGTCAAT CTGGCGCCCG ATGCGAACGG GGACTTCTCC 
GGCCGCTCGG CGATGCCGAT CGAAATGGCC GCGAACGCCG CGCTGAGATC GCTGAAGAAA
AATCCGGGCG ACGCCGGCCA TGCCGCTCCG GCATACCTGC CTGCCGAGCG GATCGGCCAG
TTGCGGGAAA AGGTCCGAAG GACCATCGAG GCGCTCGAAT CGAACCGCCC ACCGAAACCG
CAGCCGCGGT CGACACCACC GCAATCGACG CCACCCAAGC CGACGCAGCA CCCGACCGCG
CCCAACCCGA ACGTACCCGA CGCATCGACG CCTGATGCAT CGACGCCTGA CGCTTCGACG
CCCGACGCAT CGACGCCCGA CGCATCGACG CCCAGCCGAC CTGCCCCTGC CCCCCGAGCG
GGCACGGGCG CGCCCGCTGC TTCGGCGGCG ACGCGCGCCC CCGCCTTTGC AAACCGCGTG
CGCAAGCCGA ATCCGGCTAT GCCCGCCGCG TCGTCGCATG CGATCGCGAG CGACTTCGCG
TCGAGCAACG CGTTCGCGAT CGGCGACGAC TCGACCGCCG TCGGAGCGCA AGCGATCGCG
TTCAGCGAGC AATCGATCGC CATAGGCTCG CGCGCGATTG CCGCCGGCGC CCGTTCGATC
GCCGTCGGCA CGGACGCGAC AGCAGCCGCC CCCGATTCGG TCGCCCTCGG CTCGGGCTCC
ATCGCCGAAC GCGAAGGCAC GGTGTCCGTC GGCAGAGACG GCCACGAACG CCAGATCACC
CATGTCGCAT CCGGCACCGA GCCCACCGAC GCCGTCAACG TCACGCAACT GCGCGCGGCA
ATGTCGAACG CCAACGCGTA CACGAACCAG CGCATCGGCG ATCTTCAGCA GAGCATCACC
GACACCGCGC GCGACGCGTA TTCCGGCGTC GCCGCCGCGA CCGCGCTGAC GATGATTCCC
GATGTCGACC GCGACAAGAG GGTGTCGATC GGCGTCGGCG GCGCGGTCTA CAAGGGCCAT
CGCGCCGTCG CGCTCGGCGG CACCGCGCGC ATCAACGAAA ACCTCAAGGT GCGGGCGGGC
GTCGCGATGA GCGCGGGCGG CAATGCCGTG GGCATCGGCA TGAGCTGGCA ATGGTAA
 
Protein sequence
MDAVIRNAVN LAPDANGDFS GRSAMPIEMA ANAALRSLKK NPGDAGHAAP AYLPAERIGQ 
LREKVRRTIE ALESNRPPKP QPRSTPPQST PPKPTQHPTA PNPNVPDAST PDASTPDAST
PDASTPDAST PSRPAPAPRA GTGAPAASAA TRAPAFANRV RKPNPAMPAA SSHAIASDFA
SSNAFAIGDD STAVGAQAIA FSEQSIAIGS RAIAAGARSI AVGTDATAAA PDSVALGSGS
IAEREGTVSV GRDGHERQIT HVASGTEPTD AVNVTQLRAA MSNANAYTNQ RIGDLQQSIT
DTARDAYSGV AAATALTMIP DVDRDKRVSI GVGGAVYKGH RAVALGGTAR INENLKVRAG
VAMSAGGNAV GIGMSWQW