Gene BURPS1710b_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1940 
Symbol 
ID3688900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2113730 
End bp2114776 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content73% 
IMG OID637728396 
ProductCpaB family Flp pilus assembly protein 
Protein accessionYP_333339 
Protein GI76810330 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3745] Flp pilus assembly protein CpaB 
TIGRFAM ID[TIGR03177] Flp pilus assembly protein CpaB 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0235089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTGCT GCCGACCACG CTCACGAGCC GCGCGACGGT GCAGATCAAC CCGACCAACA 
TCATATGAAG TACCCGAGCG ACTAGGCCGC AGCGGCGCGC CGCCGGTTTT CCCCAACGTC
ATTCTCGCTG TTGCACTGAT AACAACCATG GCCAATCATC TGACCAAGAT CATCGCGGGG
CTGCTGATCG GGATCGCGAT CCTGCTCGGC ATTTACGCAT GGCTGCTCGG GCGCAAGCCG
GCGCCTGTCG CGCCGGGCGC CGCGCCCGCC GTGGCGACGG CGATGGTGCC CGTCGTCGTC
GCGGCGCGCG CGCTGCCCGC CGGGCAGCCG ATTCCCGCCG ATGCGCTGAA GGTGCAGCAG
ACGCCGACGC CGATCGCCGG CGCCTTCCCG AATCCGATGC TCGTGACGGG CCGCATCCCG
GCGAGCGACA TCGGCGCGCA GGCGCCGGTG CTCGAGAGCG AGCTGATGTC GGGCCTCGCC
GACCAGATCG CGCCCGGCGA GCGTGCCGTC GCGATCAAGG TCGACGATAC GAACGCGGTC
GGCAACCGGC TGCGTCCCGG CAATTTCGTC GACGTGTTCG TGAACCTGAA GCGCGAAGGC
GGCTTCGGTG CGACCGGCTC CGAGATCGCG CAGACCCAGG CGCGGCTGCT GCTGTCGCGG
GTGCGCGTGC TGTCGTTCGG CGATGCGACG GTGGAGCGCG ACGGCACGCC GGGCCCGACG
GGCGCGGGCG CGCGCACCGC GGTGCTCGCC GTGCCGACCG CGCAGGTCGA CGCGCTCACG
CTCGCCGAGG CGAGCGGGCG GCTCGTGCTC GCGCTGCGCA GCCCGCGCGA CGAAGACATC
GCCGCGCAGA CGGTGGCGAT CCGCGCGCCG GCCGGCGCCG GGCCGTCGAA TCAGGCGGCG
ACGGGGCTCG TGCTGAGCGA ACTGTCGGGC AGCGGGGCTC CCGCGCAGGC GCCGCGCGCG
GCTCCGACGC GAGTGACGGC CGCGCCGCAT GCGGCGGGCA GCATCGAAGT GATCCGGGGA
GGGCGAGCCG AGACGCTCGC CTATTGA
 
Protein sequence
MRCCRPRSRA ARRCRSTRPT SYEVPERLGR SGAPPVFPNV ILAVALITTM ANHLTKIIAG 
LLIGIAILLG IYAWLLGRKP APVAPGAAPA VATAMVPVVV AARALPAGQP IPADALKVQQ
TPTPIAGAFP NPMLVTGRIP ASDIGAQAPV LESELMSGLA DQIAPGERAV AIKVDDTNAV
GNRLRPGNFV DVFVNLKREG GFGATGSEIA QTQARLLLSR VRVLSFGDAT VERDGTPGPT
GAGARTAVLA VPTAQVDALT LAEASGRLVL ALRSPRDEDI AAQTVAIRAP AGAGPSNQAA
TGLVLSELSG SGAPAQAPRA APTRVTAAPH AAGSIEVIRG GRAETLAY