Gene BURPS1106A_1889 details


Gene Information

Locus tag: BURPS1106A_1889
Symbol:
ID: 4901040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1845679
End bp: 1846674
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 73%
IMG OID: 640135119
Product: type II secretion system protein
Protein accession: YP_001066154
Protein GI: 126452248
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4965] Flp pilus assembly protein TadB
TIGRFAM ID:
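
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
start_bp = 1845679
end_bp = 1846674

# Inclusive coordinates: length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# ASSUMPTION: the last codon is a stop codon and is not part of the protein.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 996
print(remainder)          # 0
print(protein_length_aa)  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.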


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCGG CGGACGTCGT TGCCGTCGGC GCGTTTTTCG CGATCGTCGT CGCGGGCTTC 
ATCGTGCGCG CGCTGCGCGA TCTCGCGCGG CGGCGGCCCG CCGCGCGCGT GCGCTCGCGC
GTCGAGGCGC TGCGCGAGCC GCGCGCCGCG GCGCGGCCGG CGGCGCCCGC GCGCGCGTCG
CGCGTCGGGC TGCAATTGTT CACCCGCACG CACGGCGAAG GCGAGGGCGG CGCGCTGCGC
GCCTGGCTGC GCGCGCGCGG CGAGCACGTG CGCACGGCGG CGGGCGGCGG CGGCGTGCGC
GCGATCGCGT TCGCGTCCGC GCTCGCGGCG CTTGCCGGTT TCGTCGGCGC GTCGTTCGCG
GGCTTCGCGC CCTGGCTGCG GCCCGCGCTC GCGGCGGCGC TCGCGGCCGG CGCGGCGCGC
GCCGTCTACC GGATTCTGAT CGGGCGCTTC AAGCAGCGCT TCCTCTCGGT GTTCCCGGAC
GCGCTCGATC TGATCATTCG CGCGGTGCGC GCGGGCATTC CGGTCGCGCA GGCGATCGGC
ACCGCGGGCC GCGAAAGCGA GGAGCCCGTG CGCGCGACGT TTCGCGCGAT GGGCGACGCG
CTGCGCGTCG GCGCGGATCT GAAGGACGTG CTCGAGCAGC AGGCCGAGCG CCTGCAGCTC
GCCGATTTCT CGTTCTTCGG CGTGTGCCTC GTCTTGCAGC GCGAGACGGG CGGCAATCTG
ACGGAAACGC TCGAGAACCT CTCGGGCATC ATCCGCACGC GCCGCGACAT CCGGATGAAG
ACGCGCGCGC TGACGGCCGA AGGGCGCATC GCGAGCAAGA TCATCGCGGC CGTGCCGTTC
GCGATCGCCG GGTTCCTGTT CGTCGTGAAC CGTCCATACG TCAATCTGCT GTTCCACACG
CGCGCGGGGC ACAAGATGCT GATCCTCGCC GCGGTGCTGC TCACCGTCGG TCTCGCGATG
ATTCGCAAGA TCGCCAACCT GGACACTTCG CGATGA
 
Protein sequence
MSAADVVAVG AFFAIVVAGF IVRALRDLAR RRPAARVRSR VEALREPRAA ARPAAPARAS 
RVGLQLFTRT HGEGEGGALR AWLRARGEHV RTAAGGGGVR AIAFASALAA LAGFVGASFA
GFAPWLRPAL AAALAAGAAR AVYRILIGRF KQRFLSVFPD ALDLIIRAVR AGIPVAQAIG
TAGRESEEPV RATFRAMGDA LRVGADLKDV LEQQAERLQL ADFSFFGVCL VLQRETGGNL
TETLENLSGI IRTRRDIRMK TRALTAEGRI ASKIIAAVPF AIAGFLFVVN RPYVNLLFHT
RAGHKMLILA AVLLTVGLAM IRKIANLDTS R
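
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The function name `translate_cds` and the constants are hypothetical; the codon-to-amino-acid string is the standard assignment shared by tables 1 and 11, and the start codon set is the one listed by NCBI for table 11.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the start codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid initiating codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above (the first three 10-base blocks).
head = "GTGAGCGCGG CGGACGTCGT TGCCGTCGGC"
print(translate_cds(head))  # MSAADVVAVG
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole 331-residue protein, since the final codon TGA is a stop codon.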