Gene BURPS1106A_A2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2087 
Symbol 
ID4906079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2050986 
End bp2052863 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content70% 
IMG OID640145192 
ProductYscC/HrcC family type III secretion outer membrane protein 
Protein accessionYP_001076120 
Protein GI126457615 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02516] type III secretion outer membrane pore, YscC/HrcC family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTATG CCGACACGGC CAGACGATAC GCAGCCGCGC TTGCCGCGTC GCTGCTGATG 
ACGGGCGCGG CGTTCGCCGC GCCGACCGAC GCGGCGCCGC TCGCCGACGC GCCCGCGGCG
GCGAGCGCGA CGCCCGACGC ACGGGACGGC GACGCGTCGC GCGTGGCCGC GCCGCCGGCG
CGCGCGCCGC AGGACGACGA GCGCCACTTC GTCGCGAACG ACGCGAGCAT CAGCGTGCTG
CTCAATGCGC TGTCGGGCCG GCTGCACAAG CCGATCGTCG CGAGCGAGAA GGTGCGCCGC
AAGCACGTGA CGGGCGAGTT CGACCTCGCG CAGCCCCGCG CGCTGCTCGC GCGGCTCGGC
GAATCGATGT CGCTGCTCTG GTACGACGAC GGCGCGTCGA TCTACATCTA CGACAACTCG
GAGATCAAGA ACGCGGTCGT CTCGATGCGC CATGCGACGG TGCGCAACCT GCGCGATTTC
ATCCGGCAGA CGCGGCTCTA CGATCCGCGC TTTCCGGTGC GCGGCGACGA CCTGAGCAAC
ACGTTCTACG TGACGGGCGC GCCCGTCTAC GTGAATCTCG TCGCCGCCGC CGCGCGCTAT
CTCGACGAGG TGCGCTCGAA CGAGGCGAGC GACCGGCAGG TCGTGCGCGT CGTGCAGCTT
CACAACAGCT TCGTCGTCGA CCGCCAGTAC ACGCTGCGCG ACAAGGAAGT CGACATCCCG
GGCATGGCGA CCGTGCTCGG CCGCATCTTC GGCCCGGCGC GGCCGGGCGC GCCGGCGGAC
TCGCCCGTCG CGGCGGCCGA CGCCACGGCG CGCGGCGGCG CGGGCGGCGC GGCGGGCAAG
CCGGCGTTCT CGCTTGCCGA TGCGTTGCCC GCGCCGCTCG ACGCCGGCAA CGCGCCGGGC
GGCGCGGGCT CGACGCATTC GACAAACCCG GCGAACGCCG CGAGCCCGAT GGGCGGCGCG
GCGGGCGGCG TCGCGCTGCC CGCGTCGGAC GGCGTGCGCG CGGTGGCGTA TCCGGACACG
AACAGCGTGA TCCTCGTCGG CCGGCTCGAC AAGGTGCAGG ACATGGAGGC GCTGATCCGC
TCGCTCGATG TCGAGAAGCG GCAGATCGAG CTGTCGCTGT GGATCATCGA CATCCGCAAG
AGCCGGCTCG ATCAGCTCGG CATCGACTGG CAGGGTGCGC TCAATGCGCC GGGTATCGGC
GTCGGCTTCA ACAATCGCGG CGGCAACGTG ACGACGCTCG ACGGCACCAG GTTCCTCGCA
TCGGTCGCGG CGCTCAGCCA GACGGGCGAC GCCACCGTGA TCTCGCGGCC GATCGTGCTC
ACGCAGGAGA ACGTGCCGGC GACGTTCGAC AGCAACCAGA CGTTCTACGC AAAGCTGATC
GGCGAGCGCA CGGTGCAGCT CGATCACGTG ACCTACGGCA CGCTCGTCAA CGTGCTGCCG
CGGCTCACGC GCGATGGGTC GCAGGTCGAG ATGATCGTCG ACATCGAGGA CGGCAACACC
GACGGCGCGA CGAGCGACGG CCAGATCGTC ATCGACAACA ACACGATGCC GCTCGTGAAC
CGCACCGAGA TCAACACGGT CGCGCGCGTG CCGCACGAGA TGAGCCTGCT CATCGGCGGC
AACACGCGCG ACGACGTCAC GCGCCGCACG TTCCGGATTC CCGGGCTGGC CAGCATTCCG
CTGATCGGCG GGCTGTTTCG CGGGCATTCG GATCGGCACG AGCAGGTGGT GCGCGTGTTC
CTGATCCAGC CGAAGCTGCT GCGCGCGGGC GCGGCCTGGC CCGACGGCCA GCCGTGGGAA
TCGGGCGATC CGGCGGACAA CGCGACGCTG CGCGCGACCG TGCAGATGCT CAAACCCTAC
ATGGACGACA AGTCATGA
 
Protein sequence
MMYADTARRY AAALAASLLM TGAAFAAPTD AAPLADAPAA ASATPDARDG DASRVAAPPA 
RAPQDDERHF VANDASISVL LNALSGRLHK PIVASEKVRR KHVTGEFDLA QPRALLARLG
ESMSLLWYDD GASIYIYDNS EIKNAVVSMR HATVRNLRDF IRQTRLYDPR FPVRGDDLSN
TFYVTGAPVY VNLVAAAARY LDEVRSNEAS DRQVVRVVQL HNSFVVDRQY TLRDKEVDIP
GMATVLGRIF GPARPGAPAD SPVAAADATA RGGAGGAAGK PAFSLADALP APLDAGNAPG
GAGSTHSTNP ANAASPMGGA AGGVALPASD GVRAVAYPDT NSVILVGRLD KVQDMEALIR
SLDVEKRQIE LSLWIIDIRK SRLDQLGIDW QGALNAPGIG VGFNNRGGNV TTLDGTRFLA
SVAALSQTGD ATVISRPIVL TQENVPATFD SNQTFYAKLI GERTVQLDHV TYGTLVNVLP
RLTRDGSQVE MIVDIEDGNT DGATSDGQIV IDNNTMPLVN RTEINTVARV PHEMSLLIGG
NTRDDVTRRT FRIPGLASIP LIGGLFRGHS DRHEQVVRVF LIQPKLLRAG AAWPDGQPWE
SGDPADNATL RATVQMLKPY MDDKS