Gene BURPS1106A_1793 details


Gene Information

Locus tag: BURPS1106A_1793
Symbol:
ID: 4899856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1754657
End bp: 1755667
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 73%
IMG OID: 640135023
Product: type II secretion system protein
Protein accession: YP_001066062
Protein GI: 126452133
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2064] Flp pilus assembly protein TadC
TIGRFAM ID:
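
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes its stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(1754657, 1755667)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both results agree with the Gene Length and Protein Length fields listed in the record.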


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.105995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCCCA GCCGCCTCGG CGCAATCGCG CTCGTTCTCG GCGCAATCGG CGTGCTGATG 
CTCGCCGCGC TCGCGATCAT GCAGGCCGTG CTCGCGCGGC GCACCGGCCG CACGCTCGCG
GACGCGCTCG ATCAGCGCGC CGCCGCGTTG GAGGCGGCCG CCGCGCGGGT CGCGGCGGGG
GCGGCCGGCG CGGCGCGCGC GGGCATGCCC GAGGCGGCGC CTGACGCGCG CCGTCCGCGC
TTCGCGGCGC TGCTCGATCG CGCGGGCCGG TTCGGAATGC GGCTGCTCGA TACGCGGCTC
GGCAAGCAGA TCGTCGCCGA CGAAGACCGG ATGCTGCTCG AACAGTGCGG CTACGTCGAC
GCGCACACGC GCGGCATCTT CCTGAGCGCG CGGATCGCGT GTGCGATCGC GCTGCCCGCC
GCCGTCGCGC TCGTCGGCGG CGAGCCGGTC CGCACGCATC TGGGCGCGTG GGTCGCGCTG
TCGGTGATCG CCGGCTTCAT GCTGCCGAAG ACCTACGTGC GCCGCCGCGC GGCGGCGCGC
CGCCAGTCCG TCGTCGACGA GATGCCGCTG CTCGTCGACA TGCTGCGGCT CTTGCAGGGC
GTCGGGCTGT CGCTCGACCA GAGCATCCAG GTCGTCACCA ACGACTTCAG GGGGATGCTG
CCCGTGCTGT CGTCGGAGCT CGGGATCGCG CAGCGGCAGT TCGTCGCGGG GCGCACGCGC
GAGCAGTCGC TGCAGCGTCT CGCGACGAGC TTCGACAACG AGGACCTGCG CGCGATCGTG
CGCCTGCTGA TCCAGGTCGA CAAGCACGGC GGCGCGGTGC AGGAGCCGCT CAAGCAGTTC
GGCGACCGGC TGCGCGAAGT GCGCCGCGCG ATGCTGCGCG AGCGCATCGG CCGCCTTACG
GTGAAAATGA CGGGCGTGAT GATTCTCACG CTGCTGCCCG CGCTGTTCAT CGTGACGGCG
GGGCCGGGGA TGCTCGCCGT CACGCATGCG CTCACGGCCG CGCGCCGCTA G
 
Protein sequence
MDPSRLGAIA LVLGAIGVLM LAALAIMQAV LARRTGRTLA DALDQRAAAL EAAAARVAAG 
AAGAARAGMP EAAPDARRPR FAALLDRAGR FGMRLLDTRL GKQIVADEDR MLLEQCGYVD
AHTRGIFLSA RIACAIALPA AVALVGGEPV RTHLGAWVAL SVIAGFMLPK TYVRRRAAAR
RQSVVDEMPL LVDMLRLLQG VGLSLDQSIQ VVTNDFRGML PVLSSELGIA QRQFVAGRTR
EQSLQRLATS FDNEDLRAIV RLLIQVDKHG GAVQEPLKQF GDRLREVRRA MLRERIGRLT
VKMTGVMILT LLPALFIVTA GPGMLAVTHA LTAARR
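
The sequences above are printed in space-separated blocks of ten characters, so any downstream calculation has to strip that whitespace first. As a rough sketch (not part of the original record), the GC fraction of a blocked nucleotide string could be computed as below. The function name is hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence, for illustration only.
example = "ATGGATCCCA GCCGCCTCGG"
print(f"{gc_content(example):.0%}")  # 70%
```

Passing the complete gene sequence, line breaks included, to the same function would give the value to compare against the GC content field of the record.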