Gene BURPS1106A_A0716 details

Gene Information

Locus tag: BURPS1106A_A0716
Symbol:
ID: 4906339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 704324
End bp: 705982
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 72%
IMG OID: 640143822
Product: hypothetical protein
Protein accession: YP_001074752
Protein GI: 126457957
COG category: [S] Function unknown
COG ID: [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
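
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(704324, 705982)
print(length)                  # 1659
print(protein_length(length))  # 552
```

Both results match the Gene Length and Protein Length fields of this record.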


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG 
CTGTCGAGCG CGGCGATGAC GAGCTCCGCC GTATCGACCA CCGGCACGAC GAACACGGTG
GCGTCGGGCG GCGCGGCCGC GGGCGCGCCG AGCTTCGCGC CGCCGGCCGC CGACGCGTTT
CCCCAAGCCG ACGGCGCGCC CCGCAACCCC GCGGTGCTGC AATTCCCGGT GCCGGGCGGC
GCGCCCGCGC CCGCCGACGC GCGGGTGGCC GCGCCCGTCG TCTACAGCGC GCAGGGCGAG
CAGGCCGCGA TCATGAAAGC AGGCCTGCAG CAGGCGAGCT GGAACAACCC GTTCGTGTCG
CACGCGCTGC CCGCGGTGCT GCAACTGCAG CGCCACCTCG CGGCCGGCCC GCTCAATCAG
GCCGCGATCC GCACGCAGCT CGGCCTCGAG GTGCGGCTCT ACCGCGAGCG GCTCGCCGCC
TCCGGCTGCG AATGGGAGCA GATCCGCGAC GCGTCGTACC TGCTCTGCAC GTATCTCGAC
GAAACCGTCA ACGACGCGGC GCGCGAGCAC GCGCAAGTCG TCTACGACGG CGAGCGCAGC
CTGCTCGTCG AATTCCACGA CGACGCGTGG GGCGGCGAGG ACGCGTTCGC CGACCTGTCG
CGCTGGATGA AGACCGAGCC GCCGCCGATT CCGCTTCTGT CGTTCTACGA ACTGATCCTG
TCGCTCGGCT GGCAGGGCCG CTACCGCGTG CTCGACCGCG GCGACGTGCT GCTGCAGGAT
CTGCGCTCGC AACTGCACGC GCTGATCTGG CATCACGTGC CGCCCGAGCC GCTCGGCACC
GAGCTCGTCG CGCCCGCGAA GCGGCGCCGC TCGTGGTGGA CGGCCGGGCG CGCGGCGGCC
GTCGCGCTCG GCGTGCTGGT GCTCGCGTAC GGCGCGATCA GCTTCTGGCT CGATTCGCAG
GGCCGCCCGA TCCGCAACGC GCTCGCCGCG TGGATGCCGC CCACGCGCAC GATCAACATC
GCCGAGACGC TGCCGCCGCC GCTGCCGCAG ATTCTCACCG AAGGGTGGCT CACCGCGTAC
AAGCATCCGC AAGGATGGCT GCTCGTGTTC AAGAGCGACG GCGCGTTCGA CGTCGGCAAG
GCGAACGTGC GGGCGGACTT CATGCACAAC ATCGAGCGGC TCGGCCTCGC GTTCGCGCCG
TGGCCGGGCG ACCTCGAGGT GATCGGTCAC ACCGATTCGC GGCCGATCCG CACGAGCGAG
TTCCCGGACA ACCAGGCGCT GTCCGAAGCG CGGGCGCGCA ACGTCGCCGA CGAACTGCGC
AAGACCGCGC TGCCGGGCGG CGCGCGCGCG CCGGAGAACG CGGTGCAGCG CAACATCGAG
TACTCGGGGC GCGGCGACGC GCAGCCGATC GACACCGCGA AGACGGCCGC CGCGTACGAG
CGCAACCGCC GCGTCGACGT GCTGTGGAAG GTGATTCCCG ACGGCGCGCA GCAATCGGGC
CGCAGCCTGA ACCTGCAGCA GCCGGAGAAG CCCGGGCAGG TGCCGATGCG TCCGGCGATG
CCGGAGGGCG TGGAGATCGC GCCTGACGGG CAACTGCCGT ATGCGACGTC AACCACGATG
CCAGCAACGA GACCGACCAC GGAGGGCCGT CAGCCATGA
 
Protein sequence
MLDRAAANPD LVTRVPTLSS LSSAAMTSSA VSTTGTTNTV ASGGAAAGAP SFAPPAADAF 
PQADGAPRNP AVLQFPVPGG APAPADARVA APVVYSAQGE QAAIMKAGLQ QASWNNPFVS
HALPAVLQLQ RHLAAGPLNQ AAIRTQLGLE VRLYRERLAA SGCEWEQIRD ASYLLCTYLD
ETVNDAAREH AQVVYDGERS LLVEFHDDAW GGEDAFADLS RWMKTEPPPI PLLSFYELIL
SLGWQGRYRV LDRGDVLLQD LRSQLHALIW HHVPPEPLGT ELVAPAKRRR SWWTAGRAAA
VALGVLVLAY GAISFWLDSQ GRPIRNALAA WMPPTRTINI AETLPPPLPQ ILTEGWLTAY
KHPQGWLLVF KSDGAFDVGK ANVRADFMHN IERLGLAFAP WPGDLEVIGH TDSRPIRTSE
FPDNQALSEA RARNVADELR KTALPGGARA PENAVQRNIE YSGRGDAQPI DTAKTAAAYE
RNRRVDVLWK VIPDGAQQSG RSLNLQQPEK PGQVPMRPAM PEGVEIAPDG QLPYATSTTM
PATRPTTEGR QP
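
Simple properties of the gene sequence listed above can be recomputed from the text itself. The sketch below strips the 10-base block spacing used in the listing, computes GC content as a percentage, and checks that a sequence begins with ATG and ends with one of the translation table 11 stop codons (TAA, TAG, TGA). The helper names are hypothetical, the inputs shown are toy strings rather than the full gene, and only ATG is tested as a start codon even though table 11 permits alternatives.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to block the listing; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def starts_with_atg_and_ends_with_stop(seq: str) -> bool:
    """True if the first codon is ATG and the last three bases are a stop."""
    s = clean(seq)
    return len(s) >= 6 and s[:3] == "ATG" and s[-3:] in STOP_CODONS


print(gc_content("ATGC GGCC"))                           # 75.0
print(starts_with_atg_and_ends_with_stop("ATG AAA TGA"))  # True
```

Pasting the full gene sequence into gc_content gives a value that can be compared with the 72% reported in the GC content field.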