Gene BURPS668_A0804 details


Gene Information

Locus tag: BURPS668_A0804
Symbol:
ID: 4888870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 778677
End bp: 780335
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 72%
IMG OID: 640130744
Product: hypothetical protein
Protein accession: YP_001061803
Protein GI: 126442377
COG category: [S] Function unknown
COG ID: [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
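
The start, end, gene length and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(778677, 780335)
print(length, protein_length_aa(length))  # 1659 552
```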


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.624526
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG 
CTGTCGAGCG CGGCGATGAC GAGCTCCGCC GTATCGACCA CCGGCACGAC GAACACGGTG
GCGTCGGGCG GCGCGGCCGC GGGCGCGCCG AGCTTCGCGC CGCCGGCCGC CGACGCGTTT
CCCCAAGCCG ACGGCGCGCC CCGCAACCCC GCGGTGCTGC AATTCCCGGT GCCGGGCGGC
GCGCCCGCGC CCGCCGACGC GCGGGTGGCC GCGCCCGTCG TCTACAGCGC GCAGGGCGAG
CAGGCCGCGA TCATGAAAGC AGGCCTGCAG CAGGCGAGCT GGAACAACCC GTTCGTGTCG
CACGCGCTGC CCGCGGTGCT GCAACTGCAG CGCCACCTCG CGGCCGGCCC GCTCAATCAG
GCCGCGATCC GCACGCAGCT CGGCCTCGAG GTGCGGCTCT ACCGCGAGCG GCTCGCCGCC
TCCGGCTGCG AATGGGAGCA GATCCGCGAC GCGTCGTACC TGCTCTGCAC GTATCTCGAC
GAAACCGTCA ACGACGCGGC GCGCGAGCAC GCGCAAGTCG TCTACGACGG CGAGCGCAGC
CTGCTCGTCG AATTCCACGA CGACGCGTGG GGCGGCGAGG ACGCGTTTGC CGACCTGTCG
CGCTGGATGA AGACCGAGCC GCCGCCGATT CCGCTTCTGT CGTTCTACGA ACTGATCCTG
TCGCTCGGCT GGCAGGGCCG CTACCGCGTG CTCGACCGCG GCGACGTGCT GCTGCAGGAT
CTGCGCTCGC AACTGCACGC GCTGATCTGG CATCACGTGC CGCCCGAGCC GCTCGGCACC
GAGCTCGTCG CGCCCGCGAA GCGGCGCCGC TCGTGGTGGA CGGCCGGGCG CGCGGCGGCC
GTCGCGCTCG GCGTGCTGGT GCTCGCGTAC GGCGCGATCA GCTTCTGGCT CGATTCGCAG
GGCCGCCCGA TCCGCAACGC GCTCGCCGCG TGGATGCCGC CCACGCGCAC GATCAACATC
GCCGAGACGC TGCCGCCGCC GCTGCCGCAG ATTCTCACCG AAGGGTGGCT CACCGCGTAC
AAGCATCCGC AAGGATGGCT GCTCGTGTTC AAGAGCGACG GCGCGTTCGA CGTCGGCAAG
GCGAACGTGC GGGCGGACTT CATGCACAAC ATCGAGCGGC TCGGCCTCGC GTTCGCGCCG
TGGCCGGGCG ACCTCGAGGT GATCGGCCAC ACCGATTCGC GGCCGATCCG CACGAGCGAG
TTCCCGGACA ACCAGGCGCT GTCCGAAGCG CGGGCGCGCA ACGTCGCCGA CGAACTGCGC
AAGACCGCGC TGCCGGGCGG CGCGCGCGCG CCGGAGAACG CGGTGCAGCG CAACATCGAG
TACTCGGGGC GCGGCGACGC GCAGCCGATC GACACCGCGA AGACGGCCGC CGCGTACGAG
CGCAACCGCC GCGTCGACGT GCTATGGAAG GTGATTCCCG ACGGCGCGCA GCAATCGGGC
CGCAGCCTGA ACCTGCAGCA GCCGGAGAAG CCCGGGCAGG TGCCGATGCG TCCGGCGATG
CCGGAGGGCG TGGAGATCGC GCCTGACGGG CAACTGCCGT ATGCGACGTC AACCACGATG
CCAGCAACGA GACCGACCAC GGAGGGCCGT CAGCCATGA
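
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so it needs to be joined before its length or GC content can be compared with the values in the record. A minimal sketch of that step follows. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Running the same two functions over the full 1659 bp sequence should give a value comparable to the GC content reported in the record.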
 
Protein sequence
MLDRAAANPD LVTRVPTLSS LSSAAMTSSA VSTTGTTNTV ASGGAAAGAP SFAPPAADAF 
PQADGAPRNP AVLQFPVPGG APAPADARVA APVVYSAQGE QAAIMKAGLQ QASWNNPFVS
HALPAVLQLQ RHLAAGPLNQ AAIRTQLGLE VRLYRERLAA SGCEWEQIRD ASYLLCTYLD
ETVNDAAREH AQVVYDGERS LLVEFHDDAW GGEDAFADLS RWMKTEPPPI PLLSFYELIL
SLGWQGRYRV LDRGDVLLQD LRSQLHALIW HHVPPEPLGT ELVAPAKRRR SWWTAGRAAA
VALGVLVLAY GAISFWLDSQ GRPIRNALAA WMPPTRTINI AETLPPPLPQ ILTEGWLTAY
KHPQGWLLVF KSDGAFDVGK ANVRADFMHN IERLGLAFAP WPGDLEVIGH TDSRPIRTSE
FPDNQALSEA RARNVADELR KTALPGGARA PENAVQRNIE YSGRGDAQPI DTAKTAAAYE
RNRRVDVLWK VIPDGAQQSG RSLNLQQPEK PGQVPMRPAM PEGVEIAPDG QLPYATSTTM
PATRPTTEGR QP
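
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons are permitted, which is not handled here). The `translate` helper is illustrative only. It stops at the first stop codon and writes `X` for any codon it does not recognise.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring display whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGCTCGATC GCGCGGCGGC GAATCCGGAT"))  # MLDRAAANPD
print(translate("CAGCCATGA"))  # QP
```

Both outputs match the start and end of the protein sequence listed above.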