Gene BURPS1710b_A2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2089 
Symbol 
ID3692055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2543176 
End bp2544891 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content72% 
IMG OID637732343 
Producthypothetical protein 
Protein accessionYP_337240 
Protein GI76819156 
COG category[S] Function unknown 
COG ID[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0458657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTGC TACGAAACCT CTCCTCCACG CTGCGCCGCG TATCGATGGC ATCCAACATG 
CTCGATCGCG CGGCGGCGAA TCCGGATCTC GTGACGCGCG TGCCGACGCT GTCGTCGCTG
TCGAGCGCGG CGATGACGAG CTCCGCCGTA TCGACCACCG GCACGACGAA CACGGTGGCG
TCGGGCGGCG CGGCCGCGGG CGCGCCGAGC TTCGCGCCGC CGGCCGCCGA CGCGTTTCCC
CAAGCCGACG GCGCGCCCCG CAACCCCGCG GTGCTGCAAT TCCCGGTGCC GGGCGGCGCG
CCCGCGCCCG CCGACGCGCG GGTGGCCGCG CCCGTCGTCT ACAGCGCGCA GGGCGAGCAG
GCCGCGATCA TGAAAGCAGG CCTGCAGCAG GCGAGCTGGA ACAACCCGTT CGTGTCGCAC
GCGCTGCCCG CGGTGCTGCA ACTGCAGCGC CACCTCGCGG CCGGCCCGCT CAATCAGGCC
GCGATCCGCA CGCAGCTCGG CCTCGAGGTG CGGCTCTACC GCGAGCGGCT CGCCGCCTCC
GGCTGCGAAT GGGAGCAGAT CCGCGACGCG TCGTACCTGC TCTGCACGTA TCTCGACGAA
ACCGTCAACG ACGCGGCGCG CGAGCACGCG CAAGTCGTCT ACGACGGCGA GCGCAGCCTG
CTCGTCGAAT TCCACGACGA CGCGTGGGGC GGCGAGGACG CGTTCGCCGA CCTGTCGCGC
TGGATGAAGA CCGAGCCGCC GCCGATTCCG CTTCTGTCGT TCTACGAACT GATCCTGTCG
CTCGGCTGGC AGGGCCGCTA CCGCGTGCTC GACCGCGGCG ACGTGCTGCT GCAGGATCTG
CGCTCGCAAC TGCACGCGCT GATCTGGCAT CACGTGCCGC CCGAGCCGCT CGGCACCGAG
CTCGTCGCGC CCGCGAAGCG GCGCCGCTCG TGGTGGACGG CCGGGCGCGC GGCGGCCGTC
GCGCTCGGCG TGCTGGTGCT CGCGTACGGC GCGATCAGCT TCTGGCTCGA TTCGCAGGGC
CGCCCGATCC GCAACGCGCT CGCCGCGTGG ATGCCGCCCA CGCGCACGAT CAACATCGCC
GAGACGCTGC CGCCGCCGCT GCCGCAGATT CTCACCGAAG GGTGGCTCAC CGCGTACAAG
CATCCGCAAG GATGGCTGCT CGTGTTCAAG AGCGACGGCG CGTTCGACGT CGGCAAGGCG
AACGTGCGGG CGGACTTCAT GCACAACATC GAGCGGCTCG GCCTCGCGTT CGCGCCGTGG
CCGGGCGACC TCGAGGTGAT CGGCCACACC GATTCGCGGC CGATCCGCAC GAGCGAGTTC
CCGGACAACC AGGCGCTGTC CGAAGCGCGG GCGCGCAACG TCGCCGACGA ACTGCGCAAG
ACCGCGCTGC CGGGCGGCGC GCGCGCGCCG GAGAACGCGG TGCAGCGCAA CATCGAGTAC
TCGGGGCGCG GCGACGCGCA GCCGATCGAC ACCGCGAAGA CGGCCGCCGC GTACGAGCGC
AACCGCCGCG TCGACGTGCT GTGGAAGGTG ATTCCCGACG GCGCGCAGCA ATCGGGCCGC
AGCCTGAACC TGCAGCAGCC GGAGAAGCCC GGGCAGGTGC CGATGCGTCC GGCGATGCCG
GAGGGCGTGG AGATCGCGCC TGACGGGCAA CTGCCGTATG CGACGTCAAC CACGATGCCA
GCAACGAGAC CGACCACGGA GGGCCGTCAG CCATGA
 
Protein sequence
MSLLRNLSST LRRVSMASNM LDRAAANPDL VTRVPTLSSL SSAAMTSSAV STTGTTNTVA 
SGGAAAGAPS FAPPAADAFP QADGAPRNPA VLQFPVPGGA PAPADARVAA PVVYSAQGEQ
AAIMKAGLQQ ASWNNPFVSH ALPAVLQLQR HLAAGPLNQA AIRTQLGLEV RLYRERLAAS
GCEWEQIRDA SYLLCTYLDE TVNDAAREHA QVVYDGERSL LVEFHDDAWG GEDAFADLSR
WMKTEPPPIP LLSFYELILS LGWQGRYRVL DRGDVLLQDL RSQLHALIWH HVPPEPLGTE
LVAPAKRRRS WWTAGRAAAV ALGVLVLAYG AISFWLDSQG RPIRNALAAW MPPTRTINIA
ETLPPPLPQI LTEGWLTAYK HPQGWLLVFK SDGAFDVGKA NVRADFMHNI ERLGLAFAPW
PGDLEVIGHT DSRPIRTSEF PDNQALSEAR ARNVADELRK TALPGGARAP ENAVQRNIEY
SGRGDAQPID TAKTAAAYER NRRVDVLWKV IPDGAQQSGR SLNLQQPEKP GQVPMRPAMP
EGVEIAPDGQ LPYATSTTMP ATRPTTEGRQ P