Gene BURPS1106A_A2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2022 
Symbol 
ID4904348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1988222 
End bp1990093 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content72% 
IMG OID640145127 
ProductImpA-related N-terminal family protein 
Protein accessionYP_001076055 
Protein GI126457583 
COG category[S] Function unknown 
COG ID[COG3515] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03362] type VI secretion-associated protein, VC_A0119 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.975565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATGA GCGAACGGCG CCCGCCCGGC GGCGCGGCGG CGCGCGCGCG CATGCCGATC 
GATGTCGAAG CGCTCGCCGT GCTGGGGCGC ACGGACATCG ATTCCGCCAT GCCCGCGGGC
GCCGACGTGC GCGCCGACGC GAGGTTCGAC GCGCTGCACG CGGAGCTTGC GAAGCTCGCG
TCGCCGGGCG CGAGCGGGCA AGTCGATTGG CGCGCGGCGA CGCATCTCGC CGCCGAATTG
CTGCGCGAGC GCGGCAAGGA TTTGCTCGTC GGCTGCTATC TGGCGGGTGC GTTGCTGCAG
ACGGGCGGCG CGGCGGGGCT GCGCTGCGGA CTCGAAATCG TCGGCGATCT CGTCGAACGT
CATTGGGATG CGATGTCGCC GCCCGTGTCG CGGATGCGGG CGAGGCGCGG GGCGCTGCAA
TGGCTGGTCG ATCGCGTCGA CGCCATGCAC GATGCAGGAG CCGCCGCATG CGGCGGCGCG
TGCTCGGCCG AACTGGTCGC GCAATTGCGC GCGGCCGCGC GGCGCATCGA TGCGCTGCTC
GCCGAGCGCG ACGACGACGC GCCGACGATG CGCGCGGTGC ATGCGTTCGC GGAGCGATTG
CCGGTTGAGG TGGTTGAGGT GGTGGAAGTG GCTGACGAGG CTGATGTGGC TGAGGCGACT
GAGGCGACTG AGGCGACTGA GGCGACTGAG GCGACTGAGG CGGCTGATGT GGCTGAGACG
GCTGAGACGG CCGAGGCCGA TGCGCATGGC TCGACGGGAG GGCCGGCCGC GGAAATCGCG
ATTGCCGCTG CCGAACAGGC TTTGATTGAT CCGGCCGGTC GAGCCGCGCC GAGCGCAGGC
ACGGATACGA ACGCGAACGC AGACGCCGCC AGGCAACCGG CGCGGCTCGA CGAAGCGGCC
GGCCGCGAAC GCGCGCTCGC CGATGCGCTC GCGCAACTGC ATTGCGTCGC GACGGCGTTC
GCGCAAGCGG ACTGGGCCGA CGCGCGCGGC TTCCGGCTGC GCCGCGTCGC GTGCTGGTCG
AGCGTGTGCG CGCTGCCGGA AACGGACGCG GAGAACGGAA GAACGCGGAT CGCCGCGCCG
AGCGCTTCGA TCGTCGGCGC GGCGAAGAAC ATCGACGGGG ATGGCGAGCC TGTGGCGGCG
GTGCGCTTCG CCGAAGCGCA TGCGCAGGCG TTCCCGCTCT GGCTGGATTT GCAGCGCATC
GCCGCGCGCG CGCTCGCGCG CGCGGGGGGC GACGGCGCCG ATGCGCGGCG CGAAGTGGAG
ACGGCGGTTC GTGCGCTGCT TGCGCGGCTG CCGGGCCTCG ACGCGCTGAC GTTCGCGGAC
GGCACGCCGT TCGCCGACGA CGCGACGCGC GCATGGCTCG GCGAGCTTGG CGCGCCTGTT
GTGGCGGCGG ATGCGGTGTC GCCGTCGTCT TTGCCGCTTT CGCCGCGACC TTCGCCGCCT
GAGCGATCGT CGCCGATGGC GGGCGAACCG GCGCGCGCGC CGGGCGATGC GTGCGGGGCG
AGCGCCGACG ATGCAGTGGA CCGAGCGTGC GCGTTTGCCG CGAGCGGCCA GCTCGATCTC
GCGCTCCACG CGATTCAGCA TGCGATCGAT CGTGCGACGA GCGCCGAACA GCGGTTGAGA
GCGCGCGTGC GGTTGTGCGA GCTTGCGCGC GACCATTGGC CGCATGAGGT TCCTGAGGCG
TTCGCGCGCG GCGTGATCGA ACCGATTCGG CGGCACGATT TGCTCGCATG GAATCCGGAG
CTGGCGCTCG ACGGCTTGTC GGCCGCCTAT GCGCTGCTGA TTCGGCGCGA TCGCGAATCG
GCGCACGCGA GGACGGTGCT TGACGAGATC GCGAGCGTCG ACGCGGCGCG GGCCATGCGT
TTGTCGACGT GA
 
Protein sequence
MGMSERRPPG GAAARARMPI DVEALAVLGR TDIDSAMPAG ADVRADARFD ALHAELAKLA 
SPGASGQVDW RAATHLAAEL LRERGKDLLV GCYLAGALLQ TGGAAGLRCG LEIVGDLVER
HWDAMSPPVS RMRARRGALQ WLVDRVDAMH DAGAAACGGA CSAELVAQLR AAARRIDALL
AERDDDAPTM RAVHAFAERL PVEVVEVVEV ADEADVAEAT EATEATEATE ATEAADVAET
AETAEADAHG STGGPAAEIA IAAAEQALID PAGRAAPSAG TDTNANADAA RQPARLDEAA
GRERALADAL AQLHCVATAF AQADWADARG FRLRRVACWS SVCALPETDA ENGRTRIAAP
SASIVGAAKN IDGDGEPVAA VRFAEAHAQA FPLWLDLQRI AARALARAGG DGADARREVE
TAVRALLARL PGLDALTFAD GTPFADDATR AWLGELGAPV VAADAVSPSS LPLSPRPSPP
ERSSPMAGEP ARAPGDACGA SADDAVDRAC AFAASGQLDL ALHAIQHAID RATSAEQRLR
ARVRLCELAR DHWPHEVPEA FARGVIEPIR RHDLLAWNPE LALDGLSAAY ALLIRRDRES
AHARTVLDEI ASVDAARAMR LST