Gene BURPS1106A_A2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2086 
Symbol 
ID4904522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2049868 
End bp2050989 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content74% 
IMG OID640145191 
Producttype III secretion regulator YopN/LcrE/InvE/MxiC 
Protein accessionYP_001076119 
Protein GI126455484 
COG category 
COG ID 
TIGRFAM ID[TIGR02511] type III secretion effector delivery regulator, TyeA family
[TIGR02568] type III secretion regulator YopN/LcrE/InvE/MxiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.350134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCGA TCATCGGCGG CGCGTCGGCC GCGCGGCGCG GCTTTTCGAT CGACGGCACG 
GGGAGCGCGG CGAACCGGCT CGACGCGGAG CCGTCGCTCG ACGACGCGCC GCAGACGGGG
GCGGCGGGCG CGGCCGACGT GCAGGCGCAA CTCGCCGGCG TCGACGAGGA AGCCGCGAAC
GCCGCCGCGC AATTCGGCCG GTTCCGCGCG TCGGAGCGCA AGGGGCGGCG CAGCGACGAG
CTCGAACGGA TTCTCGACAC GGACGCCGAC GAGAAGCTCG ACGAGCTCGC CGCGCTGCTC
GGCGGCCGCG CGGCGCGCGG CCGCGCGGAC CTCGCGACGC TGCTGCGCGA CGCGCGCGAG
CGCTTTCGCG ACGAGAGCGA TCTGTTGCTC GCGCTGCGCG AGCTGCGCCG GCGGCGCCGG
CTCGACGGCG AATCCGTCGA CGCGCTCGAG CGCGCGATCG ACGAACTGCT CGCGGGCGAC
GGCGCCAAGC GGATCAAGGC GGGCATCAAC GCGGCGCTCA AGGCGAAGGT GTTCGGCGCG
CGGATGCAGC TCGATGCGCG CCGGCTGCGC GAGCTGTACC GGCAGTTCCT CGAGTTCGAC
GGCTCGCACC TCGTCATCTA CGAAGACTGG ATCGAGCAGT TCGGCGCGAG CCGCCGCAAG
CGGATTCTCG ACTACGTGAG CGCCGCGCTG TCGTACGACA TGCAGTCGCA CGATCCGAGC
TGCGGGTGCG CGGCCGAGTT CGGCCCGCTG CTCGGCACGC TGCATCGCGC GCGCATGCTC
GCGTCGGCCG ACGAGCAGTT CGTCGGCCGG CTGCTCGACG ACGCGCTCGC GCGCGATTGC
GGGCTCACCG AGGCGCGCGC GCTCGCGACG ATGCTGGGCG GCCTGCAACG GCCGTTCTCG
GTCGCCGACG TGCTGCTGGG CACGCTCGGC GATCTGCTCG AGCCGCTCGC GCCCGCCCGT
CGCTCGCAGT TGTTGCAGCT CGCGCTGCGC GCGTTCGCGG GCGTGCCGAT CGCGCTCTAC
GGCGACGCCG ACGCGCGCCG CGCGGCGCTC GGCGCGCTCG AGGAACTGAT CGGCGCGACG
TATGCGCGCG AGCGGCGGCA GGCGCGCCCG CGCGCCGACT GA
 
Protein sequence
MSSIIGGASA ARRGFSIDGT GSAANRLDAE PSLDDAPQTG AAGAADVQAQ LAGVDEEAAN 
AAAQFGRFRA SERKGRRSDE LERILDTDAD EKLDELAALL GGRAARGRAD LATLLRDARE
RFRDESDLLL ALRELRRRRR LDGESVDALE RAIDELLAGD GAKRIKAGIN AALKAKVFGA
RMQLDARRLR ELYRQFLEFD GSHLVIYEDW IEQFGASRRK RILDYVSAAL SYDMQSHDPS
CGCAAEFGPL LGTLHRARML ASADEQFVGR LLDDALARDC GLTEARALAT MLGGLQRPFS
VADVLLGTLG DLLEPLAPAR RSQLLQLALR AFAGVPIALY GDADARRAAL GALEELIGAT
YARERRQARP RAD