Gene BURPS1106A_A2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2494 
Symbol 
ID4904148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2457699 
End bp2458967 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content73% 
IMG OID640145598 
Productsigma-70 family RNA polymerase sigma factor 
Protein accessionYP_001076525 
Protein GI126456650 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.500327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATG CCGCCGTTCA CCGTGCGATC GATGCCGTCT GGCGGATCGA GGCCGCGAGA 
ATCATCGCGC ACGTCGCGCG GCTCGTGCGC GACGTCGGCG TGGCCGAAGA GCTCGCGCAG
GACGCGCTCG TCGCGGCGCT CGAGCACTGG CCGAGCGGCG GCGTGCCGGA CAATCCGGGC
GCATGGCTGA TGACGGCCGC GAAGCGCCGC GCGCTCGATC ATCTGCGGCA GCACGCGCTG
CACGCGCGCA AGCGCGAGCA GATCGGCCTC GATCTCGATG CGCTCGGCGC GCACGTCGCG
CCTGACGTCG CCGACGTGTT CGAAGCGGCG CGCGACGACG ACATCGGCGA CGATCTGCTG
CGGCTCGTGT TCACCGCTTG TCATCCGGTG CTGTCGACCG ACGCGCGCGT CGCGCTGACG
CTGCGGCTGC TCGGGGGGCT GACGACGGGC GAGATCGCGC GCGCGTTTCT CACGCCGGAG
CCGACGATCG CGCAGCGGAT CGTGCGCGCG AAGCGCACGC TATCGGCGGC GAAGGTGCCG
TTCGAGGTGC CGCGCGCACC GGAGCGCGCG GCGCGGCTGG CATCGGTGCT CGAAGTGATC
TATCTGGTTT TCAACGAAGG CTATTCGGCG ACGGCGGGCG ACGACTGGAT GCGCCCCGCG
CTGACCGACG AGGCGCTTCG GCTCGGGCGC GTGCTCGCCG GGCTCGCGCC CGACGAGAGC
GAGGTGCACG GGCTTGTCGC GCTGATGGAG ATCCAGGCGT CGCGGATGCA TGCGCGGGTC
GATGCGCAGG GCCGCCCCGT GCTGCTGCTC GATCAGGATC GCAGCCGCTG GGATCCGCTG
CTGATCCGGC GCGGGCTCGC CGCGCTCGCA CGCTCGGAGG CGCTCGGCGG CGCGAGCGGG
CCCTATGCGC TGCAGGCGGC GCTCGCCGCA TGTCATGCGC GTGCGCGCCA TGCCGACGAT
ACCGACTGGG AGCAGATCGT CGCGCTCTAC GACGCGCTCG CGCAGGTCGC GCCCTCGCCC
GTCGTCGAGC TGAATCGCGC GGTTGCGGTC GGCATGGCGT TCGGGCCAGC GGCGGGGCTC
GAGATCGTCG ACGCGCTCGC GGCCGACCCG GCGCTCGCGC GCTATCACTG GCTGCCGAGC
GTGCGCGGCG ATCTGCTCGC GAAGCTCGGG CGGCGCGCCG AGGCGCAGGC CGAATTCCAG
CGCGCGGCCG ACATGACGCT CAATGCGCGC GAGCGTGAGA TGCTGCTTGC GCGCGCGACG
CAGCGGTGA
 
Protein sequence
MMDAAVHRAI DAVWRIEAAR IIAHVARLVR DVGVAEELAQ DALVAALEHW PSGGVPDNPG 
AWLMTAAKRR ALDHLRQHAL HARKREQIGL DLDALGAHVA PDVADVFEAA RDDDIGDDLL
RLVFTACHPV LSTDARVALT LRLLGGLTTG EIARAFLTPE PTIAQRIVRA KRTLSAAKVP
FEVPRAPERA ARLASVLEVI YLVFNEGYSA TAGDDWMRPA LTDEALRLGR VLAGLAPDES
EVHGLVALME IQASRMHARV DAQGRPVLLL DQDRSRWDPL LIRRGLAALA RSEALGGASG
PYALQAALAA CHARARHADD TDWEQIVALY DALAQVAPSP VVELNRAVAV GMAFGPAAGL
EIVDALAADP ALARYHWLPS VRGDLLAKLG RRAEAQAEFQ RAADMTLNAR EREMLLARAT
QR