Gene BURPS668_A0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0121 
Symbol 
ID4886964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp102248 
End bp103252 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content50% 
IMG OID640130062 
Productputative transcriptional regulator 
Protein accessionYP_001061127 
Protein GI126444440 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0425295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTCA CTCGTACTGC GGCGCAGCGT CCTGCGCCGT TGCATACTAT TTTCATTACG 
CTTGAGGTAA TGAAGCGTAT TGGATTGTCC AGAGAGACTC TGCTAGAAGG AACTGGCATT
TCACCTTCCG ACGTCGAACG TCCTAACGCG ATGGTGACGC ATGCTCAGGA GATGGTCCTA
TTTGCCAATG CGCTGAAAGC TACCGGAAAT TCCGCGATTG GCCTTCACAT CGGCAACGAG
ATACCGGTGA CGGCGTACGG CCTTCGGGCT CACGCAATGC TTGTTAGTCC CACGCTTGGC
GATGCGATGC GCCTGGCGTT CGAACACCCG TTGCTAGGGA TTAGCTATTT TCGGATGAAG
TTGTTGATCG ATGGCGAAGT GGCGCGAATT ATCGTCGGCG GCTACACGTA TCGCCACGAC
TTACTTGTCA TCAACACTGA TATGTGCTTT ACAGCGGTCC GCCGACAGAT GATTGATTTG
ATAGGAAGAT CGCCGCATTT TACGCGCATC GGATTTGCTT ATCCAAAGCC CTCTCATGCA
GATAGCTACT CTGAATACTT CTCATGCCCG GCGATATTTG ATTCAGAAAT AAATTTTCTA
GAGTTTGATG CTTCAATTTT AATGACGAAA TTGCCTCTCG CCCACCCCAT TGAATATGAA
GTGGCGAGAA AAGCTTGCGA GAAACGGGAG TTTGAATTGG CCCATTGGAT GCCGACTGAC
CTCGTTGGGC AGCTACTCTG CATGATGTAT GAAAATCCGA TCTCCCAAGA TGTTTTTGAG
TTCGCCGATC GGCTTGGGAT GTCATTGCGC AGTTTGCAGC GCAAGCTTAA AGATATGGGG
ACATCATTCA GTGCTTTGCA AGACCTTGTG CGCCAGGATA TCGCGTCAAA ATACCTCACT
GAAAAGCACT ATACCGCCAA AGAAATAGCG GCTCGCCTTG GATACAAAAA TGCTTCTGCG
TTTAGCCGTG CGATGAGGCG CTGGTCAAAA ATTTCTGTCG ACTAA
 
Protein sequence
MAVTRTAAQR PAPLHTIFIT LEVMKRIGLS RETLLEGTGI SPSDVERPNA MVTHAQEMVL 
FANALKATGN SAIGLHIGNE IPVTAYGLRA HAMLVSPTLG DAMRLAFEHP LLGISYFRMK
LLIDGEVARI IVGGYTYRHD LLVINTDMCF TAVRRQMIDL IGRSPHFTRI GFAYPKPSHA
DSYSEYFSCP AIFDSEINFL EFDASILMTK LPLAHPIEYE VARKACEKRE FELAHWMPTD
LVGQLLCMMY ENPISQDVFE FADRLGMSLR SLQRKLKDMG TSFSALQDLV RQDIASKYLT
EKHYTAKEIA ARLGYKNASA FSRAMRRWSK ISVD