Gene BURPS668_A0420 details

Gene Information

Locus tag: BURPS668_A0420
Symbol:
ID: 4885969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 385532
End bp: 386563
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 640130361
Product: transcriptional regulator
Protein accession: YP_001061426
Protein GI: 126445047
COG category: [K] Transcription
COG ID: [COG0583] Transcriptional regulator
TIGRFAM ID:
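
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 385532
end_bp = 386563
gene_length_bp = 1032
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1032 344 343
```

Under these assumptions the reported gene length (1032 bp) and protein length (343 aa) agree with the coordinates.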


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGGCA TCCATGAATC CGGTAACGAT GCCGCGCAGT GTGCCCGCTT TACTTTTGCT 
CAAAAAGTAG CCTATAAACG AAATTATTTT GATTTCAAAT CAACAATGAA GGTCACGCTA
GACGAACTTC AGGCCTTCGC GGCCGTGGTC GACACGGGTT CGATCACCGC GGCCGCGCAA
CAGCTCGGCC TCACCGTGTC GGCGACGAGC CGCACGCTCG CGCGGCTCGA GGGCAAGCTC
AAGACCACGC TGCTGCGCCG GACCACGCGC CGCCTCGAGC TGACCGAGGA GGGCCGGACG
TTCCTCAACA GCGCGCGGGC AATCATCGAT TCGGTCGAAA GCGCGGAAGA GCAGATGCTC
GCGCGGCGCG AGAAGCCGTC CGGCCGGCTG CGGGTCGACG CCGCGTCGCC GTTCATGCTG
CATGTGATCG TGCCGCTCGT GCGCGGCTAT CGGGAGCGCT TCCCGCGCGT GGAGCTGGAG
CTGAACAGTA ACGAGGGCGT CATCGATCTG CTCGAGCGGC GCACCGACGT CGCGATCCGG
ATCGGCCGCC TGAAGGATTC GACGCTGCAT AGCCGGCTCA TCGGCAATAG CCGGCTGCGC
ATCCTCGCGA GCCCCGCGTA TCTCGACGCG CACGGCCAGC CGCGCAAGGC CGGCGATCTC
GGCAAGCATG CGCTGCTCGG CTTCAATCAG CCGGAATCGC TGAACGTGTG GCCGATCCTC
GGCGCGGACG GCGAGCCTTG CCGGATCGAG CCGGCCGTGT GGTCGTCGAG CGGCGAGACG
CTCAGACAGC TCGCGCTCGA CGGCGCGGGC ATCGTCTGCC TGTCGGATTT CATGACCGCG
CAGGATCGCG AAGCGGGCCG CCTCGTGCAG ATCCTCGCGC GCCACACGCA AGACGTGCGG
CAGCCGATTC ATGCGGTCTA TTACCGGAAC ACGGCGATTT CGTCGCGAAT CGCGTCATTC
GTCGATTATC TGGTCGACGC GCTCGGCGGC GGGAATGCCG CGCAAAAGGC GGCGGCATGG
ACGCGTCCGT GA
 
Protein sequence
MMGIHESGND AAQCARFTFA QKVAYKRNYF DFKSTMKVTL DELQAFAAVV DTGSITAAAQ 
QLGLTVSATS RTLARLEGKL KTTLLRRTTR RLELTEEGRT FLNSARAIID SVESAEEQML
ARREKPSGRL RVDAASPFML HVIVPLVRGY RERFPRVELE LNSNEGVIDL LERRTDVAIR
IGRLKDSTLH SRLIGNSRLR ILASPAYLDA HGQPRKAGDL GKHALLGFNQ PESLNVWPIL
GADGEPCRIE PAVWSSSGET LRQLALDGAG IVCLSDFMTA QDREAGRLVQ ILARHTQDVR
QPIHAVYYRN TAISSRIASF VDYLVDALGG GNAAQKAAAW TRP
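
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard set of assignments, which translation table 11 shares for elongation codons. Only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGATGGGCA TCCATGAATC CGGTAACGAT GCCGCGCAGT GTGCCCGCTT TACTTTTGCT"
print(translate(first_line))  # MMGIHESGNDAAQCARFTFA
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.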