Gene BURPS668_A1497 details


Gene Information

Locus tag: BURPS668_A1497
Symbol:
ID: 4888908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1440838
End bp: 1441851
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 71%
IMG OID: 640131436
Product: AraC family transcription regulator
Protein accession: YP_001062493
Protein GI: 126443160
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
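
The coordinates and lengths listed in this record can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not say so, but the assumption reproduces the listed gene length) and that the last codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1440838
end_bp = 1441851

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1014 bp) and Protein Length (337 aa), and the zero remainder shows the length is a whole number of codons.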


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.141505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGAGC GCAGCGATCG CCTCGATTTC TACATTCGCG ACGAGGCCGC CCGCCGGGCG 
ATCACCGAGC CGCACCGGCA TGCGTACTTC CAGATCCAGT TCAACCTCGG CGGCGACACC
GAGCAGCGCA TCGGCGGGTT CACGCGAGCG TTCCCGCGCG GCGCGCTCGC GTTCGTGCTG
CCGTACCGCG AGCACCTGAT CGCGCATCCG CCGGGCGCGC ACTTCGTCGT GATCAATTTC
TCGCAAACGT TCCTGCGCGC CGATCTCGAC GTCGATCCGC TCGATCTCGA GGATGTCTGC
GCGCAGCGCG CGCCCGAGCT TGCGCCGTTT CGCTTCCAGG AGCATCTGGA CTTCATCCTG
ACCGGCGCGG CATTCGACGA CGCGCGCCGC CTCGCGCAGC GGATGCTAGA AGCCAACCGC
GCGCGCACGT TCGGCTCGGT GCCGCTGCTG CGCGGCTATC TGCTGCAGTT GATCGGGCTC
GTCTGCACAC AATACGCGGG GCCGCTCACG AAGCTCGCCC AGAGCGGCGC GCACCGCACG
GGCCGCCGCG ACGCGTTCGC GCGCGTGCTG CGCCACGTCC GCGCGAACCT GACGAACGAC
GCGCTCACGC TCGCGGGCAC CGCGCGCGCG GCGTGCCTGT CGCCGAACTA CCTCGCGCAC
CTGATCCGCA AGGAGACGGG CAGCACGTTC ACCGATCTCG TCACCGCGCG GCGGATCGCG
CTTGCCCAAT CGCTGCTCGC GCATACGACG CGGCGCATCG CCGACATCGC GCACGCGGTC
GGGTTTCGCG ACGAGGGCTA TTTCTCGCGG CGCTTTCGCG CGTGCGTCGG CGTATCGCCG
AAGGAGTATC GCGACGCGAA CGGCGCGCCC GGCCCGGCCG ATGCGCTCGA TTCGGCCGAT
GCGCTCGATT CGGTCGATAC GGCTGGGCCG CGCGCCGCGC CCGGGCGCGG CGAAACGCGC
GGCGCGGCCG GCGCGAAGAG CCCGGCGCGC GCGGCCGCGA AGCCGCGCGC GTAG
 
Protein sequence
MPERSDRLDF YIRDEAARRA ITEPHRHAYF QIQFNLGGDT EQRIGGFTRA FPRGALAFVL 
PYREHLIAHP PGAHFVVINF SQTFLRADLD VDPLDLEDVC AQRAPELAPF RFQEHLDFIL
TGAAFDDARR LAQRMLEANR ARTFGSVPLL RGYLLQLIGL VCTQYAGPLT KLAQSGAHRT
GRRDAFARVL RHVRANLTND ALTLAGTARA ACLSPNYLAH LIRKETGSTF TDLVTARRIA
LAQSLLAHTT RRIADIAHAV GFRDEGYFSR RFRACVGVSP KEYRDANGAP GPADALDSAD
ALDSVDTAGP RAAPGRGETR GAAGAKSPAR AAAKPRA
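
The relationship between the gene sequence and the protein sequence can be reproduced with a short Python sketch. It assumes the sequences are pasted in the 10-character blocks used in this record, and it uses the codon-to-amino-acid assignments of the standard genetic code, which are the same for translation table 11 (table 11 differs in which start codons it allows). The helper names `clean`, `gc_fraction` and `translate` are hypothetical and do not belong to any library. For brevity only the first three blocks of the gene sequence are used.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G). The amino acid
# assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGCCCGAGC GCAGCGATCG CCTCGATTTC"
print(translate(first_blocks))  # MPERSDRLDF
```

The printed peptide matches the first block of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence and the listed GC content.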