Gene BURPS668_A2148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2148 
Symbol 
ID4886833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2085732 
End bp2086787 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content71% 
IMG OID640132085 
ProductAraC-type DNA-binding domain-containing proteins 
Protein accessionYP_001063142 
Protein GI126443771 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGA TGCACGCGCC CCGCGGCGGC ACGCTGCTGC AATTCTTTTC GACCGACGAC 
ATGCCGCTCG CGCGCGCGGC GGCGTTCTGG AGCGCGCACG TGTTCCAGTG CGAGGACGTG
CGCGCGTCGT CCGCGCGCGC GTTTCACGGG CACGGCTTTC TGTGCCGCTG CGAGCGCGGC
CGGTTCGTTC GCTTTCGCGG CGCGTCGCTC GACACGCGCA TCGGCGCGGC GTGGCTGAGC
GCCGCGCCAG CCGATGCGTA CGTGACGATC TGCGCGCTGC ATGCGGGCGA GTGCACGGTC
GAAGCGCCCG GCTTGCCGGA TGTGCGCTTT CGTGCGAACG AGCTGTTCAT GCTGGACGGC
GGGCAGCCGA TGCGCGTGCG CTGGAGCGAG CCGTGTTTCA GCGCGCTCAG GCTGCCGCGC
GCATCGGTGG GGCGCACGCT CGGCCAGGCG GCGATGGACG CGTCGCCGGG CGCGGCTTCG
TTGCAGGAGG CGCGGCTCGC GCCGTTTCTC GCGGCGGAGC TCGCGCTGAT CGGCGGTCGC
GGCCCGACGC TGTCGTCGGA CGAGCTCGAT TACATGCTCG CGCGCGCAGC CGAGCTCGGC
CGCACGCTGC TTCAGGCGGC GCTGTCGTCG CGCGCGCGGC GCGGCGCGCC CGCGCGCGCC
GACCGGCTGC AGGCCGCGTA TCGCTACATC GAGCAGCATC TTCATCTGCC GACGCTCACA
CCCGAGCGGA TCGCCGACGC GATCCATTGC TCGCGCACGC AGCTCTATCG GCTGTTCCGC
CATGAATCGC AGACGGTGAA GGCGGCGCTG CGCGACGCGC GGCTGAACCG CAGCCTCGGC
TATCTCGAGC AGCCCGAGGT TACGCTCAGC ATCGGCGAGA TCGCGCATGC TTGCGGTTTT
CCCGATCAGT CGACGTTCGG CAAGCTGTTT CGCCGGCGCT TCGGCAGAAC GCCCGGCGAG
GTGCGCCGCG CCGCGCGGGG GCGTTGCAAT GAAACCGTGT TGCCCGACTG CGCGGAAAGC
GGCGACGCGG CGGATGTGCA AACGCCGCGG CGGTGA
 
Protein sequence
MAKMHAPRGG TLLQFFSTDD MPLARAAAFW SAHVFQCEDV RASSARAFHG HGFLCRCERG 
RFVRFRGASL DTRIGAAWLS AAPADAYVTI CALHAGECTV EAPGLPDVRF RANELFMLDG
GQPMRVRWSE PCFSALRLPR ASVGRTLGQA AMDASPGAAS LQEARLAPFL AAELALIGGR
GPTLSSDELD YMLARAAELG RTLLQAALSS RARRGAPARA DRLQAAYRYI EQHLHLPTLT
PERIADAIHC SRTQLYRLFR HESQTVKAAL RDARLNRSLG YLEQPEVTLS IGEIAHACGF
PDQSTFGKLF RRRFGRTPGE VRRAARGRCN ETVLPDCAES GDAADVQTPR R