Gene BURPS668_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1154 
Symbol 
ID4882365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1132628 
End bp1133809 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content69% 
IMG OID640127082 
Productaminotransferase, class I/II 
Protein accessionYP_001058203 
Protein GI126439116 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCGA GCGATCTCAA GCCGCCCCGC TGGGCCCTGT CCGAACGCGC ACGCAAGCTC 
ACGAGCTCGG CGATCCGCGA AATCCTGAAA GTGACCGAGC GCCCCGAGGT GATCTCGTTC
GCGGGCGGCC TGCCCTCGCC CGCGACGTTC CCCGCCGAGC GCATGCGCGA GGCCGCGGAG
CGCGTGCTGC GCGATTCGCC CGCCGCCGCG CTGCAATACA GCGCGACGGA AGGCTTCCTG
CCGCTGCGCG AGTGGATTGC CGAGCGCTAC CGCGTGCGCA CGACGCAGGT GCTCGTCACG
ACGGGCTCGC AGCAGGCGCT CGATCTGCTC GGCAAGGTGC TGATCGATCC GGCGAGCCGC
GTGCTCGTGG AGACGCCGAC CTACCTCGGC GCGCTGCAGT CGTTCTCGCT GTACGAGCCG
ATCTACGCGC AGGTGCCGAC CGACGACGCG GGCCTGCTGC CCGAGGCGCT CACGCCCGAG
CTGACGAAGG ACGCGCGATT GTTGTATGCG CAGCCGAACT TCCAGAACCC GACGGGCCGC
CGCCTGAGCG TCGAGCGCCG CCGCGCGCTC GCCGCCTTCG CGCAGACGAG CCCGTTCCCG
GTGCTCGAGG ACGATCCGTA CGGCGCGCTG AACTACGCGG GCGAGCCGCT GCCGACGATG
CTGTCGATGG CGCCCGATCA CATCGTCCAT CTCGGCACGT TCTCGAAGGT GCTCGCGCCC
GGCCTGCGGA TCGGCTATAT CATTGCGCCC GAGGAGCTGC ACTTCAAGCT CGTGCAGGCC
AAGCAGGCAA CCGACCTGCA CACGCCGTCG CTCACGCAGC GCATCGCCCA CGAAGTCATC
CAGGACGGCT TCCTCGATGC GCACATCCCG ACGATCCGCA AGCTCTACGG CGCGCAGTGC
GAAGCGATGC TCGCGTCGCT CGCGCGGCAC ATGCCGCAAG GCGTGAGCTG GAACCGCCCG
GAAGGCGGGA TGTTCATCTG GGTGACGCTG CCCGCGCAGA TCGACAGCAT GCAGCTCCTC
GAGACGGCGG TCGCGAACAA CGTCGCGTTC GTGCCGGGCG CGCCGTTCTT CGCGAACGAC
GCGCAGAAGA ACACGCTGCG GCTGTCGTTC GTCACCGTGC CGCCGGAGAA GATCGAGGAA
GGCGTCGCGC GGCTCGGCAA GCTGTTGCGC GAGCGCCTGT GA
 
Protein sequence
MNPSDLKPPR WALSERARKL TSSAIREILK VTERPEVISF AGGLPSPATF PAERMREAAE 
RVLRDSPAAA LQYSATEGFL PLREWIAERY RVRTTQVLVT TGSQQALDLL GKVLIDPASR
VLVETPTYLG ALQSFSLYEP IYAQVPTDDA GLLPEALTPE LTKDARLLYA QPNFQNPTGR
RLSVERRRAL AAFAQTSPFP VLEDDPYGAL NYAGEPLPTM LSMAPDHIVH LGTFSKVLAP
GLRIGYIIAP EELHFKLVQA KQATDLHTPS LTQRIAHEVI QDGFLDAHIP TIRKLYGAQC
EAMLASLARH MPQGVSWNRP EGGMFIWVTL PAQIDSMQLL ETAVANNVAF VPGAPFFAND
AQKNTLRLSF VTVPPEKIEE GVARLGKLLR ERL