Gene BURPS1106A_1163 details

Gene Information

Locus tag: BURPS1106A_1163
Symbol:
ID: 4900968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1142932
End bp: 1144113
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 69%
IMG OID: 640134393
Product: aminotransferase, class I/II
Protein accession: YP_001065442
Protein GI: 126455457
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
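
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1142932
end_bp = 1144113

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```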


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.488636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCGA GCGATCTCAA GCCGCCCCGC TGGGCCCTGT CCGAACGCGC ACGCAAGCTC 
ACGAGCTCGG CGATCCGCGA AATCCTGAAA GTGACCGAGC GCCCCGAGGT GATCTCGTTC
GCGGGCGGCC TGCCCTCGCC CGCGACGTTC CCCGCCGAGC GCATGCGCGA GGCCGCGGAG
CGCGTGCTGC GCGATTCGCC CGCCGCCGCG CTGCAATACA GCGCGACGGA AGGCTTCCTG
CCGCTGCGCG AGTGGATTGC CGAGCGCTAC CGCGTGCGCA CGACGCAGGT GCTCGTCACG
ACGGGCTCGC AGCAGGCGCT CGATCTGCTC GGCAAGGTGC TGATCGATCC GGCGAGCCGC
GTGCTCGTGG AGACGCCGAC CTACCTCGGC GCGCTGCAGT CGTTCTCGCT GTACGAGCCG
ATCTACGCGC AGGTGCCGAC CGACGACGCG GGCCTGCTGC CCGAAGCGCT CACGCCCGAG
CTGACGAAGG ACGCGCGATT GTTGTATGCG CAACCGAACT TCCAGAACCC GACGGGCCGC
CGCCTGAGCG TCGAGCGCCG CCGCGCGCTC GCCGCCTTCG CGCGGACGAG CCCGTTCCCG
GTGCTCGAGG ACGATCCGTA CGGCGCGCTG AACTACGCGG GCGAGCCGCT GCCGACGATG
CTGTCGATGG CGCCCGATCA CATCGTCCAT CTCGGCACGT TCTCGAAGGT GCTCGCACCC
GGCCTGCGGA TCGGCTATAT CATTGCGCCC GAGGAGCTGC ACTTCAAGCT CGTGCAGGCC
AAGCAGGCGA CCGACCTGCA CACGCCGTCG CTCACGCAGC GCATCGCCCA CGAAGTCATC
CAGGACGGCT TCCTCGATGC GCACATCCCG ACGATCCGCA AGCTCTACGG CGCGCAGTGC
GAAGCGATGC TCGCGTCGCT CGCGCGGCAC ATGCCGCAAG GCGTGAGCTG GAACCGCCCG
GAAGGCGGGA TGTTCATCTG GGTGACGCTG CCCGCGCAGA TCGACAGCAT GCAGCTCCTC
GAGACGGCGG TCGCGAACAA CGTCGCGTTC GTGCCGGGCG CGCCGTTCTT CGCGAACGAC
GCGCAGAAGA ACACGCTGCG GCTGTCGTTC GTCACCGTGC CGCCGGAGAA GATCGAGGAA
GGCGTCGCGC GGCTCGGCAA GCTGTTGCGC GAGCGCCTGT GA
 
Protein sequence
MNPSDLKPPR WALSERARKL TSSAIREILK VTERPEVISF AGGLPSPATF PAERMREAAE 
RVLRDSPAAA LQYSATEGFL PLREWIAERY RVRTTQVLVT TGSQQALDLL GKVLIDPASR
VLVETPTYLG ALQSFSLYEP IYAQVPTDDA GLLPEALTPE LTKDARLLYA QPNFQNPTGR
RLSVERRRAL AAFARTSPFP VLEDDPYGAL NYAGEPLPTM LSMAPDHIVH LGTFSKVLAP
GLRIGYIIAP EELHFKLVQA KQATDLHTPS LTQRIAHEVI QDGFLDAHIP TIRKLYGAQC
EAMLASLARH MPQGVSWNRP EGGMFIWVTL PAQIDSMQLL ETAVANNVAF VPGAPFFAND
AQKNTLRLSF VTVPPEKIEE GVARLGKLLR ERL
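
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 60 nucleotides of the gene sequence and prints the result for comparison with the start of the protein sequence. It builds the codon table by hand instead of relying on a library. For sense codons, translation table 11 uses the same amino acid assignments as the standard code, so the table is written in the conventional TCAG ordering. The `translate` helper is a hypothetical name introduced for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# (bacterial, archaeal and plant plastid) shares these codon-to-amino-acid
# assignments with the standard code; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from this record (60 nt, spaces kept as shown).
first_line = "ATGAATCCGA GCGATCTCAA GCCGCCCCGC TGGGCCCTGT CCGAACGCGC ACGCAAGCTC"

peptide = translate(first_line)
print(peptide)  # compare with the first 20 residues of the protein sequence
```

Passing the full 1182 bp gene sequence to the same function should reproduce the complete protein sequence, under the same assumptions.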