Gene BURPS1106A_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2254 
Symbol 
ID4902428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2240211 
End bp2241959 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content70% 
IMG OID640135483 
Producthypothetical protein 
Protein accessionYP_001066518 
Protein GI126454480 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0437004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTG TCGTTCGCCT TACCGCTTCC GCCACCCGCG CGCTGCCGCG CTGGCTGCTG 
CTCACGCTCT GCATCGTCTA CGCGGCGTTC GGGCTGTTCG GCCGCGATCC GTGGAAGAAC
GAGGACGCGG CAGGCTTCGG CGTGATGTGG ACGATGGCGC AAGGCGGCCT GCACGACTGG
CTGCTGCCCA ATCTCGTCGG CAAATTCGTC ACGTCCGACG GGCCGCTCGG CTACTGGCTC
GGCGGCCTCG CGATTCGCGC GCTGCCGTGG GTCGACGCGA GCAACGCGTC GCGCGTCTAC
ACGGGTGTGC TGTTCTGCGT CGCGTGCGCA TTCGTCTGGT ACGCGGCCTA TCTGCTCGGC
CGGCGCGCCG AGATCCAGCC GTTCAAGTAC GCGTTCGGCG GCGAGCCCGA GCCGCGCGAC
TACGGGCGCA CGCTCGCCGA CGGCGCGCTG CTCGTGCTGC TCGCGTGCTT CGGCCTTGCC
GAGCGCGGCC ACGAAACGAC GCCGCAGCTC GCGCAGTTCG CATGCATCGC GACGTTCGTC
TACGGACTCG TGCGCGCGAT CGACAAGCCG ACGCAAGGCG CGCTCTGGTG GGGCCTCGCG
CTCGGCCTCG TCGCGCTGTC GGGCAACCCG GTGCTCGTCG CCGCGCTCGC GCTCGGCACG
CTCGCGCTCT ATCTCGTCAC GCCCGAGATC CGCTGCGTGC AACTGCCGGC GATCGGGCTG
CCGCTCGCCG TGGCCGTGTT CGCGATCTGG CCGCTCGCCG CGTACATCGC GTTTCCCGAC
GACGCGAACT GGTTCTTCAA CCAATGGCTG CACGGCAGCC TGATGCGCTT CTCCGGCCCG
CCCACGACGG TGCTCGCGTA CGCGGCGAAG AACCTGCCGC TCTTCACGTG GCCCGCGTGG
CCGCTCGCGA TCTGGGCATG GGTGAGCTGG GCGGGGCTGC GCCGCCGGCC GCACATCGCG
ATTCCGCTGT CGGTCGCCGC GCCGCTCCTC GCGCTCGTGA TCCTGCAGAG CCAGCAGACG
AACCGGATGT ACATGCTGCT GCTGCCCGCC CTCGCCGTCA TCGCGACGTT CGCGCTGCCG
ACGCTCAAGC GCGGCGCGAT CAACGCGATC GACTGGTTCG CGGTGCTGAG CTTCACGATC
CTCGGCACGT TCGTGTGGCT CGTGTGGCTC GCGTCGCTCA CGGGCTTCCC GCATCCGCTC
GCGCGCAACC TCGGCCGCCT GGTGCCGGGC TACGAGCCGC ACTTCAAGGT GCTGTCGTTC
GTGTGCGCGG TCGCCGCGAC CGCATGCTGG CTGATGCTCG TGCGCTGGCG CATCTCGCGG
CAGCCGAAGG TGCTCTGGCG CAGCGTGGTG CTGTCGAGCG CCGGCACGAC GCTGATGTGG
GTGCTGCTGA TGACGCTGTG GCTGCCGATC GTCAATTACA GCCGGACCTA TCGCGACGTC
GCGCAGCAGA TCGCCGCGCA CCTGCCGTCC GATTACGAAT GCATCTCGCC CGTGCGGCTC
GGCGACGCGC AGATCGCGAC GTTCGCGTAT TTCGGCGACA TGCACTTCTC GTTCACCGAT
GACTGCGACG TGATCCTGCG CCAGGATCGC GCGGACTTCG GCGAGCCGAG TTCGATCTCG
CAATACGTGT GGCGCCTCGT GTGGGAAGGC CGCCGCGTCG CCGACCGCGA CGAGCGCTTC
CGCCTGTACG AGCGAATCGA GCGCCCGAAG ACGCCCGTCA AGCGCCGCCC GCCGCGCAAG
GCCCGCTGA
 
Protein sequence
MKPVVRLTAS ATRALPRWLL LTLCIVYAAF GLFGRDPWKN EDAAGFGVMW TMAQGGLHDW 
LLPNLVGKFV TSDGPLGYWL GGLAIRALPW VDASNASRVY TGVLFCVACA FVWYAAYLLG
RRAEIQPFKY AFGGEPEPRD YGRTLADGAL LVLLACFGLA ERGHETTPQL AQFACIATFV
YGLVRAIDKP TQGALWWGLA LGLVALSGNP VLVAALALGT LALYLVTPEI RCVQLPAIGL
PLAVAVFAIW PLAAYIAFPD DANWFFNQWL HGSLMRFSGP PTTVLAYAAK NLPLFTWPAW
PLAIWAWVSW AGLRRRPHIA IPLSVAAPLL ALVILQSQQT NRMYMLLLPA LAVIATFALP
TLKRGAINAI DWFAVLSFTI LGTFVWLVWL ASLTGFPHPL ARNLGRLVPG YEPHFKVLSF
VCAVAATACW LMLVRWRISR QPKVLWRSVV LSSAGTTLMW VLLMTLWLPI VNYSRTYRDV
AQQIAAHLPS DYECISPVRL GDAQIATFAY FGDMHFSFTD DCDVILRQDR ADFGEPSSIS
QYVWRLVWEG RRVADRDERF RLYERIERPK TPVKRRPPRK AR