Gene BURPS1106A_A2477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2477 
Symbol 
ID4904763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2438195 
End bp2439379 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID640145581 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001076508 
Protein GI126457677 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.333211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAATCG CCATCGTCAC GCACGTCGTG CGCCATAACG ACGGCCAGGG ACGCGTCAAT 
TACGAGATCG CGCGGGCGGC GCTCGCGGAG AATTGCCAGG TGACGCTCGT CGCCTCGCAC
GTCGCGCCCG AACTGCTCGC GGATGCGCGC GTGCGCTGGA TCGCCGTGAA GGCGGGCCGC
TTCTGGCCGT CGAACCTCGT CAAGCAGCAG GTGTTCGCGA TCAAGAGCGC ATGGTGGCTG
CGCCGTCATC GCGAAGCTTA CGACGTGCTG CACGTCAACG GCTTCATCTC ATGGGTGCGC
GCCGACGTGA ATACCGCTCA CTTCGTCCAT AGCGGCTGGT TCGCGAGCCG CTATTACCCG
TTCGGCCTGT CGAAGGGGCT GTGGTCCGCG TATCAGTACG TCTATACGCG CGTGAACACG
CGGCTCGAGC GCTGGGCGTA CCGGCGCGCG CGCGCGATCA CGGCCGTGTC GCAGAAGGTC
GCCGACGAGA TCCGGCGAAT CGGCATCGAC GGCGGCAGGA TCGGCGTCAT CTATAACGGC
GTCGACGCGC AGGCGTTCGC GAACGCGGTG CCCGATCGCC GCGCGTTCGG CTTGCCGGCC
GAACCGTTCA TGCTGCTGTT CGTCGGCGAT CTGCGCACGC CGCGCAAGAA TCTCGGCACC
GTGCTCAAGG CGCTCGCGCA TCTGCCGCCG AACGTGCATC TCGCGGTCGC CGGCTATTTG
CCCGGCAGCC CGTATCCGGA CGAGGCTCGC GCGCTGAAGA TCGATTCGCG CGTGCATTTT
CTCGGCCTCG TGAAGAACAT GCCGACGCTG ATGTCGTCGG TCGATGCATA CGTGTTTCCT
TCGCGCTACG AAGCGATGAG CCTGTCGCTG CTCGAGGCGA TGGCGGCGGG GCTGCCCGTC
GTGACCGCGC GCACCGCGGG CGGCGCGGAG ATCATCACGC CGGAGTGCGG GATCGTGCTC
GACGATCCCG ACGATCCGGC CGCGCTCGCG GCCGCGATCG AGCGCCTCGC GCGCTCGCGT
GACGCGTGCC GCGCGATGGG CGAGGCGGCC CGCAGGCTGA TGGAGGGATT CGGATGGGCG
CGCATGGGCG CGCAGTACAT CGCATTGTAT CGGCGGCTGC GCCAGTCGTC GCAGCCGTCG
CCGCTCGCCG GCACCGAGCA TGTCGTGACG CAGGAGCGGT CGTAA
 
Protein sequence
MRIAIVTHVV RHNDGQGRVN YEIARAALAE NCQVTLVASH VAPELLADAR VRWIAVKAGR 
FWPSNLVKQQ VFAIKSAWWL RRHREAYDVL HVNGFISWVR ADVNTAHFVH SGWFASRYYP
FGLSKGLWSA YQYVYTRVNT RLERWAYRRA RAITAVSQKV ADEIRRIGID GGRIGVIYNG
VDAQAFANAV PDRRAFGLPA EPFMLLFVGD LRTPRKNLGT VLKALAHLPP NVHLAVAGYL
PGSPYPDEAR ALKIDSRVHF LGLVKNMPTL MSSVDAYVFP SRYEAMSLSL LEAMAAGLPV
VTARTAGGAE IITPECGIVL DDPDDPAALA AAIERLARSR DACRAMGEAA RRLMEGFGWA
RMGAQYIALY RRLRQSSQPS PLAGTEHVVT QERS