Gene BURPS668_A0814 details

Gene Information

Locus tag: BURPS668_A0814
Symbol:
ID: 4885959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 790106
End bp: 791113
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 73%
IMG OID: 640130754
Product: rhamnosyltransferase
Protein accession: YP_001061813
Protein GI: 126443291
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID: [TIGR01556] L-rhamnosyltransferase
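
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 790106
end_bp = 791113

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1008 335
```

Under these assumptions the computed values agree with the listed 1008 bp and 335 aa.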


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0467732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACCT TGGGCGCGCT GGTGATTCTG TATTACCCGA CCGACGAGCA ACTGTCGGGC 
CTGGAGGCGC TCGCGCGCGA CAGCGACGCG CTCGCGGTGA TCGACAACAC GCCGCACGAG
CACGCGGCGG CGCGCGAGCG GGTGCGCGCG CTGTCGGCGC GGGCGCACGG CGAAGCGCGC
GTCGTGTGGC GGCACCACGG CAACCGCGGC GGGGTGGCGG GCGCGTACAA CGCGGGGCTG
TCGGCGCTGT TCGCGCAGGG CGTGGAGGCG GTCGCGCTGT TCGACCAGGA CTCGACGGTG
CCGGCCGCGT ACTTCGCGCG GATGCGCGAC GCGTGCGCGC AACTGGGCAC GCAACCGGGC
CCGCATGCGG GCGCGCATGC GGGCGCGTTC ATCGCGGGCC CGCGGATCTA CGACGCGAAC
GAGCAGCGCT TCCTGCCGGA GCTGATGACG AGCGGAGTGG CGGTGCGCCG CGTGCGGGTG
GAAGGCGAGC GCGCGCCGCA GCGCTGCGCG TTCCTGATCT CGTCGGGCAG CGTGATCTCG
CGGGGCGCGT ACGCGCGGCT CGGCCGCTTC GACGAGGCGC TGTTCATCGA CCACGTCGAC
ACCGAGTACT GCCTGCGCGC GCTGGCGCAC AACGTGCCGC TGTACGTGGT GCCGTCGCTG
GTGCTGACGC ACCGGATCGG CGCGCGGCGC CGGCACAAGG TGGGGCCGTT CGAGCTGACG
GCGATGCATC ATGGGTGGCT GCGCCGATAC TACGGCGCGC GCAACGCGAT GCAGCTGGGG
CTGCAGTACG GGTTGCGGTT TCCGGTGGCG CTGGTGCCGA ATCTGCTGAC GATCTGGCAG
GTGGTCCAGG TGGTGCTGTG CGAGCGGGAG AAGGGCGCGA AGCTGCGCGG GATCGCGCTG
GGCGTGCTCG ACGGGGTGTT CGGGCGCCTG GGGTCGTTCG AGGCGGCGCG CGCGGGCCAC
CGCACGGCAC GCGAGGAGGC GATGCGCGAA GCGCGGCGGC AGTCGTGA
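
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as demonstration input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short demonstration input.
first_line = "ATGACGACCT TGGGCGCGCT GGTGATTCTG TATTACCCGA CCGACGAGCA ACTGTCGGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content listed in the Gene Information section.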
 
Protein sequence
MTTLGALVIL YYPTDEQLSG LEALARDSDA LAVIDNTPHE HAAARERVRA LSARAHGEAR 
VVWRHHGNRG GVAGAYNAGL SALFAQGVEA VALFDQDSTV PAAYFARMRD ACAQLGTQPG
PHAGAHAGAF IAGPRIYDAN EQRFLPELMT SGVAVRRVRV EGERAPQRCA FLISSGSVIS
RGAYARLGRF DEALFIDHVD TEYCLRALAH NVPLYVVPSL VLTHRIGARR RHKVGPFELT
AMHHGWLRRY YGARNAMQLG LQYGLRFPVA LVPNLLTIWQ VVQVVLCERE KGAKLRGIAL
GVLDGVFGRL GSFEAARAGH RTAREEAMRE ARRQS
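
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard compact 64-letter string. It covers only the sense-codon and stop-codon assignments, which translation table 11 shares with the standard code, and it does not handle alternative start codons. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACGACCT TGGGCGCGCT GGTGATTCTG"))  # MTTLGALVIL
```

The output matches the first block of the protein sequence. The last nine bases of the gene, `CAGTCGTGA`, translate to `QS` followed by a stop, which matches the end of the protein.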