Gene BURPS668_A1919 details


Gene Information

Locus tag: BURPS668_A1919
Symbol:
ID: 4887452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1865631
End bp: 1866950
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 75%
IMG OID: 640131857
Product: rhamnosyl transferase
Protein accession: YP_001062914
Protein GI: 126445416
COG category: [G] Carbohydrate transport and metabolism; [C] Energy production and conversion
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
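
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1865631
end_bp = 1866950
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1320 0 439
```

Under these assumptions the span matches the reported gene length, is a whole number of codons, and yields the reported protein length.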


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.000883643
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGG TAATCGTGAC GGCGATCGGG TCGGCGGGGG ACGTGCACCC GCTGCTGGGG 
GTGAGCCGGG CGCTGGCCGC GCGCGGCCAC GACGTGGTGT TCTGCACGCA TGCGCCGTTC
GAGGCGGCGG TGCGCGCGAG CGGCTTCGCG TTCGTGCCGG TGGGCACGGC CGAGGCGTAT
GCGCAGGCGA TGGCGGACCC GGCGCTGTGG GATCCGCGCA CGTCGTTCCG GACGCTGTGG
CGGGTGATCG CGCCGGTGCT GCGGCCGCAC TTCGATACGC TGCGCGCGCT GAGCGACGCG
GACACGGTGC TGGTGGGCAC GCTGTGGGCG TTCTCGGCGC GGCTGATGCA GGAGCGCTTC
GGCGCGCGCT ACGTGTCGGT GCAGGTGTCG CCGTCGACGC TGCTGTCGGC GCACGCGCCG
CCGACGCACA AGCGGCTGAC GATCCCGAAG GGACTGCCGC TGGCGGTGAA GGCGGGGCTG
ATGACGCTGA TCGAGCGGCA GGTGCTGGAC CGGGTGTGCG GCCCGGAGCT GAACGCGGCG
CGGCGGGCGC TGGGGCTGGC GCCGGCCCGG CGGATCCTGG GGCGGTGGCT GCATTCGACG
GACGGGGTGC TGTGCCTGTT TCCGTCGTGG TTCGCGCCGG CGCAGCCGGA CTGGCCGGCG
AATCACCTGC AAAGCGGGTT TGCGCTGTTC AACGACGTGG GGCCGGTGCC GGCGGATGCG
GAGCTGGACG CGTTCGTCGC GTCGGGCGAG GCGCCGGTGG TGTTCACGGC GGGCTCGACG
CTGGTGGACG GGCACGCGTA CGAGCGGGCG GTGACGCAGG TGCTGCGGGC GACGGGCGTG
CGGGGGATCC TGCTCGCGCC GGACGCGCCG GCGGCATCGG ATGGGACGAT GGGGCCAATG
GAGAGGACGG CGGAGAGGAC GGCGCGGGCG AATGGCGTGG CGCTGCTCAA GCGCCGCTAC
GTGCCGCTCG CGGCGCTGCT GCCGCGGTGC CGGGCGCTGG TGCATCACGG CGGGATCGGC
ACGGCGTCGC TGGCGTACGC GGCGGGCGTG CCGCAGGTGG TGACGCCGTT CGCGCACGAT
CAGTTCGACA ACGCGCAGCG GGTGGCGGCG AGCGGCTGCG GGGTGCGGCT GGACGCGCCG
GTGCGCGGCG AGCCGCTGGC ACGGGCATTG GCGCGGGTGC TGGGCGACGC GGCGATGGCC
GCGCGCTGCG CCGAGGTGCG CGCGCGCATG GCGGCGCAGC CCGACGGCTG CGACGAGGCG
GCGCGCTTCA TCGAGCGCTT CGCGCCGGGC GTCGCGGCGC GGCAGGCGCA GCCGGCATGA
 
Protein sequence
MAKVIVTAIG SAGDVHPLLG VSRALAARGH DVVFCTHAPF EAAVRASGFA FVPVGTAEAY 
AQAMADPALW DPRTSFRTLW RVIAPVLRPH FDTLRALSDA DTVLVGTLWA FSARLMQERF
GARYVSVQVS PSTLLSAHAP PTHKRLTIPK GLPLAVKAGL MTLIERQVLD RVCGPELNAA
RRALGLAPAR RILGRWLHST DGVLCLFPSW FAPAQPDWPA NHLQSGFALF NDVGPVPADA
ELDAFVASGE APVVFTAGST LVDGHAYERA VTQVLRATGV RGILLAPDAP AASDGTMGPM
ERTAERTARA NGVALLKRRY VPLAALLPRC RALVHHGGIG TASLAYAAGV PQVVTPFAHD
QFDNAQRVAA SGCGVRLDAP VRGEPLARAL ARVLGDAAMA ARCAEVRARM AAQPDGCDEA
ARFIERFAPG VAARQAQPA
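
The record lists a GC content of 75% and translation table 11, and both sequences are printed in space-separated blocks of ten letters. The sketch below shows one way a reader might recompute GC content and re-derive the protein from the gene sequence. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not part of the source database. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it assumes the sequence is read in frame from its first base.

```python
# Hypothetical helpers for checking the record; not part of the source database.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 shares
# these codon assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used in the ten-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and the final four codons of the gene sequence above.
print(translate("ATGGCTAAGG TAATCGTGAC"))  # MAKVIV
print(translate("CAGCCGGCATGA"))           # QPA
```

Passing the full gene sequence to `gc_fraction` and `translate` would let the reported GC content and protein sequence be compared against recomputed values.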