Gene BURPS1106A_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3123 
Symbol 
ID4902570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3042917 
End bp3044074 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID640136349 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001067361 
Protein GI126453504 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.435597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCTT TCCAGTCCGA CATGAGTTCC GCTTCCGCCC CGCGCATCGT TCTCGTCTGC 
AATACCGCCT GGGCGATCTA TACGTACCGG CAAGGCCTGC TTCGCATGCT GATCGCGCGC
GGCGCGCAGG TGACCGTGCT CGCGCCGCGC GACCGCACCG TCGAGCCGCT CGTGCGCATG
GGCTGCCGCT ACGCGGAGCT GCCCGTCGCC TCGAAAGGCA CGAGCCCGCG CGAGGACCTG
CGCACGCTCA TCGCGCTGTA TCGGCACTAC CGCGCGATCC GGCCCGACCT CGTGTTCCAT
TACACGATCA AGCCGAACAT CTACGGCTCG ATCGCCGCGT GGCTCGCGCG CGTGCCGTCG
ATCGCGGTGA CGACGGGCCT CGGCTACGTG TTCATCCAGC AGAGCCACGC CGCACGCGTC
GCGAAGCAGC TGTACCGCTT CGCGTTGCGC TTTCCGCGCG AGGTCTGGTT CCTGAACCGC
GACGATCTGC ACACGTTCAC GCACGAGCAG CTCCTCGCGC ATCCGGCGCG CGCGCGCCTG
CTGCACGGCG AGGGCGTCGA CCTCGAGCAG TTCGCGCTCG CGCCGCTGCC CGCGCGCGAC
ACGTTCACCT TCGTGCTGAT CGGCCGGCTG CTGTGGGACA AGGGCGTGCG CGAATACGTC
GATGCGGCGC GCATGCTGCG CGCGCGCTAT CCGCACGCGC GCTTCGCGCT GCTCGGCCCC
GTCGGCGTCG ACAATCCGAG CGCGATCTCG CAGGCCGACG TCGACGCGTG GGTGCGCGAA
GGCGTGATCG ATTACCTCGG CGAGGCGCAC GACGTGCGGC CGCACATCGC CCGCGCCGAT
TGCGTCGTGC TGCCGTCCTA TCGCGAAGGC GTGCCGCGCA CGCTGATGGA GGCGTCCGCG
ATGGGCCGGC CGATCGTCGC GACCGACGTG CCGGGCTGCC GCGACGTCGT CGCCGACGGC
AGCACCGGGC TGCTGTGCGC CGCGCGCGAC AGCGCGAGCC TCGCCGCGCA GCTCGCGCGG
ATGCTCGACA TGAGCGCGGC CGAGCGGCGC GCGATGGGCG AGCGCGGCCG GAGAAAGATC
GTCGCGGAAT TCGACGAGGC GAAGGTCGTC GAGCGTTATC ATCAGACCAT TTCGGCCCTG
ACGGGCATCA CACTTTGA
 
Protein sequence
MISFQSDMSS ASAPRIVLVC NTAWAIYTYR QGLLRMLIAR GAQVTVLAPR DRTVEPLVRM 
GCRYAELPVA SKGTSPREDL RTLIALYRHY RAIRPDLVFH YTIKPNIYGS IAAWLARVPS
IAVTTGLGYV FIQQSHAARV AKQLYRFALR FPREVWFLNR DDLHTFTHEQ LLAHPARARL
LHGEGVDLEQ FALAPLPARD TFTFVLIGRL LWDKGVREYV DAARMLRARY PHARFALLGP
VGVDNPSAIS QADVDAWVRE GVIDYLGEAH DVRPHIARAD CVVLPSYREG VPRTLMEASA
MGRPIVATDV PGCRDVVADG STGLLCAARD SASLAAQLAR MLDMSAAERR AMGERGRRKI
VAEFDEAKVV ERYHQTISAL TGITL