Gene BURPS1106A_1962 details


Gene Information

Locus tag: BURPS1106A_1962
Symbol:
ID: 4901347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1928562
End bp: 1929911
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 69%
IMG OID: 640135192
Product: glycosyl hydrolase family protein
Protein accession: YP_001066227
Protein GI: 126455294
COG category: [R] General function prediction only
COG ID: [COG3979] Uncharacterized protein contain chitin-binding domain type 3
TIGRFAM ID:
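
The Start bp, End bp, Gene Length, and Protein Length fields above are arithmetically related, and a short check can confirm they agree. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1928562
end_bp = 1929911
protein_length_aa = 449

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not contribute a residue.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp)           # 1350
print(expected_protein_length)  # 449
print(expected_protein_length == protein_length_aa)  # True
```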


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.639701
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGTCCC GCATCGTCCC GCGCGCGCTC GCGGCGGGCT GTCTGTTCGC GGCGGCGGGC 
GCGTCGCAGG CGGCGGGCGT GTACGCGCCC TACGTCGACG TGACGCTCTA CCCGACGCCG
CTCGTCGACC AGATCGGCGT GCAGCAAGGC ATCCAGCAAT TCATGCTCGC GTTCGTCGTG
TCGGGCGGCA ACCAGTGCAC GCCGTCATGG GGCGGCGTGC AGCCGATCGG CAACGGCGCG
ACGGGCGATC TGCTCGACAA GATCGCGACG TCGGTCACCG CCTATCGCGC GAAGGGCGGC
GACGTGGCGG TATCGTTCGG CGGCGCGGCC GGCCAACCGC TGATGCAGGC GTGCTCGAGC
GTCGCCGCGC TGAAGGGCGC ATATCAGACC GTGATCGACA CGTACAGCCT CACGCACGTC
GATTTCGACA TCGAAGGCGC GTCGCAGCAG GATTCGGCCG CCGTCGCGCG CAACTTCCAG
GCGGTCGCGC AACTGCAAGC CGACTACGCG GCCAAAGGCA AGCCGCTGCA CGTGACGCTC
ACGCTGCCGG CGATGCCCAC GGGCCTCGTG CAGGACGGCC TGAACGTGCT GAACGCGGCG
CTCGCGAACA ACGTGACGCT CGACGCGGTG AACATCATGA CGATGGATTA CGGCCCGTCC
GGCATCGACA TGGGCGCGGC CGCGATCAGC GCCGCGCAGG GCCTCTACTC GCAGCTCGAC
ACCGCGTACA AGTCGGCCGG CAAGCCGCAG ACCGACGCGC AATTGAAGCA GCTCGTCGGC
GTGACGCCGA TGATCGGCGT GAACGACGTC GCGGGCGAGA TCTTCACGCT CGCGAACGCG
CAGAGCGTGC AGACGACGGC CGCGAACAAC AACTACGGCT TCGTCGGCAT CTGGTCGATC
ACGCGCGACA AGGCATGCGA CGGCAGCTCG CAGTACGCGT CGCCGATCTG CTCGGGCGTC
GCGCAGCAGC CGTACGCGTT CTCGTCGGTC TTCAAGCAAC TGGGCGGCCA TTGGGGCGCG
GGCGTCACCC AGGACCCGAA CTACGGCGGC GGCTCGGACG GCGGCGGCAA GCCCCAGCCG
GGCGCGCCGT GGTCGGCCAC GCAGGTCTAT ACGGCGGGCG CGACGGTCAC GTACCAGGGC
ACGACCTATC AGGCCCAATG GTGGACGCAG GGCGACATTC CGGGGCAGGC GTCGGTGTGG
AAGCCCGTCG GCGGCAACGT GCCGGCCTGG TCATCGACGA CCGCGTATCC GGGCGGCGCG
TGCGTGACGT ATCAGGGCGC GAAGTATTGC GCGAAATGGT GGACGCAGGG CGACGTGCCG
AGCGCGGGCG GCCCCTGGAC GCGAGCGTGA
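
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before computing anything from it. The following is a minimal sketch of how the listing could be cleaned and its GC fraction computed for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the listing is used here as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full listing to check the whole gene.
first_line = "ATGTTGTCCC GCATCGTCCC GCGCGCGCTC GCGGCGGGCT GTCTGTTCGC GGCGGCGGGC"
seq = clean_sequence(first_line)

print(len(seq))                  # 60
print(f"{gc_content(seq):.1%}")  # 76.7%
```

Applied to the full 1350 bp listing, the same function gives a value that can be compared with the reported GC content.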
 
Protein sequence
MLSRIVPRAL AAGCLFAAAG ASQAAGVYAP YVDVTLYPTP LVDQIGVQQG IQQFMLAFVV 
SGGNQCTPSW GGVQPIGNGA TGDLLDKIAT SVTAYRAKGG DVAVSFGGAA GQPLMQACSS
VAALKGAYQT VIDTYSLTHV DFDIEGASQQ DSAAVARNFQ AVAQLQADYA AKGKPLHVTL
TLPAMPTGLV QDGLNVLNAA LANNVTLDAV NIMTMDYGPS GIDMGAAAIS AAQGLYSQLD
TAYKSAGKPQ TDAQLKQLVG VTPMIGVNDV AGEIFTLANA QSVQTTAANN NYGFVGIWSI
TRDKACDGSS QYASPICSGV AQQPYAFSSV FKQLGGHWGA GVTQDPNYGG GSDGGGKPQP
GAPWSATQVY TAGATVTYQG TTYQAQWWTQ GDIPGQASVW KPVGGNVPAW SSTTAYPGGA
CVTYQGAKYC AKWWTQGDVP SAGGPWTRA
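
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table by hand rather than relying on any library. It uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs only in which codons may act as start codons). The `translate` function is a hypothetical helper, and only the first and last lines of the gene listing are translated here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; stop codons are rendered as '*'."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence listing above.
first_line = "ATGTTGTCCC GCATCGTCCC GCGCGCGCTC GCGGCGGGCT GTCTGTTCGC GGCGGCGGGC"
last_line = "AGCGCGGGCG GCCCCTGGAC GCGAGCGTGA"

print(translate(first_line))  # MLSRIVPRALAAGCLFAAAG
print(translate(last_line))   # SAGGPWTRA*
```

The two outputs match the start and end of the protein listing, with the trailing `*` marking the TGA stop codon that is not part of the 449 aa product.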