Gene BURPS1710b_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2109 
SymbolchiC 
ID3691184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2305739 
End bp2307100 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content69% 
IMG OID637728565 
Productglycosy hydrolase family protein 
Protein accessionYP_333504 
Protein GI76810523 
COG category[R] General function prediction only 
COG ID[COG3979] Uncharacterized protein contain chitin-binding domain type 3 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTCA GCATGTTGTC CCGCATCGTC CCGCGCGCGC TCGCGGCGGG CTGTCTGTTC 
GCGGCGGCGG GCGCGTCGCA GGCGGCGGGC GTGTACGCGC CCTACGTCGA CGTGACGCTC
TACCCGACGC CGCTCGTCGA CCAGATCGGC GTGCAGCAAG GCATCCAGCA ATTCATGCTC
GCGTTCGTCG TGTCGGGCGG CAACCAGTGC ACGCCGTCAT GGGGCGGCGT GCAGCCGATC
GGCAACGGCG CGACGGGCGA TCTGCTCGAC AAGATCGCGA CGTCGGTCAC CGCCTATCGC
GCGAAGGGCG GCGACGTGGC GGTATCGTTC GGCGGCGCGG CCGGCCAACC GCTGATGCAG
GCGTGCTCGA GCGTCGCCGC GCTGAAGGGC GCGTATCAGA CCGTGATCGA CACGTACAGC
CTCACGCACG TCGATTTCGA CATCGAAGGC GCGTCGCAGC AGGATTCGGC CGCCGTCGCG
CGCAACTTCC AGGCGGTCGC GCAACTGCAG GCCGACTACG CGGCCAAAGG CAAGCCGCTG
CATGTGACGC TCACGCTGCC GGCGATGCCC ACGGGCCTCG TGCAGGATGG CCTGAACGTG
CTGAACGCGG CGCTCGCGAA CAACGTGACG CTCGACGCGG TGAACATCAT GACGATGGAT
TACGGCCCGT CCGGCATCGA CATGGGCGCG GCCGCGATCA GCGCCGCGCA GGGCCTCTAC
TCGCAGCTCG ACACCGCGTA CAAGTCGGCC GGCAAGCCGC AGACCGACGC GCAATTGAAG
CAGCTCGTCG GCGTGACGCC GATGATCGGC GTGAACGACG TCGCGGGCGA GATCTTCACG
CTCGCGAACG CGCAGAGCGT GCAGACGACG GCCGCGAACA ACAACTACGG CTTCGTCGGC
ATCTGGTCGA TCACGCGCGA CAAGGCATGC GACGGCAGCT CGCAGTACGC GTCGCCGATC
TGCTCGGGCG TCGCGCAGCA GCCGTACGCG TTCTCGTCGG TCTTCAAGCA ACTGGGCGGC
CATTGGGGCG CGGGCGTCAC CCAGGACCCG AACTACGGCG GCGGCTCGGA CGGCGGCGGC
AAGCCCCAGC CGGGTGCGCC GTGGTCGGCC ACGCAGGTCT ATACGGCGGG CGCGACGGTC
ACGTACCAGG GCACGACCTA TCAGGCCCAA TGGTGGACGC AGGGCGACAT TCCGGGGCAG
GCGTCGGTGT GGAAGCCCGT CGGCGGCAAC GTGCCGGCCT GGTCATCGAC GACCGCGTAT
CCGGGCGGCG CGTGCGTGAC GTATCAGGGC GCGAAGTATT GCGCGAAATG GTGGACGCAG
GGCGACGTGC CAAGCGCGGG CGGCCCCTGG GCGCGAGCGT GA
 
Protein sequence
MNFSMLSRIV PRALAAGCLF AAAGASQAAG VYAPYVDVTL YPTPLVDQIG VQQGIQQFML 
AFVVSGGNQC TPSWGGVQPI GNGATGDLLD KIATSVTAYR AKGGDVAVSF GGAAGQPLMQ
ACSSVAALKG AYQTVIDTYS LTHVDFDIEG ASQQDSAAVA RNFQAVAQLQ ADYAAKGKPL
HVTLTLPAMP TGLVQDGLNV LNAALANNVT LDAVNIMTMD YGPSGIDMGA AAISAAQGLY
SQLDTAYKSA GKPQTDAQLK QLVGVTPMIG VNDVAGEIFT LANAQSVQTT AANNNYGFVG
IWSITRDKAC DGSSQYASPI CSGVAQQPYA FSSVFKQLGG HWGAGVTQDP NYGGGSDGGG
KPQPGAPWSA TQVYTAGATV TYQGTTYQAQ WWTQGDIPGQ ASVWKPVGGN VPAWSSTTAY
PGGACVTYQG AKYCAKWWTQ GDVPSAGGPW ARA