Gene BURPS1710b_1758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1758 
Symbol 
ID3690650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1901458 
End bp1903881 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content69% 
IMG OID637728214 
Productlectin repeat-containing protein 
Protein accessionYP_333159 
Protein GI76810477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAA TACGCATGGC GTCGATCGAC GTCAGACTCT TCATATTCGG ACTGCTGCTC 
TTCTTGTCGG GCCTCGCGCA TGCGGACACG CCGAGCGCCG CGCAGCCGTC GCCCTACGGC
ATCGTGTCCG TCGACATCGA CGGCACGCGG CTGTGCATGG AAGTCGGCGG CGCCGCCGTG
GCGCCGCTCG CGGGCGTCGA CGTCGGCGAA TGCCGGCAGA GTGCCGATCA GCAATGGTCG
CTCGTGCCCA ACGGGCAGAA TTTCCAGGTG GTCGCCAAGC ACAGTCTGCA ATGCCTGACG
ATCGCGAACG CCGCGACGTC GCCGGGCGCG CCACTCGTGC AGTATCCGTG CAACGGCGGC
TCGAATCAGC AATGGAGCGT CTCCCCATCG CGCTCCGGCT ACAAGCTCGT GTCCGCGCAC
GACGCGCTGT GCGCGAGCGC CGCCGGTGCG TCGCCCGGCT CGCGGATGGT TCAGCAGCGA
TGCGACGAGC ATGCGTCGCA AACGCTCTAC GTGTCCGCGC CGCCCGAGCT CGTCGGCGCG
CTCATCGGCA TGAACGGCGG GCTTTGCATC GACGGCGGCG CGGCCGGCGC GCAAGCCGTG
CAACGCACGT GCGACGACGC GCCCGGCCAG CAATGGCGAA TCCGTCCGAA CGGCAGCGCC
TACAAGATCC ACCTCGCCTC GACGAACCTG TGCCTCGGCA CGCGCGACGC GTCGCGGGGC
ACGGGCGCCG CGATCGAGAG CCAACGCTGC GCGAATGTCG CGAGCCAGTT GTGGACGATC
CGCGCGGCAA GCGAAGTCGA CGGCAGGAAC GGCTACGCGA ACGGGTACTG GCAGTTCGTA
TCGGCAAACA GCGGCCAGTG CATCGTCGTG CAGAACGCCT CCACCGCCGA CGGCGCGAAT
CTGATCCAGT ATCCGTGCGG CCAGGGCAAC GCGGGCTCGA ACACGATGTG GCGCGTCAAC
CGGACATCGC GTTCGTCCTG GACGGGCGCG ATCGCGCTGC CGCTCGTTCC GTCGGCCGCC
GCGCACCTGC CGGACGGCAA GATCCTGATG TGGTCGGCGG ATTCGACGAT CTCGTTCGGC
GGCGGCGGCG GCGGCGTCGA CGGCAACACC TACACGACCG TGTTCGATCC GATCGCGCAA
AAGGCAACCG ATGCCGTGCT GACGAGCCTC GGCCACGACA TGTTCTGCCC CGGCCTCAAT
CTGCTCGCGG ACGGCAAGAT CTTCGTGAAC GGCGGCGTGT CCAGCAGAAA GACCAGCGTC
TACGATCCGG CCACGCACGC GTGGAGCCCG TCGAACCTGA TGAACATTGC GCGCGGATAC
CAATCGAGCG TCACGCTGTC CGACGGCGCG GTCTTCACGC TCGGCGGCTC ATGGAACGTC
GCGACGCCCG ACTTCCGGCT CGGCGACAAG TACGGCGAAG TGTGGCGGCC CGACAGCGGC
TGGTCGATCC TGTCGAACGT ACCCGACATC ATCGGCCCCG ATCCCGCGGG CGCCTATCGC
GGCGACAATC ACATGTGGCT CGTCGCGGCC CGCGACCGCT GGGTCTTCTA CGCGGGCCCC
GATGCGACGC TGCGCTGGAT CGACACGACG GGCAACGGCA GGATCGTCGA GGCCGGCAAG
CGCGGCGACG ATGCGTACTC GATCAGCGGC AACGCCGTCA TGTACGACGT CGGCAAGGTC
CTCACCGTCG GCGGCGCGCC GGCGTACGAC AACGGCGTCG CGTCGGCGAG CGCATACGTG
ATCGACATCT CGGCGGGCCC GCAAGTCCCG CCCGTCGTGC GCAAGGTGCA GCCGCTCGCG
TACAGCCGCG GCTTCGTCAA CAGCGTGGTG CTGCCGAACG GGCAAGTCGT CGCGATCGGC
GGGCAGGCGG TGACGATTCC GTTCTCCGAC GATCAATCGG TGCTCGTGCC CGAACTGTGG
GACCCGTCCA CCGAGGCCTT CACGCGCCTC GCGCCGATGA CGGTGCCGCG CAACTATCAC
AGCGAGGCGC TGCTGCTGCC GGACGGCCGC GTGATGGCTT CGGGCGGCGG GCTGTGCGGA
AGCGGCTGCA ACACCAACCA TCCGAACGTG CAGATCCTCA CGCCGCCCTA TCTGCTCAAC
GCGGACGGCA CGGCGGCGAG CCGCCCCGTG ATCGCCGCCG CGCCGGAGCA GGCCGCCAAC
GGCTCGACGA TCGCCGTATC GACGGATGCG CCGATCCGCA GCTTCGCGCT GGTGCGGATG
TCGTCGAGCA CGCACTCGGT CAACACCGAC CAGCGCCGCA TTCCGCTCAC GTTCCGGCAG
TCGTCCGGCG GCGACGGCGG CTATGCGTAC ACCGTCGCGA TTCCGGCGGA CGCCGGCGTC
GCGATACCGG GGCAGTACAT GCTGTTCGCG CTGAACGCGG GGGGCGTGCC GAGCGTCGCG
AAGACGATCC GGATCGGCGC CTGA
 
Protein sequence
MRTIRMASID VRLFIFGLLL FLSGLAHADT PSAAQPSPYG IVSVDIDGTR LCMEVGGAAV 
APLAGVDVGE CRQSADQQWS LVPNGQNFQV VAKHSLQCLT IANAATSPGA PLVQYPCNGG
SNQQWSVSPS RSGYKLVSAH DALCASAAGA SPGSRMVQQR CDEHASQTLY VSAPPELVGA
LIGMNGGLCI DGGAAGAQAV QRTCDDAPGQ QWRIRPNGSA YKIHLASTNL CLGTRDASRG
TGAAIESQRC ANVASQLWTI RAASEVDGRN GYANGYWQFV SANSGQCIVV QNASTADGAN
LIQYPCGQGN AGSNTMWRVN RTSRSSWTGA IALPLVPSAA AHLPDGKILM WSADSTISFG
GGGGGVDGNT YTTVFDPIAQ KATDAVLTSL GHDMFCPGLN LLADGKIFVN GGVSSRKTSV
YDPATHAWSP SNLMNIARGY QSSVTLSDGA VFTLGGSWNV ATPDFRLGDK YGEVWRPDSG
WSILSNVPDI IGPDPAGAYR GDNHMWLVAA RDRWVFYAGP DATLRWIDTT GNGRIVEAGK
RGDDAYSISG NAVMYDVGKV LTVGGAPAYD NGVASASAYV IDISAGPQVP PVVRKVQPLA
YSRGFVNSVV LPNGQVVAIG GQAVTIPFSD DQSVLVPELW DPSTEAFTRL APMTVPRNYH
SEALLLPDGR VMASGGGLCG SGCNTNHPNV QILTPPYLLN ADGTAASRPV IAAAPEQAAN
GSTIAVSTDA PIRSFALVRM SSSTHSVNTD QRRIPLTFRQ SSGGDGGYAY TVAIPADAGV
AIPGQYMLFA LNAGGVPSVA KTIRIGA