Gene BURPS1106A_A2826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2826 
Symbol 
ID4904996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2766340 
End bp2767599 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content62% 
IMG OID640145929 
Productsubstrate-binding repeat-containing protein 
Protein accessionYP_001076855 
Protein GI126457855 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGATC CGGCGGGGCG GACGACGGCT TGGGAATATG ACGCGTATGG CAGTTTGCTT 
GTGAAGACGT TGCCGGATGG CAGCGCAGTC AGAACGGAAT TTGACCTCGA TCACCGACCG
GTCTGCATGA CGTTGATAGG CGGCCGGCAG TGGGGCTACG AGTGGGATAC GTTCGGTAAT
CTGCTCGCGC AGATCGATCC ATCGGGGGCG ATATCTCGCT ATACCTATGA CGAGTACGGC
CAGCTTGTTG AGCATACTGG GCCGCGTGGT GCGAGCACAC TGTTCGATTA TCACCCGGAC
GGCAATCTCG CGGCGCAGAT CGATGCGTTG GGGCATCGCA CGCAGTATCG GTACGATGCG
CGCGGCTACC TCGTCGAAGC GATCGATGCG CTCGGACAGC AAAGCCAATA CGAGTACGAC
CGCAACGGCC ATCTGACGCG CGCAATCGAG CCGGGCGGGC GTGAGATTCA CTGTGCGTAC
GACGCCGATG GAAATCTGTC TCGCCATCGT GACCCCATGG GCCACGTGAC GCAGGTGGAG
TACTCGGCGC TCGGACAGGT CAGCAGACGG CTCGCGCCCG ACGGCACCAC CGTTGAATAC
CGCTACGACA GCCACATTAC CAGCGCGGGA TTCCGAACGC GGCCCATCGG TCGGCTGCCG
ATGTTCGCGT GCCAGACTTG CCGGCGCTAC TTCAGGCGCA CGGCCGCCCC CCCACTCGGC
GAGAAACATC TCAAGAAACT CGATCTATTC GTGTCCTTGC TGTCGCATCC GATCTCGTGC
GTTGATGCGG GCGAACAGAT GGGCAGCCTA TCGACCGACA TCGGGAAACG CGTGACGGCC
TGGCGCGCGT GGCTGTTGGA GCTCGACCCG AGCGGCAAGT GGGAGCGCCG CGTGAGGCTC
AGCCATCGAC CTCCGCATTG CCCGAACTGC GGCAGTCACC AGACGCGTTT CGATGAATGC
TCGAACGGCG CCTTCCCACG GTTCAAATGC GCGAATTGCG GGACCAAATT CACCCGACGC
CGCGGCACGC CGTTCGTCAA TGCGAAGATG AGTTCGCCCG AGCGCATGCG CCTGGTCATT
CGGCGCCTGT CGCTGCCGTT GTTGGTCATG CAGGTGGCGG ACCTTGTCGG CACGAGCCAT
GGGATGGTCC GGAAATGGCA CAGCATGTTC ACCGATTTTG CGGATCGGCT CGAACCGAGT
GGCAGTCTTT CAGCGCGGAT CAGGTTGCGC TCGAACTCTG CCAATGCGCC GAACAAATGA
 
Protein sequence
MIDPAGRTTA WEYDAYGSLL VKTLPDGSAV RTEFDLDHRP VCMTLIGGRQ WGYEWDTFGN 
LLAQIDPSGA ISRYTYDEYG QLVEHTGPRG ASTLFDYHPD GNLAAQIDAL GHRTQYRYDA
RGYLVEAIDA LGQQSQYEYD RNGHLTRAIE PGGREIHCAY DADGNLSRHR DPMGHVTQVE
YSALGQVSRR LAPDGTTVEY RYDSHITSAG FRTRPIGRLP MFACQTCRRY FRRTAAPPLG
EKHLKKLDLF VSLLSHPISC VDAGEQMGSL STDIGKRVTA WRAWLLELDP SGKWERRVRL
SHRPPHCPNC GSHQTRFDEC SNGAFPRFKC ANCGTKFTRR RGTPFVNAKM SSPERMRLVI
RRLSLPLLVM QVADLVGTSH GMVRKWHSMF TDFADRLEPS GSLSARIRLR SNSANAPNK