Gene BURPS1710b_A1191 details

Gene Information

Locus tag: BURPS1710b_A1191
Symbol: rhsA2
ID: 3692071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 1494515
End bp: 1495792
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 62%
IMG OID: 637731445
Product: YD repeat-containing protein
Protein accession: YP_336348
Protein GI: 76819503
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
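
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates from the record above.
print(cds_lengths(1494515, 1495792))  # (1278, 425)
```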


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.270395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGGCGTG CGAGCGCGAT GATCGATCCG GCGGGGCGGA CGACGGCTTG GGAATATGAC 
GCGTATGGCA GTTTGCTTGT GCAGACGTTG CCGGATGGCA GCGCAGTCAG AACGGAATTT
GACCTCGATC ACCGACCGGT CTGCATGACG TTGATAGGCG GCCGGCAGTG GGGCTACGAG
TGGAATACGT TCGGTAATCT GCTCGCGCAG AGCGATCCAT CGGGGGCGAT ATCTCGCTAT
ACCTATGACG AGTACGGCCA GCTTGTTGAG CATACTGGGC CGCGTGGTGC GAGCACACGG
TTCGATTATC ACCCGGACGG CAATCTCGCG GCGCAGATCG ATGCGTTGGG GCATCGCACG
CAGTATCGGT ACGATGCGCG CGGCTACCTC GGCGAAGCAA TCGATGCGCT CGGACAGCAA
AGCCAATACG AGTACGACCG CAACGGCCAT CTGACGCGCG CAATCGAGCC GGGCGGGCGT
GAGATTCACT GTGCGTACGA CGCCGATGGA AATCTGTCTC GCCATCGTGA CCCCATGGGC
CACGTGACGC AGGTGGAGTA CTCGGCGCTC GGACAGGTCA GCAGACGGCT CGCGCCCGAC
GGCACCACCG TTGAATACCG CTACGACAGC CACATTACCA GCGCGGGATT CCGAACGCGG
CCCATCGGTC GGCTGCCGAT GTTCGCGTGC CAGACTTGCC GGCGCTACTT CAGTCGCACG
GCCGCCCCCC CACTCGGCGA GAAACATCTC AAGAAACTCG ATCTATTCGT GTCCTTGCTG
TCGCATCCGA TCTCGTGCGT TGATGCGGGC GAACAGATGG GCAGCCTATC GACCGACATC
GGAAAACGCG TGACGGCCTG GCGCGCGTGG CTGTTGGAGC TCGACCCGAG CGGCAAGTGG
GAGCGCCGCG TGAGGCTCAG CCATCGACCT CCGCATTGCC CGAACTGCGG CAGTCACCAG
ACGCGTTTCG ATGAATGCTC GAACGGCGCC TTCCCACGGT TCAAATGCGC GAATTGCGGG
ACCAAATTCA CCCGACGCCG CGGCACGCCG TTCGTCAATG CGAAGATGAG TTCGCCCGAG
CGCATGCGCC TGGTCATTCG GCGCCTGTCG CTGCCGTTGT TGGTCATGCA GGTGGCGGAC
CTTGTCGGCA CGAGCCATGG GATGGTCCGG AAATGGCACA GCATGTTCAC CGATTTTGCG
GATCGGCTCG AACCGAGTGG CAGTCTTTCA GCGCGGATCA GGTTGCGCTC GAACTCTGCC
AATGCGCCGA ACAAATGA
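
The GC content field in the gene information is a percentage of G and C bases in the gene sequence. A minimal sketch of that calculation is given below, assuming the value is simply (G + C) / length over the nucleotides listed above; whether the database rounds or computes it in exactly this way is an assumption. The function name `gc_content` is hypothetical. Whitespace is ignored so the 10-base blocks can be pasted in as shown.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above: 7 G + 1 C out of 10 bases.
print(f"{gc_content('GTGGGGCGTG'):.0%}")  # 80%
```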
 
Protein sequence
MGRASAMIDP AGRTTAWEYD AYGSLLVQTL PDGSAVRTEF DLDHRPVCMT LIGGRQWGYE 
WNTFGNLLAQ SDPSGAISRY TYDEYGQLVE HTGPRGASTR FDYHPDGNLA AQIDALGHRT
QYRYDARGYL GEAIDALGQQ SQYEYDRNGH LTRAIEPGGR EIHCAYDADG NLSRHRDPMG
HVTQVEYSAL GQVSRRLAPD GTTVEYRYDS HITSAGFRTR PIGRLPMFAC QTCRRYFSRT
AAPPLGEKHL KKLDLFVSLL SHPISCVDAG EQMGSLSTDI GKRVTAWRAW LLELDPSGKW
ERRVRLSHRP PHCPNCGSHQ TRFDECSNGA FPRFKCANCG TKFTRRRGTP FVNAKMSSPE
RMRLVIRRLS LPLLVMQVAD LVGTSHGMVR KWHSMFTDFA DRLEPSGSLS ARIRLRSNSA
NAPNK
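
The protein sequence above can be related to the gene sequence through translation table 11 (bacterial, archaeal and plant plastid code). That table uses the same codon-to-amino-acid assignments as the standard code but allows alternative start codons such as GTG, which is read as methionine when it initiates translation. The sketch below is a hedged, standard-library illustration: `translate_cds` is a hypothetical helper, and it assumes the input begins at a valid start codon and is in frame.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is
    emitted as 'M' even when it is GTG or TTG rather than ATG.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGGCGTG CGAGCGCGAT GATCGATCCG"))  # MGRASAMIDP
```

The printed residues match the first ten residues of the protein sequence, and the final codons of the gene (AAT GCG CCG AAC AAA TGA) translate to NAPNK followed by a stop.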