Gene BURPS1106A_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2831 
Symbol 
ID4899536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2786326 
End bp2787546 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID640136057 
Producthypothetical protein 
Protein accessionYP_001067078 
Protein GI126451891 
COG category[S] Function unknown 
COG ID[COG4394] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCGT CCGCCCCGCT TCCCCCGCCC GTTCCCCCGC CCGCCGACAC GGCGTCGCCC 
CTGCAAGCGG CAAGCCCGGT CGCGTGCGAC ATCTTCTGTG CGGTCGTCGA CAACTTCGGC
GACATCGGCG TGTGCTGGCG TCTCGCGCGC CAGCTCGCGC TCGAGCACGG CTGGCAGGTG
CGGATCTTCG TCGACGCGCT CGCGACGTTC GCGCGCCTGC AGCCGGCCGC GTTGCCCGAC
GCCGCGCGGC AGACCGTCGA CGGCATCGTC GTCGAGCACT GGCGCGCGCC CGCGCACGCG
GGCGACACGC TCGAGATCGC CGACATCGTG ATCGAGGCGT TCGCCTGCGA GCTGCCGGGC
GCGTATGTCG CCGCGATGGC GCGCCGCGCG CGGCCGCCCG TCTGGATCAA CCTCGAATAC
CTGAGCGCCG AGGACTGGGT CGGCGAATTC CATCTGCGCC CGTCGCCGCA TCCGCGCTAC
CCGCTCACGA AGACGTTCTT CTTCCCTGGC CTCGGGCCCG GCACGGGCGG CGTGCTGAAG
GAGCGCGATC TCGACGCGCG CCGCGCCGCG TTCGAAACCG GCGACGATGC GCGCCGCACG
TGGTGGCAAA ACGTCGCGGG CGCGCCGATA CCCGCTCCGG ACACCACCGT CGTGTCGCTC
TTCGCGTACG AGAATCCGGC GCTCGACGCG CTGCTCGAAC AGTGGCGCGA CGGCCGCGAG
CCGGTCGCGC TGCTCGTGCC CGAAGGCAGG ATCTCGGCGC GCGTCGCGCG CTTCTTCGGG
GCCGGCGCGT TCGGCGCCGG CGCGCACGCG GCGCGCGGCA GCCTCGTCGC ACACGGTCTC
GCCTTCGTCG CGCAGCCCGA CTACGACCGG CTGCTGTGGG CGAGCGACGT GAACTTCGTG
CGCGGCGAGG ATTCGTTCGT CCGCGCGCAA TGGGCGCGCC GGCCGTTCGT CTGGCAGATC
TATCCGCAGG CCGACGACGC GCATCTGCCG AAGCTCGACG CGGCGCTCGC GCACGTCACC
GCACGCGTCG ATCACGCGAC GCGCGCGGCG ACCGAGCGCT TCTGGCACGC CTGGAACGGC
GCGGGCACGC CCGATTGGAC CGATTTCTGG CGGCACCGCG CGGCGCTCGC CGCGCGCGCC
GCGAGTTGGG CGGACGAGCT CGCGGCCGTC GGCGACCTCG CCGGAAATCT GGCGAATTTT
GCAAAAACTC AGTTAAAATA A
 
Protein sequence
MTSSAPLPPP VPPPADTASP LQAASPVACD IFCAVVDNFG DIGVCWRLAR QLALEHGWQV 
RIFVDALATF ARLQPAALPD AARQTVDGIV VEHWRAPAHA GDTLEIADIV IEAFACELPG
AYVAAMARRA RPPVWINLEY LSAEDWVGEF HLRPSPHPRY PLTKTFFFPG LGPGTGGVLK
ERDLDARRAA FETGDDARRT WWQNVAGAPI PAPDTTVVSL FAYENPALDA LLEQWRDGRE
PVALLVPEGR ISARVARFFG AGAFGAGAHA ARGSLVAHGL AFVAQPDYDR LLWASDVNFV
RGEDSFVRAQ WARRPFVWQI YPQADDAHLP KLDAALAHVT ARVDHATRAA TERFWHAWNG
AGTPDWTDFW RHRAALAARA ASWADELAAV GDLAGNLANF AKTQLK