Gene BURPS1106A_0414 details


Gene Information

Locus tag: BURPS1106A_0414
Symbol:
ID: 4900939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 377597
End bp: 378925
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 71%
IMG OID: 640133644
Product: hypothetical protein
Protein accession: YP_001064697
Protein GI: 126451752
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0412] Dienelactone hydrolase and related enzymes
TIGRFAM ID:
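
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `coding_length_aa` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 377597
END_BP = 378925
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values of 1329 bp and 442 aa.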


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTGCAAT TCGCGTGGGT GGGCTCGGGC ATGGCGTTCG GCAAGGTGAT GACGGGATGG 
ATCGCGGCAT GCGTGCTGTC GGCCGCTCAG GCCGATCCGC TGCCCTCCGA TGCGCCGTCG
GCGCCAGCCG CCGCCCCCTA CGCGCCGCGC GTATCGCACG CGCCCCGCGC AGCCGGCGCG
CTCGCGAACG CCGAGCGCTT CGACTACGGC GATTCGGGCC TGCCGCCCGT CGCCGCGAAT
CTGAACGAAA CGATCATCCG CATTCCGGTC GACGCCGCCG GCGCGATCAC GCTCGAGGCG
ACCGTATACA AGCCGGATGG CCCGGGCCCC TTCCCGCTCG TGGTCTTCAA CCACGGCAAG
AATCCCGGCG ATCTGCGCGC GCAGCCGCGC AGCCGGCCGC TGTCGTTCGC GCGCGAGTTC
GTGCGGCGCG GCTATGCGGT GGTCGCGCCG AACCGCGAGG GCTTCGCCGG CTCGGGCGGC
ACGTACATCC AGGAAGGCTG CGACGTCGAG CGCAACGGCG TCGCGCAGGC GCGCGACGTC
GCCGCGACGA TCGGCTACAT GTCGAAGCTG TCCTACGTCG ATGCGAGGCA CGTCGTCGTC
GCCGGCACGT CGCACGGCGG GCTCGTGTCG CTCGCGTACG GCACCGAGGC CGCGCGCGGC
GTGCGCGGAA TCATCAACTT CTCGGGCGGG CTGCGTCAGG ATCTCTGCGA AGGCTGGCAG
AAGAACCTCG TCGACGCGTT CGACACGTAC GGCTCGCGCA CGCACGTGCC GTCGCTCTGG
CTGTACGGCG AGAACGATTC GGTATGGTCG CCCGCGCTCG TCGCGCAACT GCGCGACGCG
TACATGTCGC ACGGCGCGAG CACGCTCTTC GTCGATTTCG GCCGCTACAA GGACGACGCG
CACCGGATCA TCGTCGATCG CGACGGCGTG CCGATCTGGT GGCCGCCCGT CGCGTCGTTC
CTCGCGCAAC TGAGCCTGCC CACCTCGGTC CGCTATGCGG TCGCGAATCC GCACGAGCCG
AAGGCGAGCG GCTATGCGGC GATCGAATCG GTCGATGCGG TGCCGTTCAT CGACGACGCC
GGCCGCGCCG CATATCGCCG CTTCCTCGCG CAGCATCCGA GCCGCGCGTT CGCGGTGTCG
AGCGAGGGCG CATGGTCGTG GGCCGAAGGC GGCGACGATC CGATGGCGCT CGCGCTCGAA
GGCTGCCGCA AGCAGGGCGC GGGGGCGTGC CAGCTATATG CGGTCGACGA GCGCGTCGTG
TGGCGCGACG CGGGCACGCA GACGGCGGAC GAATCGACGA GCGCGGCGCA CGCGCTCGCG
AGCCGCTGA
 
Protein sequence
MVQFAWVGSG MAFGKVMTGW IAACVLSAAQ ADPLPSDAPS APAAAPYAPR VSHAPRAAGA 
LANAERFDYG DSGLPPVAAN LNETIIRIPV DAAGAITLEA TVYKPDGPGP FPLVVFNHGK
NPGDLRAQPR SRPLSFAREF VRRGYAVVAP NREGFAGSGG TYIQEGCDVE RNGVAQARDV
AATIGYMSKL SYVDARHVVV AGTSHGGLVS LAYGTEAARG VRGIINFSGG LRQDLCEGWQ
KNLVDAFDTY GSRTHVPSLW LYGENDSVWS PALVAQLRDA YMSHGASTLF VDFGRYKDDA
HRIIVDRDGV PIWWPPVASF LAQLSLPTSV RYAVANPHEP KASGYAAIES VDAVPFIDDA
GRAAYRRFLA QHPSRAFAVS SEGAWSWAEG GDDPMALALE GCRKQGAGAC QLYAVDERVV
WRDAGTQTAD ESTSAAHALA SR
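
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "TTGGTGCAAT TCGCGTGGGT GGGCTCGGGC ATGGCGTTCG GCAAGGTGAT GACGGGATGG "
joined = join_blocks(first_line)

# Six blocks of ten bases give 60 bases; the first codon is TTG.
print(len(joined), joined[:3])
print(round(gc_fraction(joined), 2))
```

Passing the full gene sequence text through `join_blocks` and then `gc_fraction` gives a value that can be compared with the GC content listed in the record.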