Gene BURPS1106A_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2010 
Symbol 
ID4900051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1971924 
End bp1973801 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content71% 
IMG OID640135241 
Productputative non-ribosomal peptide synthase 
Protein accessionYP_001066276 
Protein GI126454659 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGTTT CGACCGATAC GTCCGCCGAA GCCCATTCAC CGGCGCCGAA TTCACTCGAT 
ATGTCAGACA GTGACGCACT GCGCCGGATT GCCGAAGCCG TCGACGGCGG TGCGGCGAAC
ATCGAGCGAA TCGTTCCGCT CGCCCGTGCG CGCGAGCGTA TGCCGACGCG GCCTCGGCTC
GAGCGGTGCG GCGGCGGGCG AGTGACGGCG GCGCACATCA CGCTCGACTC GCGTGCGCGT
CTCGATGCGT TGCTGCACGC ATTGCAACGC GCGATCGACC AGAACGCGGA CCTGCGAACG
TGCATTTTGG GGGCGTGCCT GCGGCGGCCG ATGCAAGTCA CGCTTCGCGA GGTTCGCCTG
CGAGTGCACG CCGCGACGCT CGACCCCGAC CTCGATCCCG CCGCGCAGTT GGCCGCGCTG
AGCACCGGGC CCGGCATGCG CATCGACATG CAACGCCCGC CGTGGGTGCT CGCGTGCATC
GCGCGCATTC CGGGCAGCGG GCAATGGCTG CTGCGGCTCG TGGCAGCCCC GATCGCGGCC
GGATTCGACG CGCTCGACGC GCTGCTTCGC GAGACGGTGA TTCACGGCGA CCGGGAGCCC
GGGCCGGCGC CGTTTCACTG GACTGTGGAA ACGGCTGTTG AATCGTGCGG AGGCGAACCT
GCGTCGTTGC CGACCGCGGG CGCGGTTTGG CCGTCGAACG ACGTATCACG CGCTTGCGAT
CCGGATGCCG CGTCGTGCGT CGAGGCGCGC ATCGCCGCGA TCGCGTCCGA TCTGCCGGGC
GTCGTGCATG GCGGACCACG AGACGATTTG CGCGCGCTCG GACGAACGCC GTTGCAGGCG
CTTCGACTCG CGCGCCGTAT CCGCGACGCA CTGGGCGTGA CCGTACCGGT CGAGTCGATC
CTCGCGAGTC CGACCATCGT CGAGCTTGCC GGGTACGTCG AGCAATTGCG CTCGCGGGAC
GTCCGCGACG GCGCTGCGCC CGTGTCGATC GGCGAAAAAC CGGCGGACGC GGATGCTCGG
GCACAGGCGC AGGCGGATAC GGATACGGCG CACACCGATT GCCTGATCGT CATTCAAGCA
GGCGGCGCCG AACAAGCGCC GGTGTTCTGC ATCCCGGGCG CGGGGGGCAG CGTCGCGTCG
TTCGTTGCGC TTGCGAGCAT GCTGCGCGCC GACATACCGG TATACGGCTT GCAGCCTCGC
GGGCTGGACG GCCTGGGGCC GCCGGACCGG TCCGTCGAAG CGGCTGCGCG CCGGTACGCG
CGGGCCATTC TGGATGCCGC CCCGCCCGGG CCGCCGCGCA TCGTCGGCCA CTCGTTCGGC
GGCTGGATCG CGCTCGAGAC AGCGCGGCTG CTGGACGGCA TGGGAGCGCG CTGCGCCCCG
CTCGTCCTGC TCGATTCGAA TCCGCCGCCC GCGTCACAGG CCTGGCGCGC GCCTTCCGAG
GCAGACATGC TGCGCACGCT CGTCGGCCTG CTCGAGCAGG CCGCGGGCGG CGCCCCATCC
GGGATCGGCG ACGAAGAAAT CGCCCGTTGC GCGGCAGCGG GCGAGGATGC GCGGGATGCG
CTCGTCCACG CCTGCATGGT GAGGACCGCC CTGCTGCCGC CGCGCGCGCC GGTCGAAGCG
GTGCGGCACC TGCGGCGGGT ATTCGAAGCC CATTCGAGCA CCCGCTACGC GCCGGGCGGC
CGATACGCGG GCGACGCAAC GGTGATCGTC GCCAACGGCG ATCGCGACGC GGGCGAGATG
GTGCCGGCGT TCGGATGGGC CGCGCTGATC GAGCGAGTCG AGGTGGCCGT GACGCCGGGC
AATCACATGA GCATGCTCGC GGCGCCGTAT GTTCGTCACG TCGCGCTGAC GATGAAGACG
GTATGGCGCA TGATCTGA
 
Protein sequence
MHVSTDTSAE AHSPAPNSLD MSDSDALRRI AEAVDGGAAN IERIVPLARA RERMPTRPRL 
ERCGGGRVTA AHITLDSRAR LDALLHALQR AIDQNADLRT CILGACLRRP MQVTLREVRL
RVHAATLDPD LDPAAQLAAL STGPGMRIDM QRPPWVLACI ARIPGSGQWL LRLVAAPIAA
GFDALDALLR ETVIHGDREP GPAPFHWTVE TAVESCGGEP ASLPTAGAVW PSNDVSRACD
PDAASCVEAR IAAIASDLPG VVHGGPRDDL RALGRTPLQA LRLARRIRDA LGVTVPVESI
LASPTIVELA GYVEQLRSRD VRDGAAPVSI GEKPADADAR AQAQADTDTA HTDCLIVIQA
GGAEQAPVFC IPGAGGSVAS FVALASMLRA DIPVYGLQPR GLDGLGPPDR SVEAAARRYA
RAILDAAPPG PPRIVGHSFG GWIALETARL LDGMGARCAP LVLLDSNPPP ASQAWRAPSE
ADMLRTLVGL LEQAAGGAPS GIGDEEIARC AAAGEDARDA LVHACMVRTA LLPPRAPVEA
VRHLRRVFEA HSSTRYAPGG RYAGDATVIV ANGDRDAGEM VPAFGWAALI ERVEVAVTPG
NHMSMLAAPY VRHVALTMKT VWRMI