Gene BURPS668_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2110 
Symbol 
ID4885650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2098939 
End bp2101335 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content71% 
IMG OID640128038 
Productnon-ribosomal peptide synthetase module-like protein 
Protein accessionYP_001059145 
Protein GI284159951 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCATCC GGCCCGCACG GGCCGGATGC GATCGGCGAC TCAATATTCC GAGATCGCCA 
TGTATGCGCT CATCAGCATC TCGTCGCGCT CGGCCGCTTC GAGGTAAGCG TCCCAATGCG
GCGTTTGCGC GGCGTCGCGA TGCCAGAACT CGATCTCGCT GCGGCCGTGC TGCAGGCTGC
GCCACGCTTG CGTCAGCAGG CGCCGCTCCT CGTCGCCGAG CGAGCTCAGC TGCGAGACCG
GCACGTCGCC CCCGTCGACG ATCGCCCCGA TCAGGTTCGA CAGATGCGCC CACAGCAGGC
GGATTCCCCA AGGGCGTGAT GATCGAGCAC CGCAACGCCG TCTCCTTCAT CGACTGGGCC
CGCCGCGCCT TCCCCCCCGA GTCGTTCGAC GGCGTCCTCG CCTCCACCTC GGTCTGCTTC
GACCTGTCCG TCTTCGAGAT CTTCGCCACC CTCGCCGCCG CCGGTCGCAT CGTCCTCGTG
CGCGACGTCC TCGCCCTGCC CGAGCTCCCC GACGGACTCG TCCGCCTCGT CAACTCCGTC
CCCTCCGCCA TTCACGCGCT CTTGCAGACC GGCCGCCTCC CCGCCTCCGT GCGCACCGTC
AACCTCGCCG GCGAACCCCT GCGTCAAAGC CTCGTCGACG CCCTCTACGA CGCCGGCGTC
GAGCGCGTCT ACGACCTCTA CGGCCCCTCC GAGGACACCA CCTATTCGAC CTGCGCCCTG
CGCACCCCGC GCGGCCGACC CTCCATCGGC TCCCCCATCT CCAACACCCA GGCCTTCGTC
CTCTCCGCCA CCGGGCAACT GCAGCCCGTC GGCGTGCCCG GCGAGCTCTT CCTCGGCGGC
GCCGGCCTCG CGCGCGGCTA CCTCGGCCGG CCCGAGCTCA CCGCCGAGCG CTTCGTCGAC
AACCCCGTCC TCGACTCCCC CGTGCGCCGC CTCTACCGCA CCGGCGACCT CGTGCGATGG
CTCCCCGACG GACAGCTCGA ATTCCTCGGC CGACTCGATC ATCAGGTCAA GATCCGCGGC
TTTCGCATCG AGCTCGGCGA GATCGACGCG CGCCTCGGCG CCTGCGACGG CGTTCGCGAA
GCCGCCGTCA TCGCCCTCGA GCACGCCGGC GATGCCCAGC TCGTCGCCTA TGTCGTCCCG
CACGCCCCCC AGGCCGCCTC CGCCGCCAAC CTGCGCGCCG CGCTCGCCGC CTTCCTCCCC
GCCTACATGA TCCCCGCCGC GTTCGTCTTC CTCGACGCCC TTCCCCTCAC CCCCAACGGC
AAGCTCGACC GCAAGCGGCT TCCCGTGCCT GACGATGCGC GCGTGTCGTC GCGCGACAGC
GATCCGCCTC GCGGCCCGAC CGAAACGGCC GTCGCCGCCG TGTGGCAGAC GTTGCTCGAT
TACGCGCCCG TCGGCCGGCA CGATCACTTC TTCGAAATGG GCGGCCATTC GCTGACGGCG
CTCAAGCTGC TCGACCACCT GGCGAAGCGC TTCGCCGTGC CGTTGACGGC GGCGATGCTG
TTTCGCAGCC CGAGCATCGA GCAGCTCGCG CGCGAAATCG ACGCCGCCCG CACCGGGCAC
GACGTCGAGC CGCCCGTCGA GCGGTTCCGC GACGGCGCCG CCGCCGTCGC GCCGCTGCTG
CTCGTGCCGC CCATCGGGGG CTCGTCGCTC TGCTACGGCG ACCTCGTCAA CGCGCTCGAC
TATCCCGGCG TCGTCTGGGG CTGTCAGCAG ACGCGCGAGA TCGTCGCCGC GGAAACGACG
GGCAGCGCGG CCGGGCTCGC CGCGCTCTAC GCGCGCGCAT GGCTCGAGCG CGCCGAGCAC
GCGGAGGTTT GCCTGCTCGG CTGGTCGTTC GGCGGCGTCG TCGGCTTCGA GATGGCGGGC
GAACTCGAGA AGCACGGCGT GAAGGTGCGT TGGCTCGGAC TCATCGACAC GCACCTGTCC
GCGCCCGGCG GCGAAACGCT CGGCCGCCAG GCGCTCGCGA CCTTCGCGCT CGATCTCGGC
TTCGCCGCCG ACGAGCTCGC GCAATGGAAG CATCTCGTGC CCGACGGCGA CATCGACGGC
GAAGAAAGCG ACGCGCTGCG GCACCTGTGG ACGATCGGCC GTGACACCGG CCGCCTGCCG
GCCGCGATCA CGCTCGACGA GCTGACCGAA CGCTACCGGA TCACGGCGGC GAACCTGCGC
CGGCTCGCCG GCTACCTGCC GCGGCCCGCG TGGCAAGGCC CCGCCGGCTA CTTTCTGGCC
GCGCGGGACG CCCAAGCGGC AGCCGCGCGC CGATCCGCCG ACGTCTGGCG CACGCGGCTG
CCCGAGCTCG CCGTCACCGA CGTCGACGCG GACCATTTTT CCATCGTCAA GAGCCGGCAC
GCGCAAGCGA TCGCGCGGCT CGTCACGCTA AAACTCGAGG AGCTGATTCC AGCATGA
 
Protein sequence
MRIRPARAGC DRRLNIPRSP CMRSSASRRA RPLRGKRPNA AFARRRDART RSRCGRAAGC 
ATLASAGAAP RRRASSAARP ARRPRRRSPR SGSTDAPTAG GFPKGVMIEH RNAVSFIDWA
RRAFPPESFD GVLASTSVCF DLSVFEIFAT LAAAGRIVLV RDVLALPELP DGLVRLVNSV
PSAIHALLQT GRLPASVRTV NLAGEPLRQS LVDALYDAGV ERVYDLYGPS EDTTYSTCAL
RTPRGRPSIG SPISNTQAFV LSATGQLQPV GVPGELFLGG AGLARGYLGR PELTAERFVD
NPVLDSPVRR LYRTGDLVRW LPDGQLEFLG RLDHQVKIRG FRIELGEIDA RLGACDGVRE
AAVIALEHAG DAQLVAYVVP HAPQAASAAN LRAALAAFLP AYMIPAAFVF LDALPLTPNG
KLDRKRLPVP DDARVSSRDS DPPRGPTETA VAAVWQTLLD YAPVGRHDHF FEMGGHSLTA
LKLLDHLAKR FAVPLTAAML FRSPSIEQLA REIDAARTGH DVEPPVERFR DGAAAVAPLL
LVPPIGGSSL CYGDLVNALD YPGVVWGCQQ TREIVAAETT GSAAGLAALY ARAWLERAEH
AEVCLLGWSF GGVVGFEMAG ELEKHGVKVR WLGLIDTHLS APGGETLGRQ ALATFALDLG
FAADELAQWK HLVPDGDIDG EESDALRHLW TIGRDTGRLP AAITLDELTE RYRITAANLR
RLAGYLPRPA WQGPAGYFLA ARDAQAAAAR RSADVWRTRL PELAVTDVDA DHFSIVKSRH
AQAIARLVTL KLEELIPA