Gene BURPS1106A_A2462 details

Gene Information

Locus tag: BURPS1106A_A2462
Symbol:
ID: 4904297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2426924
End bp: 2428513
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 65%
IMG OID: 640145566
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001076493
Protein GI: 126456487
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
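
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2426924, 2428513)
print(length)                     # 1590
print(protein_length_aa(length))  # 529
```

Both results agree with the Gene Length and Protein Length values listed in the record.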


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.229792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAGAC ACATCGTTCA AGCAATCCAG TCTCACGCGT TCGCGACGCC GTACAAATGC 
GCGTTGTCCG ATTCCCGCGG CGATCTCGGC TACGCCGATC TCGACAGCTT CAGCACGCGT
TTTGCGATGC GCCTGCAGGA TCTGGGCTGT CGCCCCGGCG ATCGCGTCGT GATGCTCGCG
AGCCGCCGCG CGCTGCTTGT CGCGGCGATC ATCGGCGTGT TCAAGGCAGG CTGCGTGCAT
GTGCCGCTCG ATCCGCGCAT GCCGGCCGAC CGTCTTCGCT ACATCCTGCA CGACGTCGCG
CCAACGCTCG TGATCGCCGA CGAGGATCTG ATCGACGCGA TCGAACACGC GCTACCGGCC
GCCGCGCCGA TCGTGCCCGT GACCGAACTC GAGCGGCTTC TAGACGACGC CGATTCGCCG
CGGCTCGACG CGCTCGTGCA GCCGTTACCG CTGCCGCCGC TCGACGAGCG AGCGATCGCG
TACTGCATCT ACACGTCGGG CTCGACCGGC CGCCCGAAAG GCGTGCTGAT CAATCATCGC
AGCATCGCGG ATTTCTTCGA GGGCACGCGC GCCGTCTACG ATGTCACGGC GCAATCGCGC
TGCGCGAGCT TCTCGCCGCT GAACTTCGAC GTGTACCTGA TGGACATGCT GTTTCCGCTC
GCGCAGGGCG CGTCGCTATA CGTGCACGAC GACGTGAACG CGCCCGATCT GCTGTTCGAC
GCGATCCGCG TGCACGACGT CACCCACTTT TCCGCGTGGG GAATGATGCT CGGCCTGATT
GCGCAAGCTG AGGAATTCGA ATCCGCGCCG CTGCCGCATC TGAAGACGAT CCTCACCGGC
ACCGACGTGC CCGACGTGAA GACGGTCCAG CGCTGGCTCA GGAAGAGCGC GGGCGTGCAG
GTGATCAACG CCTACGGGCC GACCGAAGCC ACCTGCGCGG CGACCGCGCA CGTGATCCGC
GAGATCGAGC CGGAGCGGCG CACGCTCTAC CCGATCGGCA AGCCGCTCGA GCACGTGCGG
GCGCTGCTCG TCGACGAGGG CGGCAACCGA ATCACGGCGC CGGGCGTGCC GGGCGAGTTG
ATGATCGGCG GCACGCAGGT AATGCAGGGC TACTGGAATC TGCCGGAAGA AACGGCGGCG
CGGCTCGTGC GTCTCGACGG CGTGCCGTTC TATCGAACGG GCGACGTCTG CGCGTATCTC
GCCGACGGCA GCCTCTACTA CATGGGCCGC AAGGATAACG AAGTGAAGAT CGGCGGCTAC
CGGATCCATT TGAGCGAAAT CCAGCGGGTC ATCAACAGCG TGCCGCACGT GTACGGATCG
GAGGTGGTGC TGCTCGAATC GCGCTACGGC GAGACGCTGC TCGCCGCCGG CGTGCTGCTC
GAACGCGGCG CGCCGCTCGA CGCCGATTGC AAGGCCGACG AAATCAGGCA GCGCCTCGCG
GCGGAGCTGC CCGCCTACAT GGTGCCCCGC CACGTCAAGG TTCTCGAGCA GTTTCCGCAG
TTGTCATCGG GAAAGACGGA TCGCAAAGCG CTTCTGTCGA TATTGCAACA GCGCATCAAC
GAAAGTAACC AGGAGGAAGT GAATTCATGA
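
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, of how one might strip the layout whitespace and estimate the GC fraction. It uses only the first line of the sequence as sample input; to compare against the GC content field, the full sequence would be passed in instead.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCCAGAC ACATCGTTCA AGCAATCCAG TCTCACGCGT TCGCGACGCC GTACAAATGC"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.0%}")
```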
 
Protein sequence
MARHIVQAIQ SHAFATPYKC ALSDSRGDLG YADLDSFSTR FAMRLQDLGC RPGDRVVMLA 
SRRALLVAAI IGVFKAGCVH VPLDPRMPAD RLRYILHDVA PTLVIADEDL IDAIEHALPA
AAPIVPVTEL ERLLDDADSP RLDALVQPLP LPPLDERAIA YCIYTSGSTG RPKGVLINHR
SIADFFEGTR AVYDVTAQSR CASFSPLNFD VYLMDMLFPL AQGASLYVHD DVNAPDLLFD
AIRVHDVTHF SAWGMMLGLI AQAEEFESAP LPHLKTILTG TDVPDVKTVQ RWLRKSAGVQ
VINAYGPTEA TCAATAHVIR EIEPERRTLY PIGKPLEHVR ALLVDEGGNR ITAPGVPGEL
MIGGTQVMQG YWNLPEETAA RLVRLDGVPF YRTGDVCAYL ADGSLYYMGR KDNEVKIGGY
RIHLSEIQRV INSVPHVYGS EVVLLESRYG ETLLAAGVLL ERGAPLDADC KADEIRQRLA
AELPAYMVPR HVKVLEQFPQ LSSGKTDRKA LLSILQQRIN ESNQEEVNS
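
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is an illustration rather than the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is adequate for this gene because it begins with ATG. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate a block-formatted CDS, dropping the trailing stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First and last 30 bases of the gene sequence in the record.
print(translate_cds("ATGGCCAGAC ACATCGTTCA AGCAATCCAG"))  # MARHIVQAIQ
print(translate_cds("GAAAGTAACC AGGAGGAAGT GAATTCATGA"))  # ESNQEEVNS
```

The two outputs match the first and last residues of the protein sequence shown above.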