Gene BURPS1106A_A2124 details


Gene Information

Locus tag: BURPS1106A_A2124
Symbol:
ID: 4905227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2080243
End bp: 2081850
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 70%
IMG OID: 640145229
Product: AMP-binding domain-containing protein
Protein accession: YP_001076157
Protein GI: 126455926
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain; [TIGR03098] acyl-CoA ligase (AMP-forming), exosortase system type 1 associated
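
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS length that includes one stop codon. Both assumptions are inferred from the numbers in the record, which does not state them, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(2080243, 2081850)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the result is 1608 bp and 535 aa, matching the listed gene and protein lengths.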


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAATT TCTTCGATCT GGTCGAACAG GCGGCATGCC GCACGCCCGA TGCCGAGGCG 
CTCGTCGCCG GCGACGCTCG GCTCACCTAT CGCGCACTCG CGGCGCTCGG CCTGGCGTTC
GCCGAGCGGC TGCAAGCGCT CGAGATCGCC CAGGGCGAGC GGCTCGCGAT CTTTCTCGAC
AAGCGCATCG AGACGGTCGT CGCCATGCTC GGCGCCGCGG CGGCCGGCGT CGTGTTCGTG
CCGATCAACC CGGTCCTCAA GCCCGAACAG GTTCATCACA TCCTCGTCGA CAGCGGCGCG
CGCTGTCTCG TCACGTCGTC GCTGCGCGCG CGCATACTCG GCGACGGCGT GCTCGCGGGC
ATGCCGTACG TGATCCTGAC CGATGCGAAG ACGCTTGCAC CGGCGCCGGC TTCGCGCACC
ACGTTTCTGC GATGGATGCA GCCCGATGCG CCGCCCGAGT CCGTCGCGAA CCGGCACGCG
AGCGCCGCGC CCGAGCCGCG AGCGGACACG ATCGACGCCG ATCTCGCCGC ACTCCTTTAC
ACGTCCGGCT CGACGGGCCG CCCCAAGGGC GTGATGCTCA GCCACCGCAA TCTGCTCGAG
GGCGCGTGGA GCGTCGCGCA ATACCTGCGC CATACCGCGC AGGATCGCAT TCTCGCCGCG
CTGCCGCTCA GCTTCGACGC CGGCTTGAGC CAGTTGACGA CCGCATGGGC CGCGGGCGCG
AGCGCCGTGC TCGTCAACTA CCTGATGCCC GCGGACGTCG TCGAGATCTG CGTGCGCGAA
CGCATCACCG GCTTCGCGGG CGTACCGCCG CTCTGGATTC AGCTCGCGCG CGCGGCGTGG
CCCGGCGAAG CGCGCGCGCG GCTGCGCTAT TTCGCGAACA CCGGCGGCCA TCTGCCGCGG
CCGGTGCTGC ACGCGCTGCG CGAACTGTTT CCGAGCGCGT CGCCGTATCT GATGTACGGG
CTGACCGAAG CGTTCCGCTC GACCTACCTC GATCCCGCCG AAGTCGACCG TCGCCCCGAT
TCGATCGGCA AGGCGGTGCC GAACGCCAGA ATTCTCGTCG TGCGCGAGGA CGGCGCGCCG
TGCGCGCCGA ACGAGGTCGG CGAGCTGGTC CACGTCGGCG CGTGCGTGAC GCTCGGCTAC
TGGAACGATC CGGCGCGCAC CGCGCTGCGC TACCGGCCTT CGCCCGAGGC GAAGCCGGGC
GGCGCGCCAC GTGAGACGGC CGTCTGGTCC GGTGATCTCG TGCGCCGCGA CGACGACGGA
TTCCTCTATT TCGTCGCCCG CAACGATGCG CAGATCAAGA GCTCCGGCTA CCGGATCAGC
CCGGAAGAGA TCGAGGAGGT CGCGCACGCG AGCGGGCTCG TGGCCGAAGC GGTCGCGCTC
GGCGTGCCGC ACGACGAACT CGGCGAATCG ATCACGCTCG TCGTCGTGCC GCTCGACGCC
GACACGTTCA GGCCCGACGC GCTGCGCGCG CGATGCGCAC AGCAACTTCC CCCGTACATG
GTGCCGCATA CGATCGCCAC GCGCACGTCG CTGCCGCGCA ATCCGAACGG GAAATTCGAT
CGCGTCGCGC TGCGCGCCGA CGCGGCGAAC CTCGTCGAAA CGCTCTGA
 
Protein sequence
MRNFFDLVEQ AACRTPDAEA LVAGDARLTY RALAALGLAF AERLQALEIA QGERLAIFLD 
KRIETVVAML GAAAAGVVFV PINPVLKPEQ VHHILVDSGA RCLVTSSLRA RILGDGVLAG
MPYVILTDAK TLAPAPASRT TFLRWMQPDA PPESVANRHA SAAPEPRADT IDADLAALLY
TSGSTGRPKG VMLSHRNLLE GAWSVAQYLR HTAQDRILAA LPLSFDAGLS QLTTAWAAGA
SAVLVNYLMP ADVVEICVRE RITGFAGVPP LWIQLARAAW PGEARARLRY FANTGGHLPR
PVLHALRELF PSASPYLMYG LTEAFRSTYL DPAEVDRRPD SIGKAVPNAR ILVVREDGAP
CAPNEVGELV HVGACVTLGY WNDPARTALR YRPSPEAKPG GAPRETAVWS GDLVRRDDDG
FLYFVARNDA QIKSSGYRIS PEEIEEVAHA SGLVAEAVAL GVPHDELGES ITLVVVPLDA
DTFRPDALRA RCAQQLPPYM VPHTIATRTS LPRNPNGKFD RVALRADAAN LVETL
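
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, and it does not handle alternative start codons (this gene starts with ATG, so none is needed). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order. Translation table 11 shares these
# assignments and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGTAATT TCTTCGATCT GGTCGAACAG"
print(translate(first_blocks))  # prints "MRNFFDLVEQ"
```

The first 30 bases translate to `MRNFFDLVEQ`, the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content reported in the record.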