Gene BURPS1106A_A2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2789 
Symbol 
ID4904416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2718741 
End bp2720594 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content70% 
IMG OID640145892 
ProductAMP-binding domain-containing protein 
Protein accessionYP_001076818 
Protein GI126457489 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.472303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACATC CGCGCGAAAC GACGGCCGCC GACACGTTCG CCGACTTCGT CGAACTCACG 
CGATATCGCG CCGCGCATCA GCCCGATCTG CCGGTCTACA CGTTCGTCAC CGGCGGCGAT
CGCGACGAAC GACCTCTCAC ATGCGCGCAG TTGGACAGGC GCGCAAGCGC CGTGGCGGCA
GCGCTATCGG AAATCGCCCG GCCCGGCGAG CGCGTGCTGC TGCTGTTTGC ACCCGGCATC
GACTATATCG CCGCGCTGTT CGGCTGCATG TACGCGGGCG TCGTGGCCGT GCCCGCGTAT
CCGGTCGAGC CCGCGCAGCC CGAGCGCACG CTCCGGCGGC TGCTCGGCAT CGTCGCGGAT
TGCGCGCCCG CCGCGGTGCT GTCGACGGCG GCCGTGCGCG ACGGCATGAA CCGCGTCGAA
ACCGGTTCGC CCGCGCTGCG CGGCCTGCGC TGGATCGAGA TCGACGCACT CTCGCCCGGC
GACGATGCCG CGCACGGCGC GCCGCGCGCG GCCGACCCGC GCGTGCCCGT CTACCTGCAA
TACACGTCGG GTTCGACCGG CGCGCCGAAG GGCGTGATGA TCAGCCACCG GAACCTGCTG
CACAACTCCG CGCTGATCGC GCGCCGCTTC GAGCACGGCG CGAGCAGCCG CGGCGTGATC
TGGCTGCCGC CGTATCACGA CATGGGTCTG ATCGGCGGCA TCCTGCAGCC GCTGTACGTC
GGCTTTCCTG TGACGCTGAT GTCGCACGTC GATTTTCTCA AGCACCCGCT GCGCTGGCTG
CGCGCGATCG GCGAGCGCCG CGCGACGACG AGCGGCGGGC CGAACTTCGC GTATCAGATG
CTCGCGACGA TGCGTATCGC CGACGCCGAT TTCGACAAGC TCGACCTGCG CTCGTGGGAC
GTCGCGTTCG TCGGCGCGGA GCCGATCCGG GCCGCCACGC TGCACGCGTT CGCGCAGCGC
TTCGCGCGCT GCGGCTTCGA TGCGCGCGCG TTCTATCCGT GCTACGGCCT CGCCGAACAC
ACGCTGTTCA TGACGGGCGG ACTGAAATCG CAGCCGCCCG TCGTCGCGAA CGAGCCAAGC
GACGCGCGGC TGCCGCGCGC ATCCGACCAC GCCGACGCCC CCGGCCAGGC CGACCCGGCC
GGCGACGGGC GGCAAGCAGG CGCGCGCGCC GCCGTCGGAT GCGGCGACGC GGCCAGCGAC
AGTTTGGTGC TGATCGTCGA TCCCGACACG CGCGTTCCGT GCGATGATGG CCGGGTCGGC
GAAATCTGGG CGCAAGGGCC GAGCGTCGCG CTCGGTTACT GGAACAATCG CGCGCTCAGC
GAGCAGACCT TCGAGGCCGA GCTGCCCGGC TACGCGGGGC GATTCCTGCG CACCGGCGAT
TACGGCTATC GGTCGGGCTC CGAAGTGTTC GTCACCGGGC GGCTGAAGGA CATGATGCTG
ATTCGCGGCG CGAATCATTA TCCGCACGAC GTCGAGGCGA CGATCGAGGC GCTCGACGCC
GAGCTGTTCC GCCCCGGCGG CTGCGCGGTG TTCGCGCTCG ATACCGGCGC GGCGCCGCAA
GTGACCGTCG TGCGCGAGTT GCGGGCGCGC TATTTGAAGG CATTCGGCGA CGGCGGCCAA
GAAGCCGGCC ACACGCCCGA CGCGCTGTTC GGCAGGCTGC GTCGGGCGAT CAACCTGCAT
CACGGCATTG CGGTACACCA TATCGTCTTC ACGTCGCCTT CTGCGATACC GAAGACGACG
AGCGGAAAGG TCCAGCGGCA CGCCTGTCGC GAACTGTTTC TCAACGACAC GCTGCCGGTG
GTCACCCAGT GGCGCGCGCC GTGCGGCGCG CCGAACGACA TCCGGAACAT CTGA
 
Protein sequence
MTHPRETTAA DTFADFVELT RYRAAHQPDL PVYTFVTGGD RDERPLTCAQ LDRRASAVAA 
ALSEIARPGE RVLLLFAPGI DYIAALFGCM YAGVVAVPAY PVEPAQPERT LRRLLGIVAD
CAPAAVLSTA AVRDGMNRVE TGSPALRGLR WIEIDALSPG DDAAHGAPRA ADPRVPVYLQ
YTSGSTGAPK GVMISHRNLL HNSALIARRF EHGASSRGVI WLPPYHDMGL IGGILQPLYV
GFPVTLMSHV DFLKHPLRWL RAIGERRATT SGGPNFAYQM LATMRIADAD FDKLDLRSWD
VAFVGAEPIR AATLHAFAQR FARCGFDARA FYPCYGLAEH TLFMTGGLKS QPPVVANEPS
DARLPRASDH ADAPGQADPA GDGRQAGARA AVGCGDAASD SLVLIVDPDT RVPCDDGRVG
EIWAQGPSVA LGYWNNRALS EQTFEAELPG YAGRFLRTGD YGYRSGSEVF VTGRLKDMML
IRGANHYPHD VEATIEALDA ELFRPGGCAV FALDTGAAPQ VTVVRELRAR YLKAFGDGGQ
EAGHTPDALF GRLRRAINLH HGIAVHHIVF TSPSAIPKTT SGKVQRHACR ELFLNDTLPV
VTQWRAPCGA PNDIRNI