Gene BMASAVP1_A1101 details

Gene Information

Locus tag: BMASAVP1_A1101
Symbol:
ID: 4679541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008785
Strand:
Start bp: 1084179
End bp: 1086086
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 71%
IMG OID: 639845375
Product: hypothetical protein
Protein accession: YP_992441
Protein GI: 121601099
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
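
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1084179, 1086086)
print(length)                           # 1908
print(expected_protein_length(length))  # 635
```

Both values agree with the Gene Length and Protein Length fields listed above.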


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.210473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGGAA ATGCCTCGCC AGGCGGCCGG CGCACGACCG CTTCGGCGCA ACGCGCCGGT 
TCGTCCGACA ATCACTCGCG CTCGCCTTCG GTGGCGCTCG CCGCCGGCGC GCCGGGCACC
GTCGGAGCGG CCGGCGAGCG CGCGATCGGA TCGCGCGGCG CGGCGGCCAC CCGCACCGCG
CTCACCGGAT GGCGTGCGTG GTTCGTCGCC GCGGCCGTGC TCTGCGCGTA CCTGTTGCCG
GGCATCCTCG GCCACGATCC GTGGAAGCAG GACGAAACCT ACACGTTCGG CATCATCCAG
CACATGCTCG AAAGCGGCGA TTTTGTCGTG CCGACCAACG CAGGGCAGCC GTTCCTCGAA
AAGCCGCCGC TGTACGACTG GGTCGCCACC GGCTTCGCAT GGCTCTTTTC CCGCTTCCTG
CCGCTGCACG ACGCCGCCCG GCTTGCGAGC GCCCTCTTCG CCGCGCTCGC GTTCGGCTTC
ACGGCCCGCG CCGCGCGCGT CGCGACGGGC GCGGCGCGCT GGCTCGAGTT GCCGGTGATC
GGCACCGTCG CGCTGTGCGC AGGCTCGCTC GTCGTCATCA AGCATTCGCA CGACCTGATG
ACCGACGTCG CGCTGATGGC GGGCGCCGCG ATGGGCTTTT GCGGGCTGCT CGAACTCGTG
ATCCGGCACG CCGGCGACGC GTTCGGCGCG CGCGCCGAAC GCCCGCCCGC GAACCGCTTC
GCAGCCCCGC TCTTCGGCCT GGGCGTCGGC ATCGCGCTGA TGTCGAAGGG CCTGTTCGTG
CCGCTCGTGT TCGGCGCGAC GCTTGCCGCG ACGCTCGTCC TGTACCCCGC CTGCCGCAGC
CGCGCGTTCT TCCGCTCGCT CGCGATCGCC GCGCTCGTGT GCGCGCCGTT CGCACTGATC
TGGCCGACCG CGCTGTTCCT GCGCTCCGAA TCGCTGTTCC TCGTCTGGTT CTGGGAAAAC
AACGTCGGAC GCTTCTTCGG TTTCTCGGTG CCGACGCTCG GCGCCGAGAA CGACAAGCCG
CTCTTCATCT GGCGCGCGCT GCTCACGGTC GGCTTCCCGG TGGCTCCGCT CGCGCTCGTC
GCGCTCGCGC GCGGCCTCTG GCGCGACTGG CGCGCGCCGC GCGTCGCGCT GCCGCTCACG
TTCGCGGGCA TCGGGATGGT CGTGCTGCAC ATCTCCGCGA CGTCGCGCCA ACTGTACATC
CTGCCGTTCA TCGCGCCGCT CGCGCTCGTC GCCGCGCAGG CGATTCCACG CCTGCCGCAG
CGCCTGCATG CCGCGTGGGA CTATGCGAGC CGGCTGCTGT TCGGCGCCGC CGCGGCGCTC
GTGTGGATCG TCTGGTCACT GATGTCCGAT CACGGTGGCC CGCGCGCCGG CTTGCAATGG
CTCGGCCGCT GGCTGCCGCT CGACTGGACG ATGCCGATCG AGCCCGCGCT CGTGCTGTCC
GCGCTCGCGA TCACGATCGG CTGGGTCGGC CTGCTGCCTT CGCTGCGGCT CGCGGGCAAG
TGGCGCGGCG CGCTGTCGTG GGCGATGGGC GCGCTCGTCG CGTGGGGGCT CGTCTATACG
CTGCTGCTGC CGTGGCTCGA CGTCGCGAAG AGCTATCGTT CGGTGTTCGA AGATCTGAAT
CGCCGGCTCG CGCTCGAATG GAACGACGGC GACTGCATGG CGAGCGTCGA TCTCGGCGAG
TCGGAAGCGC CGATGCTCTA CTACTTCTCC GGCGTGCTGC ACCAGCCCGT CGCCCAGCCG
AACGCGAGCG CCTGCACGTG GCTCATCGTG CAGGGCACGC GCGCGAATCC GCCGGTGCTC
GACGCCGAAT GGAAACCGTT CTGGGCGGGT GCCCGGCCGG GCGACGAGCA GGAGATGCTG
CGCGTCTACG TGCGCACGCC GGCCGCGGCA CGCCCCGCCC ATCCGTGA
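
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before computing anything from it. The following is a minimal sketch of how the length and GC fraction might be computed from such a block. The helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGCAGGGAA ATGCCTCGCC AGGCGGCCGG CGCACGACCG CTTCGGCGCA ACGCGCCGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full sequence block to the same two functions gives values that can be compared against the Gene Length and GC content fields of the record.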
 
Protein sequence
MQGNASPGGR RTTASAQRAG SSDNHSRSPS VALAAGAPGT VGAAGERAIG SRGAAATRTA 
LTGWRAWFVA AAVLCAYLLP GILGHDPWKQ DETYTFGIIQ HMLESGDFVV PTNAGQPFLE
KPPLYDWVAT GFAWLFSRFL PLHDAARLAS ALFAALAFGF TARAARVATG AARWLELPVI
GTVALCAGSL VVIKHSHDLM TDVALMAGAA MGFCGLLELV IRHAGDAFGA RAERPPANRF
AAPLFGLGVG IALMSKGLFV PLVFGATLAA TLVLYPACRS RAFFRSLAIA ALVCAPFALI
WPTALFLRSE SLFLVWFWEN NVGRFFGFSV PTLGAENDKP LFIWRALLTV GFPVAPLALV
ALARGLWRDW RAPRVALPLT FAGIGMVVLH ISATSRQLYI LPFIAPLALV AAQAIPRLPQ
RLHAAWDYAS RLLFGAAAAL VWIVWSLMSD HGGPRAGLQW LGRWLPLDWT MPIEPALVLS
ALAITIGWVG LLPSLRLAGK WRGALSWAMG ALVAWGLVYT LLLPWLDVAK SYRSVFEDLN
RRLALEWNDG DCMASVDLGE SEAPMLYYFS GVLHQPVAQP NASACTWLIV QGTRANPPVL
DAEWKPFWAG ARPGDEQEML RVYVRTPAAA RPAHP