Gene BMASAVP1_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_1052 
Symbol 
ID4676920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp1060313 
End bp1062115 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content75% 
IMG OID639843571 
Producthypothetical protein 
Protein accessionYP_990651 
Protein GI121597291 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGG GCGCCGTCGC GGCGCGGCGG CTTCCTCGCC GCGGCACGGC GCGCGGCCGG 
CGCGCCCCGT CGCGCGCGTT CACGATCCAC GGAGGTGACG AGATGCGCCG TTCGCGCCCG
CCGAAGCCCG CGAAGCCCGC CGCCGTCCGG CGCCGCCCGC ACGGCGCGGC GCGCGCGCAC
GCCATTCGCG GCAACGGTCC GCAGCGGCGC GCCGTGCGCG AACGCGCGGC CCGCCGCCTG
TCGCGCGACA TCGACGCGGC GCTGCGCCGC GCGTCGACCT ACCCGCATCC GGCCGGCCCT
ATCGTGCGCA TCGAAACGCA CCTCTCCGTC GTTTATCTCG TCGGGCGCTT CGCGTACAAG
CGCCTGAAGC CGTTCGATTT CGGCTTCGCG AATTTCACCG AACTCGCCGC GCGCCGCCGC
GCGTGCGAAG CGGAGCTCGC GCTGAACCGC CCGCTCGCCG CGCCGATCTA TCTCGCCGCC
GGCCCGCTCG TGCGCCGCGC GCGCGGCTTG CGCCTGTTCG GCGCGGGCGC GGCCGTCGAC
CACGTCGTCC GGATGCGCCG CTTCGACGAG CGGATGCTGT TCTCCCGGCT GCTCGCGCGC
GGCGCGCTCG ACGCGGCGGA CATCGATGCC GCCGCGACGC GCCTCGCCGC CTACCATCTG
CACGCGCCGC GCGACATCCC GCGGCGCGCG TACGGCAGCG CGCGCGAGCT GCGCCGGCAG
CTCGACGACA TGCTCGCGCC GCTCGAGCGC GCGCTCGGCG CGGCGCTGCC GGCGTCGCTG
CGCGCGTGGT GCGTGCGGCG CTGCGACGAG CTCGCCGCGC ACCTGGACGC CCGGCGAGCC
GACGGCTACG TCCGCGCATG CCACGGCGAT CTGCACCTGA ACAACGTCGT GAAGCGCGGC
CGCGACGCGC TGATGTTCGA CTGCATCGAT TTCGACGACG CGCTGCGCTG GATCGACGTG
ATCAACGATC TGTCGTTTCT GTTGATGGAT CTGCACGCGC ACGATCGCGC CGCCCTCGCG
CACCGCCTGC TGAACCGTTG GCTCGACGAA ACGGGCGATT TCGCCGGCCT CGCCGCGCTG
CCGCTGTATG TCGCGTATCG CGCGCTCGTG CGGGCGCTCG TCGCGACGAT GCGCGCGGGC
GGCGACGCCG CGGCGCGCGC CGCGCGCATC GAGCGCGCGC GCCGGTACGT CGACGTCGCC
GCGCACGCGG CCCGCGCGCG CCGCCCATGC CTGCTGCTGT GCCACGGCTA TTCGGGCTCG
GGCAAATCGG TTGCGAGCCG CGCGCTCGCC GACGTGTCCG GTGCGATCCG GCTGTCGAGC
GACAGCGAGC GCAAGCGCGC CCGACCGTTC GCGGCGGTCG ACGCGCGGCC GCTTCCCGCG
AGCGCGTACA CGCCGCAGCA GATCGACGCG CAATACGAGC GCCTGCGCGC GCTCGCGCGC
GACGTGCTGC GCGCCGGCTA CACGGCGCTC GTCGACGCGA CGTTTCTCTC GCATGCGCGC
CGCGCACGCT TCTTCGCGCT CGCGCGCGAG CTGGGCGTGC CCGTGTACGT GCTCGATTTC
CATGCGAGCC GCGCATGCCT CGAGCGGCGC GTCGATGCGC GCGCCGCCGC GCGCGACGAT
CGTTCGGACG CGGGCGCGGC CGTGCTCGCG ACGCAACGCG CGAGCGCCGA TCCGCTCGAT
GCCGACGAGC GCGCGCGCAC GATCGGCTTC GATACCGACG TGCCGCTCGC GACGCTCCGG
TCGGCCGGCT ATTGGCGGCC GGTGCTCGAC GCCGCGCGGG TGGACGCGCA AGCGACGCGT
TGA
 
Protein sequence
MTTGAVAARR LPRRGTARGR RAPSRAFTIH GGDEMRRSRP PKPAKPAAVR RRPHGAARAH 
AIRGNGPQRR AVRERAARRL SRDIDAALRR ASTYPHPAGP IVRIETHLSV VYLVGRFAYK
RLKPFDFGFA NFTELAARRR ACEAELALNR PLAAPIYLAA GPLVRRARGL RLFGAGAAVD
HVVRMRRFDE RMLFSRLLAR GALDAADIDA AATRLAAYHL HAPRDIPRRA YGSARELRRQ
LDDMLAPLER ALGAALPASL RAWCVRRCDE LAAHLDARRA DGYVRACHGD LHLNNVVKRG
RDALMFDCID FDDALRWIDV INDLSFLLMD LHAHDRAALA HRLLNRWLDE TGDFAGLAAL
PLYVAYRALV RALVATMRAG GDAAARAARI ERARRYVDVA AHAARARRPC LLLCHGYSGS
GKSVASRALA DVSGAIRLSS DSERKRARPF AAVDARPLPA SAYTPQQIDA QYERLRALAR
DVLRAGYTAL VDATFLSHAR RARFFALARE LGVPVYVLDF HASRACLERR VDARAAARDD
RSDAGAAVLA TQRASADPLD ADERARTIGF DTDVPLATLR SAGYWRPVLD AARVDAQATR