Gene BURPS1106A_A3085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3085 
Symbol 
ID4905939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2997866 
End bp2999677 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content75% 
IMG OID640146188 
Producthypothetical protein 
Protein accessionYP_001077114 
Protein GI126457946 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGG GCGCCGTCGC GGCGCGGCGG CTTCCTCGCC GCGGCACGGC GCGCGGCCGG 
CGCGCCCCGT CGCGCGCGTT CACGATCCAC GGAGGTGACG AGATGCGCCG TTCGCGCCCG
CCGAAGCCCG CGAAGCCCGC CGCCGTCCGG CGCCGCCCGC ACGGCGCGGC GCGCGCGCAC
GCCATTCGCG GCAACGGTCC GCAGCGGCGC GCCGTGCGCG AACGCGCGGC CCGCCGCCTG
TCGCGCGACA TCGACGCGGC GCTGCGCCGC GCGTCGACCT ACCCGCATCC GGCCGGCCCT
ATCGTGCGCA TCGAAACGCA CCTCTCCGTC GTTTATCTCG TCGGGCGCTT CGCGTACAAG
CGCCTGAAGC CGTTCGATTT CGGCTTCGCG AATTTCAGCG AACTCGCCGC GCGCCGCCGC
GCGTGCGAAG CGGAGCTCGC GCTGAACCGC CCGCTCGCCG CGCCGATCTA TCTCGCCGCC
GGCCCGCTCG TGCGCCGCGC GCGCGGCTTG CGCCTGTTCG GCGCGGGCGC GGCCGTCGAC
CACGTCGTCC GGATGCGCCG CTTCGACGAG CGGATGCTGT TCTCCCGGCT GCTCGCGCGC
GGCGCGCTCG ACGCGGCGGA CATCGATGCC GCCGCGACGC GCCTCGCCGC CTACCATCTG
CACGCGCCGC GCGACATCCC GCGGCGCGCG TACGGCAGCG CGCGCGAGCT GCGCCGGCAG
CTCGACGACA TGCTCGCGCC GCTCGCGCGC GCGCTCGGCG CGGCGCTGCC GGCGTCGCTG
CGCGCGTGGT GCGTGCGGCG CTGCGACGAG CTCGCCGCGC ACCTGGACGC CCGGCGAGCC
GACGGCTACG TCCGCGCATG CCACGGCGAT CTGCACCTGA ACAACGTCGT GAAGCGCGGC
CGCGACGCGC TGATGTTCGA CTGCATCGAT TTCGACGACG CGCTGCGCTG GATCGACGTG
ATCAACGATC TGTCGTTTCT GTTGATGGAT CTGCACGCGC ACGATCGCGC CGCCCTCGCG
CACCGCCTGC TGAACCGTTG GCTCGACGAA ACGGGCGATT TCGCCGGCCT CGCCGCGCTG
CCGCTGTATG TCGCGTATCG CGCGCTCGTG CGGGCGCTCG TCGCGACGAT GCGCGCGGGC
GGCGACGCCG CGGCGCGCGC CGCGCGCATC GAGCGCGCGC GCCGGTACGT CGACGTCGCC
GCGCACGCGG CCCGCGCGCG CCGCCCATGC CTGCTGCTGT GCCACGGCTA TTCGGGCTCG
GGCAAATCGG TCGCGAGCCG CGCGCTCGCC GACGTGTCCG GTGCGATCCG GCTGTCGAGC
GACAGCGAGC GCAAGCGCGC CCGACCGTTC GCGGCGGTCG ACGCGCGGCC GCTTCCCGCG
AGCGCGTACA CGCCGCAGCA GATCGACGCG CAATACGAGC GCCTGCGCGC GCTCGCGCGC
GACGTGCTGC GCGCCGGCTA CACGGCGCTC GTCGACGCGA CGTTTCTCTC GCATGCGCGC
CGCGCACGCT TCTTCGCGCT CGCGCGCGAG CTGGGCGTGC CCGTGTACGT GCTCGATTTC
CATGCGAGCC GCGCATGCCT CGAGCGGCGC GTCGATGCGC GCGCCGCCGC GCGCGACGAT
CGTTCGGACG CGGGCGCGGC CGTGCTCGCG ACGCAACGCG CGAGCGCCGA TCCGCTCGAT
GCCGACGAGC GCGCGCGCAC GATCGGCTTC GATACCGACG TGCCGCTCGC GACGCTCCGG
TCGGCCGGCT ATTGGCGGCC GGTGCTCGAC GCGCTCGACG CCGCGCGGGT GGACGCGCAA
GCGACGCGTT GA
 
Protein sequence
MTTGAVAARR LPRRGTARGR RAPSRAFTIH GGDEMRRSRP PKPAKPAAVR RRPHGAARAH 
AIRGNGPQRR AVRERAARRL SRDIDAALRR ASTYPHPAGP IVRIETHLSV VYLVGRFAYK
RLKPFDFGFA NFSELAARRR ACEAELALNR PLAAPIYLAA GPLVRRARGL RLFGAGAAVD
HVVRMRRFDE RMLFSRLLAR GALDAADIDA AATRLAAYHL HAPRDIPRRA YGSARELRRQ
LDDMLAPLAR ALGAALPASL RAWCVRRCDE LAAHLDARRA DGYVRACHGD LHLNNVVKRG
RDALMFDCID FDDALRWIDV INDLSFLLMD LHAHDRAALA HRLLNRWLDE TGDFAGLAAL
PLYVAYRALV RALVATMRAG GDAAARAARI ERARRYVDVA AHAARARRPC LLLCHGYSGS
GKSVASRALA DVSGAIRLSS DSERKRARPF AAVDARPLPA SAYTPQQIDA QYERLRALAR
DVLRAGYTAL VDATFLSHAR RARFFALARE LGVPVYVLDF HASRACLERR VDARAAARDD
RSDAGAAVLA TQRASADPLD ADERARTIGF DTDVPLATLR SAGYWRPVLD ALDAARVDAQ
ATR