Gene BURPS668_A0407 details


Gene Information

Locus tag: BURPS668_A0407
Symbol:
ID: 4886709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 372485
End bp: 373705
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 69%
IMG OID: 640130348
Product: hypothetical protein
Protein accession: YP_001061413
Protein GI: 126443337
COG category: [S] Function unknown
COG ID: [COG4461] Uncharacterized protein conserved in bacteria, putative lipoprotein
TIGRFAM ID:
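
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 372485
end_bp = 373705

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1221 406
```

Both values match the Gene Length and Protein Length fields.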


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCCGG TGAGTTTGTG GCCGCTGCCG CGTTCACCAT GGCTACCGCC GCGCCGCGCC 
GCGCCGCGCC GCGCCGCGCC ACGCTATGCC AAGCCAAGCC AAGCCAAGCC AAGCCGCCAA
TGTCGTCGCT GCTACCGCTA CTGCCACCGC CACGGCATCG ACATCGACAT CGACTCTGGC
AACGGCAACG GCAACGGCAA CGGCAACAGC AACAGCAACG GCAACAGCAA CGGCAACAGC
AACAGCAACA GCAACAGCAA CAGCAACAGC AACAGCAACG GCAACGGCAA CAGCAACAGC
AATAGTTCCC GCCACCGCTA TCATCCCCAC CTTCGCCGCA CACCATATGA ACGTTCATCG
AATTACCCAC TCGAAGTACC GCAAGGAGGA AGCATGTCGA CGAACATGAA ACGACTGATG
ACCGCGGCGC TCGGCGCCGC ATTGGCGTTC GGCGCGCTGT CCGCGCGCGC CGCGAGCTTC
GACTGCGCGC ACGCCGCGAA CGCCGCCGAG CGCGCGATCT GCGGGACGCC CGCGCTCGGC
GAGTTGGACG TTCGAATGGC CGCGTACTAC GAAATACTGC AGAACGCGCG GCCGGCCGAC
GAGGGCATGG CGTATCGCGA GTTCCGCGAC GCGCTGCGCG ACGAGCAGCA GCGCTGGCGG
CAGCGCACGC GCGATGCGTG CGGTGCGCGG ATCGATTGCC TGACGAACGC CTATACCGCG
CGGATCGCCG CGCTGCGCGG CGTCGCCGCC GAGCGGCTCG TGCTGCGGAT GACGGGTGGG
AGCGCGGCGT CGGCAGGCGC CGCCGACGCG ACGTACGCGA TCGAAGGCGA GTCGATCACG
CTCGCGAACG GCGAATCGGT GCGTCCGGCC GCGCCGGGCT CGGCGATGAA GCGCGTGACG
ACGCTCGTCG CGCGGAGCGC TGTGGCGACG ATCGCCGGCC GTCCGGTCGA GGCCGTCCTG
CTGAGCGACG ATCCGGGCGG CAGCGGCCGG TTCCTGTATG TGGCGACCGC GCAGCCGGGC
GGCGGCGCGC CGGCGGTGCT GCTCGGCGAT CGGGTAAAGC CCGTGTCGGT GTCGATCGAG
CGCGCGGCGA CGGGCGGCGC GGTTGTCGTC GTCGAATATC TGGATCGTCC GGAAGGCGCG
CCGTTCGCGC AGGCGCCGAC GATCAAGATC GTCCGGCGCT TCGCGCTGGA GCAAGGCCGG
CTCGTCGAGC AGCGCGGGTA G
 
Protein sequence
MFPVSLWPLP RSPWLPPRRA APRRAAPRYA KPSQAKPSRQ CRRCYRYCHR HGIDIDIDSG 
NGNGNGNGNS NSNGNSNGNS NSNSNSNSNS NSNGNGNSNS NSSRHRYHPH LRRTPYERSS
NYPLEVPQGG SMSTNMKRLM TAALGAALAF GALSARAASF DCAHAANAAE RAICGTPALG
ELDVRMAAYY EILQNARPAD EGMAYREFRD ALRDEQQRWR QRTRDACGAR IDCLTNAYTA
RIAALRGVAA ERLVLRMTGG SAASAGAADA TYAIEGESIT LANGESVRPA APGSAMKRVT
TLVARSAVAT IAGRPVEAVL LSDDPGGSGR FLYVATAQPG GGAPAVLLGD RVKPVSVSIE
RAATGGAVVV VEYLDRPEGA PFAQAPTIKI VRRFALEQGR LVEQRG
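
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper name `translate` is hypothetical, and only the first line of the gene sequence is translated here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 bases of the gene sequence, as printed above.
first_line = "ATGTTTCCGG TGAGTTTGTG GCCGCTGCCG CGTTCACCAT GGCTACCGCC GCGCCGCGCC"
print(translate(first_line))  # MFPVSLWPLPRSPWLPPRRA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should give the complete 406-residue protein.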