Gene BURPS668_A0788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0788 
Symbol 
ID4887403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp757515 
End bp758453 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content64% 
IMG OID640130728 
ProductImpA-related N-terminal family protein 
Protein accessionYP_001061787 
Protein GI126445235 
COG category[S] Function unknown 
COG ID[COG3515] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03363] type VI secretion-associated protein, ImpA family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACG ATGCACTCCC GGGCCATTCG CCGGATTTGC TGGATTTCGA CGAAGACTTT 
ATCAAGATCG ACGCAGCCAT CTGCGAATAC GATTCCGTCG GTTACGCGCC GCAACGCAAA
GGAGAGAGTG CGTTCCAGTG GGCGTCCATC GAAACGGGAT GCCTCGCTTT GTTGAAGAAG
GCGAAGGATG TCCGGGTCGG CATTTGGCAT CTGCGCGCAT GCATCGCGCG GCGCGGGCTG
AGCGGGCTGG CAGACGGCGT TCGATCGCTG GCCGATCTGA TGAGTGCGCC CGTCGAGGAA
TTGCATCCTC GCGCGCTGCC TGACGAATCG CCCGGCGAAA CCCTCCTGAT TCATCTCGGC
TGGCTTGCGG GCCCGCAGTT TCTGCATCAG CTCGGCAGTT CCCGGTTCGA AGACCGGGAC
GCAACCCTCA ACGATCTGAT CGGCGGGCGC GCCGCGGCGA TCGTGGAGGA TCGCGACTAT
CGCCTTAGAG CGAATACTCT TGTACATGAC ATTCAGGATT CCTTATCGCG AATTCGGGAA
TCGATCGCCG CGGCCGAGCA AGAGCTCAAC GTCTCGCGCG CGCTCGACCT GTTGAGCGTC
GCGGCGTCTC GGCTCACGCA GGCGCAAGCC GGCGGCGCGG ACAGCGCAAG CGTCGAGAGC
GACGCGCCGG TCGATGCGCC GGCCGGCGCG TCCGCGCCCG GCGCGCAGCA GCCGATGGCG
GGGCCGGGCG GCGTGTTGAG GTCGAGACAA GAGGTCGGCG CGGCGCTCGA TCGGATCGTG
GAGTACTTCC GCGTGCACGA GCCGAGCCAT CCGGCGCCGA TCTTCCTGTC GCGGATTCAA
CGAATGCTGG GGGCGGGTTT CGAGGAAGTG ATGGCGGAGC TCTATCCCGA GGCGGCATCT
CTCGTGGCCC AACTGAGCCG GCCGCAGAGT TCCAAGTAA
 
Protein sequence
MNNDALPGHS PDLLDFDEDF IKIDAAICEY DSVGYAPQRK GESAFQWASI ETGCLALLKK 
AKDVRVGIWH LRACIARRGL SGLADGVRSL ADLMSAPVEE LHPRALPDES PGETLLIHLG
WLAGPQFLHQ LGSSRFEDRD ATLNDLIGGR AAAIVEDRDY RLRANTLVHD IQDSLSRIRE
SIAAAEQELN VSRALDLLSV AASRLTQAQA GGADSASVES DAPVDAPAGA SAPGAQQPMA
GPGGVLRSRQ EVGAALDRIV EYFRVHEPSH PAPIFLSRIQ RMLGAGFEEV MAELYPEAAS
LVAQLSRPQS SK