Gene BURPS668_3795 details


Gene Information

Locus tag: BURPS668_3795
Symbol:
ID: 4883479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3707956
End bp: 3709329
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 69%
IMG OID: 640129723
Product: hypothetical protein
Protein accession: YP_001060790
Protein GI: 126440400
COG category:
COG ID:
TIGRFAM ID:
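
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against one another. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length_bp(3707956, 3709329)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # prints "1374 457"
```

Under those assumptions the result agrees with the Gene Length (1374 bp) and Protein Length (457 aa) fields.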


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000690611
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATAG CCGTCGCCCC TTTGTTTGCC GCTTGTAGTG GAGGGGGCGG CGGCACCCCG 
GCCCCCATCG CCGTGCCGCA ATGCTCCGGT TCGAGCTGCG GCGTTCAGGG TCCGCCCAGC
TCGACGGCGG CCAACACTTC GCTGTGCCCC GCCGACGCGA ACATCGGCAG CAGCACCTAT
CTCGGCGGCG CCGGCGGCGG CGAGATCGTG AGCCTGAACA TCAACGCGAC CACGATGACG
TACACGCTCA AGTGGCTCGA GTCGCCGGTT CCGCTCGCCA CCGGCACCGT CACGCCGACC
CGCGCCGGCA CGACGATCAC GGGCAGCGTC GCGCATCCGC CCGCGGGCAC GCTGCCGACC
GCCGAGCAGA CGCGCTGCGC GTTCGTGCTG CTGCCCGGTA GCGGCACGGC GCCCGCGACG
AATTCGACGT ACTCGACCGC GGCCGACTTC AACCAGGCGA ACCCGCCGAT GATCCTGATC
GGCTTCGGCG TCGCGGGCGG CGGCATTCCG GGTGCGACGA TCCAGTACAG CGGCCTCACG
ATCATCCCGG GCGTGCTGCA GAACATCGGC CAGGTGCCGC AGCGCCATTT CGACTTCTAT
CCGTTCCTCG GCTTCGCGAA CACGACGACC GATCTGTCGA AGCTGCCGGG CACGTACAAC
GCCCTCGTCT ATCACACGGT GCCGTCGGGC AACTACGCGG CGAAGGCGAT CGCGTCGAAC
GAGACGTTCG ATGCGAACGG CGCGTGCACA TCGACGAGCG CATCGGGCTG CATGACGACC
GGCAATCCGT GGACGGCGAG CGGCAACGGC TACTTCAACA GCACGCAGGC GCCGCAGATC
CTGCCGCAGA CGCAGTTGCC GCTCATCGGC GCGACCGGCA AATCGGCCGT CGCGCACATG
GTGCTCGGCC AGTTGAACGG CGCGACCGTG CCTGTCGTCG TGCGCACGGG CAACGTGAAT
CTCGGCACGC CGCCGCTGCA CACCGATGCG CAGGTGGACG ACGAATCGGG CATCGCGGTG
CTCGGGCTCG CGCAGGCGAT CGCGTCGGGC GGCATCGACG GCGGCTACGC GGGCGCGGAC
TCGAACTTCA AGTACACGGC GACGGTGATC AAGGGCACGA CGGGCACGTT CGTGAACCCG
AGCACGCAGC AGGCCGAGAC GGGCTTCACG CTCGACTACG GCCAGTCGAC ACCGGGGCTG
CTCGGCGTCA CGACGACCGA CACGTCGGCG CCGGGCTTCG TGATCGCGAG CGGCGGGCTA
TATGCGGCGC TGGTCCAGGG CACCGTCAAC GGCGGCATCA CGCAGAGCTC GGCGATCGCC
GGCCAGACGC CGTCCGCGCC CTACTTCGGC GTAGGCGCGC AAGTCAGCAA GTAA
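
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and how the database itself computes or rounds its GC value is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First 10-base block of the gene sequence, used only as a small example.
first_block = "ATGGCCATAG"
print(f"{gc_fraction(first_block):.0%}")  # prints "50%"
```

Passing the full gene sequence text to `gc_fraction` would give the value to compare with the GC content field.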
 
Protein sequence
MAIAVAPLFA ACSGGGGGTP APIAVPQCSG SSCGVQGPPS STAANTSLCP ADANIGSSTY 
LGGAGGGEIV SLNINATTMT YTLKWLESPV PLATGTVTPT RAGTTITGSV AHPPAGTLPT
AEQTRCAFVL LPGSGTAPAT NSTYSTAADF NQANPPMILI GFGVAGGGIP GATIQYSGLT
IIPGVLQNIG QVPQRHFDFY PFLGFANTTT DLSKLPGTYN ALVYHTVPSG NYAAKAIASN
ETFDANGACT STSASGCMTT GNPWTASGNG YFNSTQAPQI LPQTQLPLIG ATGKSAVAHM
VLGQLNGATV PVVVRTGNVN LGTPPLHTDA QVDDESGIAV LGLAQAIASG GIDGGYAGAD
SNFKYTATVI KGTTGTFVNP STQQAETGFT LDYGQSTPGL LGVTTTDTSA PGFVIASGGL
YAALVQGTVN GGITQSSAIA GQTPSAPYFG VGAQVSK
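
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. It does not handle alternative start codons (which would be read as M in first position) or ambiguous bases. All names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; each starts on a codon boundary.
first_line = "ATGGCCATAG CCGTCGCCCC TTTGTTTGCC GCTTGTAGTG GAGGGGGCGG CGGCACCCCG"
last_line = "GGCCAGACGC CGTCCGCGCC CTACTTCGGC GTAGGCGCGC AAGTCAGCAA GTAA"
print(translate(first_line))  # prints "MAIAVAPLFAACSGGGGGTP"
print(translate(last_line))   # prints "GQTPSAPYFGVGAQVSK"
```

The two outputs match the beginning and the end of the protein sequence listed above.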