Gene BURPS668_A2042 details


Gene Information

Locus tag: BURPS668_A2042
Symbol:
ID: 4888298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1974650
End bp: 1975747
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 68%
IMG OID: 640131980
Product: putative cell surface protein
Protein accession: YP_001063037
Protein GI: 284159993
COG category:
COG ID:
TIGRFAM ID:
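
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the 1098 bp figure, and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1974650, 1975747)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both values agree with the Gene Length and Protein Length fields in the record.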


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCGTCG CGTCGGCGAG CGTCGGCGGC CTCACGTTCG GCGGCTTCGC GGGCAGCGCG 
CCCATCGGCG TGTTCAGCGT CGGCGCACCG GGCGCGGAAC GCCAGATCAC GAACGTCGCC
GCCGGCCGCA TCTCCGCGGC CAGCACCGAC GCCGTCAACG GCAGCCAGCT CTATGCGACC
AACAGCAATG TCGCGTCGCT GTCGACCGGT CTGAACGCGA CCAACAGCAA CCTCGCGTCG
CTGTCCACGT CCACCTCGAC CGCCGTCGGC TCGCTGTCCA CCGGCCTGTC CACGACCAAC
AGCACCGTCG CCTCGCTGTC CACGTCGACC TCGACCAGCA TCGGCTCGCT GTCCACCGGC
CTCTCGACCG CGAACAGCAA CCTCGCGTCG CTGTCCACGT CCACCTCGAC CGGCATCGGC
TCGCTGTCCA CCGGCCTCGC GACGACCAAC AGCAATGTCG CGTCGCTGTC GACGAGCGTG
ACCAACATCA ACACGCAGCT CACGTCGCTG TCGACGTCGA TCACGAACAA CGTGATCCGG
TCGCTGCCCG CGAGCACCGG CGTCGCCGCG GACATGAGCG CGCCGAAGGC GACCTCGCCG
TCCGTCACGG CCGGCTCGAA CTCGGTCGCG CTCGGCGCGG GCTCGAACGA CGGCGGTCGC
TCGAACGTCG TGTCGGTGGG CAGCGACACG CAGCAGCGCC AGATCACGAA CGTCGCGGCC
GGCACCGAGG GCACCGACGC GGTCAACGTC AACCAGTTGA ATACGCTGTC GACGTCGATG
TCGCAATCGC TGTCGAATCA GCAAACGCAG CTCAACAATC TCGGCTCGCA ACTGAACCAG
ACGCAGCAGC AACTGCAGCA GACCGACACG ATGGCCCGCC AGGGGATCGC GGCGGTCGCG
GCGATGGCGT CGATTCCGCA CATGGACCGC GACTCGAACT TCGCGATGGG CGTGGGCACC
TCTTCGTTCC TCGGCCAGAA GGCGATCGCG GTCGGCATGC AGGCGCGCAT CACCGAGAAC
CTGAAGGCGT CGCTGAACGG CGGCTTCGCC GGCAATCAGA AGGTCATCGG CGCGGGCATG
CTCTATCAGT GGAAGTAA
 
Protein sequence
MPVASASVGG LTFGGFAGSA PIGVFSVGAP GAERQITNVA AGRISAASTD AVNGSQLYAT 
NSNVASLSTG LNATNSNLAS LSTSTSTAVG SLSTGLSTTN STVASLSTST STSIGSLSTG
LSTANSNLAS LSTSTSTGIG SLSTGLATTN SNVASLSTSV TNINTQLTSL STSITNNVIR
SLPASTGVAA DMSAPKATSP SVTAGSNSVA LGAGSNDGGR SNVVSVGSDT QQRQITNVAA
GTEGTDAVNV NQLNTLSTSM SQSLSNQQTQ LNNLGSQLNQ TQQQLQQTDT MARQGIAAVA
AMASIPHMDR DSNFAMGVGT SSFLGQKAIA VGMQARITEN LKASLNGGFA GNQKVIGAGM
LYQWK