Gene BURPS1106A_A3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3062 
Symbol 
ID4905834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2976128 
End bp2977285 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID640146165 
Producthypothetical protein 
Protein accessionYP_001077091 
Protein GI126457237 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGAC AATCCACCAC TGTTCCTTTC GAACTCTCCT CCGCCGAGCT CGCGCGCACG 
CGCGTCGGCA TCGTCGACGG CAAGCGTATC TCGCTCGGCG TGCAAGGCGA CGCGTTGCGC
GGCTTCGTGC TCGAACGGCG CTGCAAAAGC CCCGGCGAGC CGGTATCGAC GCAGCGCGTC
GGCCTGCGCG ACCCGGCGGC CGTCGCGGCG TTCGTCGAAC ACGACCCTTA TGTCGTGCAG
CTCGGAATCG ACTACCGCGC GTTGCTCGAC GTGCACCGCG CAGCGGACGA TGCGGGATCG
CACGGCGCAT TCGCGGTGCA CGATGCACGG TATGCGCGCC CGGCGAGCGA GGCGGGCGGT
GCATTCCGCC CAGCGGAGCA CGCCGGTGCC GCGCCGGCCG CCTCCGGTGT GCCCACCGCC
TCCGCCGCAT CCGTCGCGCA GCCGGAGTTC GCGGTCGAAT GCGAGCACGA CGGCGCGCTG
CTCGCGCTGA TGCGGCGCAT CTGCGCATCG TGCGGCGCGA CGCAGTGCTT CTATCACTGG
TTCGTCGTCG ACGAAGACAC GGGGGAGTTC ACGGCGCACG ATCTGCTGAT CGGCGGCGCG
CCCGCGTGGG CGCAGCGCTA TGTGCATCAG CACTGGTATC TGAACGATCC GGCCGTCGCG
CACGCGCGCG ACAACACGCA GCCGCTGCGC GGCTCGGCGC TCGCCGAATT GCGCTCCGAT
CACTGGCTGA ACCGCTACGC GCAGACGCAA GGGCTCGGCA GCAACGTGTT CTTTCCCGCG
CATCGCCGCG ACGTGTCGAC CTTCGGCTTG CTGCACGTTG CCGCGCCGCT GCCCGCGCCG
CACGGCGAGG ACGCGCTGTG GCGCAACCGG CGCGTGCTGC GCGGGCTCGC GAACGAGATG
CTCGAATGGC GCGTCGTGCG GCGGCGCCGC GAGCTCGCGC AGGAGCTGTC GCTCGCCGCG
CAGGATGTGC TCGCGCTGCG GCTCGTCGCG CGCGGCGGCG GCGCGCGCCA CGTCGCCGAG
GAACTGCGGC TCGACGAGCG CGCGGTCTAC CAGCTCTTCA CCGCGATCAA CCGCAAGATG
GACAGCAAGC ACATCAAGAG CAGCGCGACG AAAGCGAAGC GCCTGGGCCT GCTCGCCGAA
GGCTATATCT CGAAATGA
 
Protein sequence
MARQSTTVPF ELSSAELART RVGIVDGKRI SLGVQGDALR GFVLERRCKS PGEPVSTQRV 
GLRDPAAVAA FVEHDPYVVQ LGIDYRALLD VHRAADDAGS HGAFAVHDAR YARPASEAGG
AFRPAEHAGA APAASGVPTA SAASVAQPEF AVECEHDGAL LALMRRICAS CGATQCFYHW
FVVDEDTGEF TAHDLLIGGA PAWAQRYVHQ HWYLNDPAVA HARDNTQPLR GSALAELRSD
HWLNRYAQTQ GLGSNVFFPA HRRDVSTFGL LHVAAPLPAP HGEDALWRNR RVLRGLANEM
LEWRVVRRRR ELAQELSLAA QDVLALRLVA RGGGARHVAE ELRLDERAVY QLFTAINRKM
DSKHIKSSAT KAKRLGLLAE GYISK