Gene BURPS1106A_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_4001 
Symbol 
ID4899584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3903571 
End bp3904689 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID640137227 
Producthypothetical protein 
Protein accessionYP_001068220 
Protein GI126452219 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCGTC AGACGTTTCC GGGCCGCGCG CAGGCGCTGC GGCAGCGCTT GAGCGCGCTC 
GCGCCGGCGC TCGTCGCCGC CGCCGCGCTG GCGGCGGCCG GCCCCGCGCG CGCGGCGATG
AATTTTTGCG CCGCGCCGGC GCTGCAAAGC AGCGAGGCGA CGCATGCCGA ACCGGGCGTG
CAGGCGCTCA TCAAGAGCGT CGATGCGCAT CTGAACGATG AGCCGAAGGC GCTGCCGCGC
GTGCACACCG AGGGCACGCT GCCGCACGAG GGCATTTACG ACCAGAGCGC CGAGGCGCTC
AACGACATGG AGCTGATGCG CAACGCGGCG CTCGCGTGGC GCGTGACGAA CCAGAGCCGC
TATCTGGCGC TCGTCGACCG CTTTCTGTCG ACGTGGGTGA ACACTTACCG CCCGAGCTTC
AATCCGATCG ACGAAACGCG CTTCGAGAGC CTGATCCTCG CGTACGACAT GACGGCGAGC
GCGCTGCCCG TGAAGACGCG CAACGCGGCG GCCGCGTTCA TCGCGGCGCT CGGCAACGGC
TACGTGCAAC AGATCGATGC GCAGAAGCGC CCGCTCAAGG GCACGTGGCG CAACAACTGG
CAGAGCCACC GGATCAAGCT GATCGCGCTC GCCGCGTTCA CGCTCGGCGA CCGTAGGATG
ATGAACGCCG CGCAGCGGCT TTTCGTCGAG CATCTCGCCG ACAACATCGA GCCCGACGGC
ACGACGTACG ATTTTCTCGA GCGCGACGCG CTGCACTACG CGGTCTACGA TCTGCAGCCG
CTCGCGACGG CCGCGCTCGC CGCGCGGCGC TTCAACCGCA ACTGGCTGCG CGAGCGCGCG
CCGAACGGCG CGACGCTCGC CGCCGCGCTC GACTGGCTCG CGCCGTACGC GCGCGGCGAG
AAGACGCACG AGGAGTTCGT CCACTCGCCC GTGCCGTTCG ACGCGAAGCG CCGCGAGGCG
GGCCTGCCCG GCTATTCCGG CATGTGGGAG CCGAAGAACG CGACCGAGCT GTTCCATCTC
GCCGCGCGCC TGGACGGCCG CTACGCGGGC ATCGCCCAAC AACTGTCGCC GATGCCGCCG
GCGTGGCTGG CCGCGTGCCT GCCGCTGCCG GCGCGGTGA
 
Protein sequence
MVRQTFPGRA QALRQRLSAL APALVAAAAL AAAGPARAAM NFCAAPALQS SEATHAEPGV 
QALIKSVDAH LNDEPKALPR VHTEGTLPHE GIYDQSAEAL NDMELMRNAA LAWRVTNQSR
YLALVDRFLS TWVNTYRPSF NPIDETRFES LILAYDMTAS ALPVKTRNAA AAFIAALGNG
YVQQIDAQKR PLKGTWRNNW QSHRIKLIAL AAFTLGDRRM MNAAQRLFVE HLADNIEPDG
TTYDFLERDA LHYAVYDLQP LATAALAARR FNRNWLRERA PNGATLAAAL DWLAPYARGE
KTHEEFVHSP VPFDAKRREA GLPGYSGMWE PKNATELFHL AARLDGRYAG IAQQLSPMPP
AWLAACLPLP AR