Gene BURPS668_A0798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0798 
Symbol 
ID4887221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp772200 
End bp774416 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content69% 
IMG OID640130738 
Producthypothetical protein 
Protein accessionYP_001061797 
Protein GI126443461 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACATA TCAAACCACA AGCGGCCCTC GTGGCCACGA CCAACACGCA GATCGGCGCG 
CAGCCGATGC TCGGGATCAG CGTCGGGATC GGGTTCCGGC TCGATCAGCC GTCGATTCTC
GTGCACGAAG CCGCCGTCTG GGAGGCGTTG AAGGCGGCCG CGCCGTCACT GCCGCTGTAT
GAAGCCGCCT TGCCGAAGCA GCGCGCCGAA TGGCTGCTCG CCGGCCACTC CGTGCATGCG
GTGGGCGCCG GCGCCCGCGC GCGGGACGTC GACTGGACGG CGTGGGTCGA ACTCGACGGC
GTGCGCAAGG TCGTTTCGTG CGCGACGTCA CTGGGCGACG AACAGGCGCA GAGCGGTTAC
GCGCGCATCG CGGTCGATCA CCGGCACGCG GCGGCGGGCG GCGCGCGGGA GAACCCGTTC
GGCGTGGCGT CCGGCACGCC GCCGCTGCAG CAACTGCGCA CGTTCGGTGT CGGCCCCGCG
CCGCTTGCGG CGATGGGCGC GATCAACCCC GACTGGCCGG AGCGCGCGCA GTGGATGCCG
ACGCGGCCCG GCACGCTCGA CGCGATGGCG CAGGACGGCA CCCACATGGG CTGGCCCGCG
GACGTCGACC TGCGCTTCTT CCAGCAGGCC GCGCCCGACC AATGGGCACG CGGCGAATGC
TGGACGCCCG GCGCGCGTTT CGAGCTGAGC GGCTTCGGGC CGCGGGGCGA GGGCTTCGCG
GGCGAACTGC CGCGTCTCGC GCCGGTCGCG CTCGTGACGC GCAACGGCCG CCCGGGTATC
GAGCGGCTGT CGTTCAAGCA GCAGACGGCG TGGTTCCTGC CCGATCGCGG CATCGGCGTG
CTGTGGTGGA ACGGCGCGGT CGCGCTCGAT TTCCTGCTCG ACGACAGCCC GACGATGCTC
GTCACCGCAT TCAAGGACGA AGCCGAGCGG ATCGACATCG ACGCGCTGAT GAAGTTCGCC
GATCAGCGTG CCGACCTGAA CTGCACCGAT CCGCTTCAGC AGGCGGATCA CGAACTGATG
CCCGCGATTA CGAGGGGCTG GACCTGGGAG ATGATCCTCG ACACGGAAGA CCACCCGCGT
TTCGCTCCGG CGCCGCGCGG CTATGAAGAA GTCCGTGCGC GGGTCGAGCA GAATCGCCGC
GAGTTGGTCG AGGCGCGCGA TGCGAGCGAG CGGCTGTCGG CGTTCGAGGA AGCGAACCGC
AACGCGAAGC TGCCGGGCGC GCCGCGCGGC GGAGAGAACT GGCGCACGCG GCTGCGTCAG
GCGAAGACGC CCGAGCTCGC GAACGTGACG ATTCGCGACG CCGATCTGTC GTCGCTGCGC
TTTGACGGCT GGAAGTTCGA CGACGTGCGC TTCGAGCGCT GCACGCTCGA TCGCAGCGAA
TGGACGAACT GCCGGCTCAA TCAGGTGCAT GCGGTCGACT GCTCGTTCGC CGACGTCAAG
ATGAGTGACG GCTGGTGGAA GGGCGGCAAG ATCCAGCGCT GCAATCTCGA ACGCAGCGCG
TGGTTGAACG TCGAGATCGA GCGGATCTCG TTCGACGAAT GCCGGCTCGA CGATCTGAAG
GTGGCGGGCG GATCGTGGTC GATGCTGTCG GTGCAGGGCC GCGGCGGCGT GCGCGGCGAC
GTTCAGGACG TCCAATGGAA TTCGGTGTCG TGGTCCGAGG TGAGCGCGCC CGGCTGGACC
TGGACCCGCG TGCGCGCCGA CGATCTCGCG ATCGTCGAAT GCGCAATGGC GGGCCTCGCG
GTATCGCAGT GCACGCTCGC GAAGCCGAGC ATCCTGCTCA CCGACCTGTC CGCGAGCGTC
TGGCAGCGCA GCATGCTGAC GTTCGCGGTG CTGTCGCACG GCACGTCGAT CAACGGCGCG
CGGCTCACCG ATTGCGTGTT CAAGTCGTCG AGCCTGCAGG AGCTGCGTGC GGATCGGGTT
CAGGTCGATC ACTGCTCGTT CATGCAATTG AACGCGCAGC ATCTGCATGC GCAGCAGTCG
CATTGGAGCC GCACGGTGCT CGACGGCGCG AACGTGATGC ATGCGCAACT GACGGGCACG
TCGTTCGACC GCTGCTCGCT GAAGGAGGCG ATGTTCTATG GCGCCGACAT GCGGCAGACG
CGCATGCGCG ACTGCAATCT CGTCAGGGTC CGCACGTCGT GGATCCATCC GCCGGAAGCG
GGCGCGTGGC GCGGCAATCT GAGCGCCGGC CAGCTCGACG TGCCGAGGAG GGTGTGA
 
Protein sequence
MRHIKPQAAL VATTNTQIGA QPMLGISVGI GFRLDQPSIL VHEAAVWEAL KAAAPSLPLY 
EAALPKQRAE WLLAGHSVHA VGAGARARDV DWTAWVELDG VRKVVSCATS LGDEQAQSGY
ARIAVDHRHA AAGGARENPF GVASGTPPLQ QLRTFGVGPA PLAAMGAINP DWPERAQWMP
TRPGTLDAMA QDGTHMGWPA DVDLRFFQQA APDQWARGEC WTPGARFELS GFGPRGEGFA
GELPRLAPVA LVTRNGRPGI ERLSFKQQTA WFLPDRGIGV LWWNGAVALD FLLDDSPTML
VTAFKDEAER IDIDALMKFA DQRADLNCTD PLQQADHELM PAITRGWTWE MILDTEDHPR
FAPAPRGYEE VRARVEQNRR ELVEARDASE RLSAFEEANR NAKLPGAPRG GENWRTRLRQ
AKTPELANVT IRDADLSSLR FDGWKFDDVR FERCTLDRSE WTNCRLNQVH AVDCSFADVK
MSDGWWKGGK IQRCNLERSA WLNVEIERIS FDECRLDDLK VAGGSWSMLS VQGRGGVRGD
VQDVQWNSVS WSEVSAPGWT WTRVRADDLA IVECAMAGLA VSQCTLAKPS ILLTDLSASV
WQRSMLTFAV LSHGTSINGA RLTDCVFKSS SLQELRADRV QVDHCSFMQL NAQHLHAQQS
HWSRTVLDGA NVMHAQLTGT SFDRCSLKEA MFYGADMRQT RMRDCNLVRV RTSWIHPPEA
GAWRGNLSAG QLDVPRRV