Gene BURPS668_A0129 details


Gene Information

Locus tag: BURPS668_A0129
Symbol:
ID: 4886633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 113552
End bp: 114940
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 60%
IMG OID: 640130070
Product: hypothetical protein
Protein accession: YP_001061135
Protein GI: 126442947
COG category:
COG ID:
TIGRFAM ID:
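
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is an illustrative check only, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 113552
END_BP = 114940


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1389 462
```

Both results match the Gene Length (1389 bp) and Protein Length (462 aa) fields.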


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCCAT CCGAGAACAA CGGCGCCTCC GCATTGCTCG CTCAGTCGGG CGGAAAATTC 
GTATTGGAAG TGCGAGGCCT AGGGCTTGTG ACAGGAAAGG AGACGAATGA AGAGATCTGG
AAAGCCATCG AGACCAAGGC AGACAACCAT TCGACATACA TGTCACAGAA TTCCGCAGAC
TACCCTGCCA ATGAAGATGA ACGGATGACC GAGGTAGGGT TGTCGACGCG GATCAGCTTC
AAATATGGCG CGCGCCACTC TGTTGAATAT TGGCCGGTCC CGGTCTTCAT CTGGGAACCA
CCTAAGGCAT CACATGTTAG TCGGCCAGCC AATGAGCTCT CCGGTATTCG GCAGGAGGCT
AGCCTGGGCG TCACGCTCTT GCTGTGGCAG GAAGACGCGA ACACGAACGA TGGTTCTACC
ATCATCGAGA AGCTGTTTGC GTTCTTCGAT GCGCATCCGG ACATACCCGA AGCGATTATC
GTCACTTTTG ACGGAGCTGC GACTCGCGAT CTCAATCAGA CGCCGGGCTA CGTGGATACC
TTCAAGCAAT CGAACATTCC GACAATGCCG GATAGCATGG TCACGCTGCT CGTGTCCCGA
TCGGATCGCG TCGACCGCCT GATTCGCCCC TACGCCGTCG AGCAGACGGA AAACGTCGAC
AAGAACACGA CCGAATACGA CATTACCAAG CTGTGGAACT TCTTCTGGGA GAAGAACAAC
GGAGAAGGGC CCGGCAGCTT TGAAGCGTAC TATCAAGAGC AGCAGAAGGC GGCGGGCATC
CAGCCGCGCG CTTTTCTCGG TTTCATGTCT GCGCAATGGT GGCAGACACA ACTCCCCGAT
TTCTGGAAAA CGATCAGCAA CAAAGGCCCG GGCGAGTTCA AACCCACGCC GTACATCCCC
GTGCGCTGGA CGACCTGGCA GGTGAGGCAA TTCGACAACG CGCCGCTGCT CGGCTACCTG
CATCGGCCCA TCGACGTGAA GCTCGCCGAT GCGCACGGCA AGCCGCTGAA GACCGCGCAG
CAGGCGCAAG CGCTCAAGGC GGGGTGGCAG CAAGCCGTCG ATACGCTGCC CACCGGCGAA
ACGCCGAAGC GGATCTTCTA TGACACGACG GGCGATCGCG CGTGGGTCGC GCCGATCAAC
CAGGCGCTCG CGCAAAGCGG GCCGTCCGCG CCGAGTCTCG ATGACGTGAA GGAAGGCTAC
GACATCGGCC GCCGGATCGG GAACACGGGC ATCAGCTCGC CGTTGGTGCA AATCGGACTG
GGCCTGATCG CGAGCTACCA CGAAGGCGGG GCCAGCGCCA CGATCCATCG CCGGCCGAAC
GGCACGGCGA CGATCGTGAT GGTGAGCCCG CCGACGCATA AGCAGCCTGA CGTCAATCCG
TTCCGGTAA
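
The GC content field of this record can in principle be recomputed from the gene sequence. The snippet below is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks like those shown here; `gc_content` is a hypothetical helper name, and the database's own rounding rule is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGGATCCAT CCGAGAACAA"
print(gc_content(first_blocks))  # 45.0
```

Passing the full gene sequence instead of `first_blocks` gives a value to compare against the GC content field.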
 
Protein sequence
MDPSENNGAS ALLAQSGGKF VLEVRGLGLV TGKETNEEIW KAIETKADNH STYMSQNSAD 
YPANEDERMT EVGLSTRISF KYGARHSVEY WPVPVFIWEP PKASHVSRPA NELSGIRQEA
SLGVTLLLWQ EDANTNDGST IIEKLFAFFD AHPDIPEAII VTFDGAATRD LNQTPGYVDT
FKQSNIPTMP DSMVTLLVSR SDRVDRLIRP YAVEQTENVD KNTTEYDITK LWNFFWEKNN
GEGPGSFEAY YQEQQKAAGI QPRAFLGFMS AQWWQTQLPD FWKTISNKGP GEFKPTPYIP
VRWTTWQVRQ FDNAPLLGYL HRPIDVKLAD AHGKPLKTAQ QAQALKAGWQ QAVDTLPTGE
TPKRIFYDTT GDRAWVAPIN QALAQSGPSA PSLDDVKEGY DIGRRIGNTG ISSPLVQIGL
GLIASYHEGG ASATIHRRPN GTATIVMVSP PTHKQPDVNP FR