Gene BURPS668_A1694 details

Gene Information

Locus tag: BURPS668_A1694
Symbol:
ID: 4888183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1644438
End bp: 1646096
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 59%
IMG OID: 640131632
Product: hypothetical protein
Protein accession: YP_001062689
Protein GI: 126443423
COG category:
COG ID:
TIGRFAM ID:
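
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(1644438, 1646096)
    print(length)                     # 1659
    print(protein_length_aa(length))  # 552
```

Under these assumptions the record is internally consistent: 1646096 - 1644438 + 1 = 1659 bp, and 1659 / 3 - 1 = 552 aa. `gc_percent` can be applied to the gene sequence listed in the Sequence section to compare against the reported GC content.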


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCTC GTATCTGGCC GTTCGCGGCT GTATTCGTAT TGATCGCGCT GGGTTGGACG 
TTCGTCGTGA TGGCACAGCA TTATCACTAT TGGCAAGCAA CGGGACAGGA GATGCCGGGC
ATGGCGGCTA GTATTCGCAA CGGGGTATTG GTTGCACTGG GGGTTGTCGC AGCGGTCTTT
ACGGCGAGCT ATGTCTGGCT GCATCGAGCT CCGCATGAGT CGACGCCGGC AACTATGAGT
TCAGCATCTG CTGCATTGGA TACGCACGCA GCGACGAATT ACGGCAACAA TGGCGTGCCC
GCGTTACTCG GTCAGACCGG TGCGAATTAC GTGTTAGAGG TTCGTGGTTT GGGACTCGTC
ACGGGACATG AAACAAATGA AGAGATCTGG AAGGCTATAC AAGCCAAGGC AGACAATCAT
TCGACGTACA TGTCACAAAA TCCCGCTGAC TATCCGGCAA ACAAGGACGA GCGAATGACG
GATCTCGGCT TGTCGACTCG TATCAGCTTC AAATTTGGAG CGCGCAATTC CGTCGAGTAC
TGGCCGGTTC CCGTGTTCAT TTGGGAGCCG CCGAAGGACC GCCGTTTCGG TCGCCCCGGA
GCCGCACTTG ATGGAATTCG GCAAGAGGCG AGCTTGGGCG TAACGCTGTT GTTGTGGCAG
GAGGATGCGA ATACGGACGA TGGCGCCAGC ATCATCGAAA AGCTGTTTGC GTTCTTCGAC
TCGCATCCGG ATGTGCCGGA GGCCGTCATC TACACGTTGG ATGGATCGAT GAAGCGCTGG
CTCAATGAAA CGCCCGGCTA CATCGATACG TTCGAACAGT CCAACATTCC TTCAATGCCG
GACAGCATGG TCGCGATGCT GGTGTCCCGG TCCGATCGAG TTGACCGACT GATTCGTCCG
TATGCCGTTG AGCAAACGGA GAATGTAAAC AACGGCACCA CCGACTACGA CATCACGCGA
CTGTGGAATT TTTTCTGGAA AGTCAACCAG GATCGAGGGC CGGACAGCTT CACGGCGCAT
TACGAAGCGG ACGAAAAGCG AGCTGGAGTG AATACTCCGA TGTCGGCCGG TTTTGTAACG
TCCGCTTGGT GGCAAACCAA ACTCCCCGAT TTCTGGAAAA CGATCAGCAA CAAAGGCCCG
GGCGAGTTCA AACCCACGCC GTACATCCCG GTGCGCTGGA CGACCTGGCA GGTGAGGCAA
TTCGACAACG CGCCGCTGCT CGGCTACCTG CATCGGCCCA TCGACGTGAA GCTCGCCGAT
GCGCACGGCA AGCCGCTGAA GACCGCGCAG CAGGCGCAAG CGCTCAAGGC GGGGTGGCAG
CAAGCCGTCG ATACGCTGCC CACCGGCGAA ACGCCGAAGC GGATCTTCTA TGACACGACG
GGCGATCGCG CGTGGGTCGC GCCGATCAAC CAGGCGCTCG CGCAAAGCGG GCTGTCCGCG
CCGAGTCTCG ATGACGTGAA GGAAGGCTAC GACATCGGCC GCCGGATCGG GAACACGGGC
ATCAGCTCGC CGTTGGTGCA AATCGGACTG GGCCTGATCG CGAGCTACCA CGAAGGCGGG
GCCAGCGCCA CGATCCATCG CCGGCCGAAC GGCACGGCGA CGATCGTGAT GGTGAGCCCG
CCGACGCATA AGCAGCCTGA CGTCAATCCG TTCCGGTAA
 
Protein sequence
MKPRIWPFAA VFVLIALGWT FVVMAQHYHY WQATGQEMPG MAASIRNGVL VALGVVAAVF 
TASYVWLHRA PHESTPATMS SASAALDTHA ATNYGNNGVP ALLGQTGANY VLEVRGLGLV
TGHETNEEIW KAIQAKADNH STYMSQNPAD YPANKDERMT DLGLSTRISF KFGARNSVEY
WPVPVFIWEP PKDRRFGRPG AALDGIRQEA SLGVTLLLWQ EDANTDDGAS IIEKLFAFFD
SHPDVPEAVI YTLDGSMKRW LNETPGYIDT FEQSNIPSMP DSMVAMLVSR SDRVDRLIRP
YAVEQTENVN NGTTDYDITR LWNFFWKVNQ DRGPDSFTAH YEADEKRAGV NTPMSAGFVT
SAWWQTKLPD FWKTISNKGP GEFKPTPYIP VRWTTWQVRQ FDNAPLLGYL HRPIDVKLAD
AHGKPLKTAQ QAQALKAGWQ QAVDTLPTGE TPKRIFYDTT GDRAWVAPIN QALAQSGLSA
PSLDDVKEGY DIGRRIGNTG ISSPLVQIGL GLIASYHEGG ASATIHRRPN GTATIVMVSP
PTHKQPDVNP FR