Gene BURPS668_3493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3493 
Symbol 
ID4884453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3414673 
End bp3416388 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content67% 
IMG OID640129421 
Productintegrase 
Protein accessionYP_001060503 
Protein GI126441693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCCA TCGAAAACCG CTCTCGCATC CGCGTCACCG TCCGGAACCG CGACGACCTC 
ACGCGGACCT TCGCCCGCAA CGCCGACCAC GCCATCCAGC GCTACGTGCA GACGCTGCAG
TCGCAGGGGC TGAAGCCTCG GCTCGCGAGC CTCGATGACC ACTATGTCGT TCGCACGCGC
AGCGTCGCGC ACAAGAACCA GACGTTGAGA GCCAGTAGCG AAGCGGAAGC CATCAAAATC
AAGGAGCGCA TCGAGCGCGA GCAGCGTGAC GGGCTGTTCA TCGACTATGC GAAGGGGCAG
CAGACGACCC TCGCCGACCT TCTGATTCGC TACCTGCGCG ACGAGGCGCC GCGCGACAAA
AGCTTCGAAG TCCTCGGCTA CAAAATCAAT GCGTGGCTGG AGGACGCCGG GTTGCCCCGG
CAGGACCTCG CCGAGATTCG CGACGCGCAC CCCAATCCTT GCCCCAAGGT CACGGCCATG
AAAATCCGTA GGTCCACGGG TACGCGCGTC GGGCAGCCCT CGGAGACCGG CAAGTTCATC
CGCAAGCCGT TCGCCACCAT CGTCCCGGAC GACTTCGCGG ACTACATCGA CGAGCGCTGC
CAGGTGGTCG AGCCGAGCAC GGTCGACCGC GAAATCGACA TCTTCTCGGC GGTCTGCCAC
ATCGCCATCG ACACCTGGCG GATTCACGTC GCCAAGAACC CGATGGACGG TGTACGCCGG
CCCCGCTACT ACAACGAACG GGACCGCCGC CTGAAGGATG GCGAGGAGGC GCGCTTGCTG
GCCGCCGCCC GTGAGGAGGA CCGCGCGCAG TCCATCGCTC TGCGCCTCGA GCTGCTGATG
GCACCAGAGC GCGAGGACGC GAACAGCGCG ACCACCGTGT ACCGGCGCAA GCAGGTTATC
AAGGGCGCTC GGCAGCGCTA CCAGGCGGAA GCCGAGGAGA CGTACGAGCA CATTCCGTTG
CTGGAGACTT TCATCCACTT CCAGCTGATG ACCGGGGCCC GCCGAAGCGA GACGCTGACG
CTGACCTGGT CGAACGTGGA CCTGGACGGC CAGGCCGCGT TTCTGCCCGA AACCAAGAAT
GGCCGGCCCC GCACCTTGCC CCTTCGCAGC GACCTCGTCG AGCTGCTGCG CCAGCTGCCG
CGCACCGGCG AACTGGTGTT CCCAATTGGT GTCGACGGCC TTCGCAAGGC CTGGCAGCGC
ATTTGCATGG CCGCGGGCCT GGCTGGCGGC GCCGAGGTGC GCATCCATGA CCTTCGCCAT
GAAGCCATTT CCCGCGTCGC CGAGGCCGGC AGCCGCACGC CCGGCGGGTT CTCGCTGGTC
GACCTTCAGC ACTTCAGCGG CCACCGCGAT ACCCGCATGC TGCTGCGCTA TGCCCATCTG
TGCGCGGGAA GCTTCGCCAA GCGCTTGGAC GAGGCCTTCA GGGTCAACTC GCCCGACTCG
ACACTCCATC GCGGCCGGCT TCGGCTCAAG CAAGGCGCGT CCGTCTCGCT CAAGGAGGTG
GTGGATGACC ATGCGCGGGC ACCGTCAACC GCCCCGCTCA CCCACGGCGC CGCGCCGTCA
GCGCACGTCA CCAACGCGAC GCAGTCTTCA AGAAGCAGCC ATGGGAGCCT CGACGCAAGC
GCCGGTCAGC AAACCGAGGA GCCGGCCTCA TCGCCGACGT CAACCGGCGC GGCCGGGAAC
GTCATCCGTG TCGACTTTGC CCGCCGGGTC GCGTGA
 
Protein sequence
MASIENRSRI RVTVRNRDDL TRTFARNADH AIQRYVQTLQ SQGLKPRLAS LDDHYVVRTR 
SVAHKNQTLR ASSEAEAIKI KERIEREQRD GLFIDYAKGQ QTTLADLLIR YLRDEAPRDK
SFEVLGYKIN AWLEDAGLPR QDLAEIRDAH PNPCPKVTAM KIRRSTGTRV GQPSETGKFI
RKPFATIVPD DFADYIDERC QVVEPSTVDR EIDIFSAVCH IAIDTWRIHV AKNPMDGVRR
PRYYNERDRR LKDGEEARLL AAAREEDRAQ SIALRLELLM APEREDANSA TTVYRRKQVI
KGARQRYQAE AEETYEHIPL LETFIHFQLM TGARRSETLT LTWSNVDLDG QAAFLPETKN
GRPRTLPLRS DLVELLRQLP RTGELVFPIG VDGLRKAWQR ICMAAGLAGG AEVRIHDLRH
EAISRVAEAG SRTPGGFSLV DLQHFSGHRD TRMLLRYAHL CAGSFAKRLD EAFRVNSPDS
TLHRGRLRLK QGASVSLKEV VDDHARAPST APLTHGAAPS AHVTNATQSS RSSHGSLDAS
AGQQTEEPAS SPTSTGAAGN VIRVDFARRV A