Gene BURPS1710b_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3669 
Symbol 
ID3689866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp4016861 
End bp4018249 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content60% 
IMG OID637730124 
Productprophage CP4-like integrase 
Protein accessionYP_335034 
Protein GI76810764 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.18314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCTCC GACGACCTCC AAAGTCGTCC GAAAACCCCG CTAGAGTCCT TGATTCTGCG 
GGGTTTTTTG TTGTCCAGCG CCCTCCGAAG CTCCGATGGA ATCGAATAAG GTCCAATAGG
CTTGCTGGTA TATTCGTTGG TATCGCGCAA ACGCCTGTTG GTACCAACGA CATGAACCTT
TCAGATCTCG CTGTCCGCAA CGCCAAGCCA CAAGACAAGC CGTACAAACT GACGGACGGC
GACGGCATGT TCTTGCTCGT GCAGCCGAAC GGCGGGAAGT ACTGGCGGCT CGCGTACCGC
TATCTGGGAA AGCAGAAAAC ATTGGCTTTG GGCGTGTACC CCGAGGTTAG TCTCGCCCTC
GCCCGGCAGC GACGAGCGGA AGCACGCGAG CAGTTGGCAA AGGGGGCGGA TCCGGGGGAA
ACCAAGAAGG CGGATAGGCG GGCCGCCCGT TTAGCCGCGG ACAATTCGTT CGAAGTCGTC
GCATTGGCTT GGCTGGAGGA GCGTCGACCG TATGTCGAGC CTCGGCAGCA TGAGCGCACG
CTGGTACGGT TGAAGAACGA TGTGTTCCCC TGGCTTGGCA AGCGCCCAAT TGCGGACATT
GATGCCCCGG AGATCCTGGC CGTGCTGAAG CGGATCGATA GCCGTGGTGC TCGGTTCACA
GCGCATCGCA TCCGCAGCGA GATTGGCCGG GTCTTCGTCT ATGGGATCAA GGAAGGGCAT
TGCAAGGCCA ACCCTGCGGC GAGTCTGGTG AAGGCCATCC CGCCGGCGCA GACGACGCAC
TTCGCGTCGA TTACCGAGCC CGCACAGGTT GGAGAAATGT TGCGGGCGTT CGACGGTTTT
TCCGGCACGT TCCCCGTGCT TTGTGCGCTC AAGCTCGCAC CGATGCTGTT CGTGCGCCCC
GGGGAGCTGC GCAAGGCGGA ATGGTCCCAC TTCGATCTCG AGAAAGCGGA GTGGCGCTAC
TTGGTCACGA AGACCAAGAC CGAGCATCTC GTTCCGCTGG CCACACAAGC CGTCAAGATT
CTGAAGGAGC TGCACGCCGT GACCGGGAGC GGACAGTACG TGTTTCCGGG CGCCCGTTCC
AAGCAGCGTC CGATGAGCGA TGCTGCAATC AACGCCGCGC TGCGTCGTTT GGGTTACGAC
ACGCGTACCG AAATCACCGG CCATGGATTC CGAGCGATGG CTCGGACGAT TTTGCACGAA
GAGTTGGAGC AGAAACCGGA AGTCATCGAG CATCAACTCG CGCACACCGT GCCGGACACG
CTGGGCAGAG CATACAACCG GACGAAGTTC ATCAAGGAGC GGTGCGCCAT GATGCAGAAG
TGGGCCGACT TTCTCGATCA ATTGAAGCTC GGCGCAAAGA TCATCCCGAT TGCCGCTGTT
GCCAGTTAG
 
Protein sequence
MLLRRPPKSS ENPARVLDSA GFFVVQRPPK LRWNRIRSNR LAGIFVGIAQ TPVGTNDMNL 
SDLAVRNAKP QDKPYKLTDG DGMFLLVQPN GGKYWRLAYR YLGKQKTLAL GVYPEVSLAL
ARQRRAEARE QLAKGADPGE TKKADRRAAR LAADNSFEVV ALAWLEERRP YVEPRQHERT
LVRLKNDVFP WLGKRPIADI DAPEILAVLK RIDSRGARFT AHRIRSEIGR VFVYGIKEGH
CKANPAASLV KAIPPAQTTH FASITEPAQV GEMLRAFDGF SGTFPVLCAL KLAPMLFVRP
GELRKAEWSH FDLEKAEWRY LVTKTKTEHL VPLATQAVKI LKELHAVTGS GQYVFPGARS
KQRPMSDAAI NAALRRLGYD TRTEITGHGF RAMARTILHE ELEQKPEVIE HQLAHTVPDT
LGRAYNRTKF IKERCAMMQK WADFLDQLKL GAKIIPIAAV AS