Gene BURPS1710b_3665 details

Gene Information

Locus tag: BURPS1710b_3665
Symbol:
ID: 3691183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 4008333
End bp: 4010123
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 53%
IMG OID: 637730120
Product: hypothetical protein
Protein accession: YP_335030
Protein GI: 76810522
COG category: [S] Function unknown
COG ID: [COG1479] Uncharacterized conserved protein
TIGRFAM ID:
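
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4008333
end_bp = 4010123
protein_length_aa = 596

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop
# codon, which contributes no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, codons, expected_protein_length)
```

Under these assumptions the script reproduces the 1791 bp gene length and the 596 aa protein length listed in the record.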


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.135844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATCA CCCCGACTAC GCTTACGCTT AAGCAGTTTT TTTCGGTGAG CAACGAGCAA 
TTTCTCATCC CCGCTTATCA GCGCCGCTAC GCGTGGGGGC AGCGCCAGCA GCGTGAATTG
TTCGACGACC TGCGGCTACT TGCCTCCGGC GATACCCACC TTTTGGGCAC AGTACTTTTC
CTCTCCGACA CTCATAATCC CGTTATCAAC CAGCTTGAAC TGGTAGATGG CCAGCAACGT
GTTACGACCA TCACTATCTT GATGAGCGTA CTGGCTCGTC GATTCGAGCA AGAGCCGGGT
TATGAAAAGA CGGCGCAGAA AATCGCGGAG TTGCTGCAGT GTGAGGGCGT GAGCGGGCAA
GTCGTACCGA AGCTTCAGCT GGGCGATTTG GATCATGGCG ATTACGAGCG CATCATGAAT
GGAGGCGATA TTTCGGAGGT GGCGAATGGT TGTCTGAAAG GGGCTTGCGA GTATTTCACT
GAGTGGGTCG AGCAGTTGTC CGTGAACGAG CTCAATGTGT TCTTCCACAA GCTCATGAAT
AGCGCATCAA TCATCCGATT GGACGTGGGT GCAGCTAAAG ATGCCTATAA GCTATTCGAA
ACGATCAACA ACCGGGGGTT GCGACTCAAA CCAACGGACA TCATCAAGAA CTTTCTTCTG
GGGCATGCAT CGTCATTGCC CGCCGGCACA CTGGACAAAG TGAAGGGAGA TTGGCGCAAA
TTGATCGTGG CCCTTGACGG CTTGGATAGT GATGACTTTT TCCGCCAGTG GCTGGCGGGC
AAACTACACC GAAAGGTCAC CAAGAGCAAA CTGGTGGCCG ACTTCAAGGC CAATTACCTG
CGTCACGTAC AAGAGGCCGA GAGCATGACC GAATTCATGT CGTCGACCAT AAAAGATGAC
GAAGATGAGG AGGAGATCGA GGATGTGGCT ATCCTCGATG ACGAGGAGGA CGGCACGGAG
ACACTGGCAA AAGTCAGGAA AGTCAAGCTA ACGGCGTTTG CTACGGCACT GCGCCAGTCG
GCCGAGCTGT ACTCTAAGTT GCTGTGGGGA ACGACGACGT CGGCTAAGAT CAATCGGCAC
ATAGGCAACC TGTGGCGGAT AAAGGCGTTC TCTGCCTTTA CCTGGCTGCT GGATATGTTC
GGACGCAAGG ATTTGGACGA GAAAGCTCAA ATTCGATTGT TGAAGGCATT GGAGGCATTC
ATGATGCGCC GGCATATCTG CGAGAAGCGC ACCAACGAGC TGGAGACGAT TTTTGCAAAC
ATGACCAGCA TTGCCGATAG CGACTACGAG AAAGCCGTTA TTAAGATTCT GAGGGAGCAC
ACGCCCGACG ACGAGGAGTT CGAGTCTGCG TTCGCCTCGT TCCCTTTTGT ACCCGCGGTG
ATCGATCGCG CACGCTATGC ATTGGAGATG TTCGAGTACC GGGCTATCGG ACATAAGAAC
GAATACTACC TAGCCGATCC GGATGAGCTC GAGCTTGAGC ACATCATCCC GAAGGCGGCC
GACAAGGCCA GCACAAAAAA GGAGTTCGGT GACTGGCCAA GCTACTTGGG CGACGGTTGG
AAAGCCAAGC ACGCCAAAAT GCTTCATCGT ATCGGCAACA TGACCTTGCT GGCCGACGAA
CTGAACGTGG TGGCGTCAAA CAACCCCTTC CTATCCAAGC GCAAAGAGTA CGCTAGCTCC
AACATCCGCC TAACAAACGA TCTGTCGACG CTCAATCAAT TCAAGTTCAA GCAGGTGGAT
GACCGATCCA AGGAGTTCGC CAAATGGGCA GTGCAGATAT GGAGGGTCTA G
 
Protein sequence
MKITPTTLTL KQFFSVSNEQ FLIPAYQRRY AWGQRQQREL FDDLRLLASG DTHLLGTVLF 
LSDTHNPVIN QLELVDGQQR VTTITILMSV LARRFEQEPG YEKTAQKIAE LLQCEGVSGQ
VVPKLQLGDL DHGDYERIMN GGDISEVANG CLKGACEYFT EWVEQLSVNE LNVFFHKLMN
SASIIRLDVG AAKDAYKLFE TINNRGLRLK PTDIIKNFLL GHASSLPAGT LDKVKGDWRK
LIVALDGLDS DDFFRQWLAG KLHRKVTKSK LVADFKANYL RHVQEAESMT EFMSSTIKDD
EDEEEIEDVA ILDDEEDGTE TLAKVRKVKL TAFATALRQS AELYSKLLWG TTTSAKINRH
IGNLWRIKAF SAFTWLLDMF GRKDLDEKAQ IRLLKALEAF MMRRHICEKR TNELETIFAN
MTSIADSDYE KAVIKILREH TPDDEEFESA FASFPFVPAV IDRARYALEM FEYRAIGHKN
EYYLADPDEL ELEHIIPKAA DKASTKKEFG DWPSYLGDGW KAKHAKMLHR IGNMTLLADE
LNVVASNNPF LSKRKEYASS NIRLTNDLST LNQFKFKQVD DRSKEFAKWA VQIWRV
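
The relationship between the two sequences can be illustrated by translating the start of the gene sequence and comparing it with the start of the protein sequence. This is a minimal sketch: it uses only the first 60 nucleotides of the gene, builds the codon table from the standard amino acid assignments (which translation table 11 shares with table 1 for internal codons), and does not handle the alternative start codons that table 11 permits, since this gene begins with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used to block the sequence in groups of 10."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein sequence.
gene_prefix = clean("ATGAAGATCA CCCCGACTAC GCTTACGCTT AAGCAGTTTT TTTCGGTGAG CAACGAGCAA")
protein_prefix = clean("MKITPTTLTL KQFFSVSNEQ")

print(translate(gene_prefix))
print(translate(gene_prefix) == protein_prefix)
```

The same two helpers could be applied to the full gene sequence to check it against the full protein sequence.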