Gene BURPS1106A_A0101 details


Gene Information

Locus tag: BURPS1106A_A0101
Symbol:
ID: 4903578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 91141
End bp: 93381
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 64%
IMG OID: 640143208
Product: Type V secretory pathway, adhesin AidA
Protein accession: YP_001074144
Protein GI: 126457559
COG category:
COG ID:
TIGRFAM ID:
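
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 91141
end_bp = 93381

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2241 0 746
```

Under these assumptions the result agrees with the listed Gene Length (2241 bp) and Protein Length (746 aa).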


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGAA TCAACGATCA GTGTTTCACC GCCGCCCGCG GGGCGGGCGT GACGATGGCG 
AATCGGCCCG GCGACCGGCC CGTGCCGATT CCGCGCGATC TGCCGGGCGT GGTGATCTTC
ATCCACGGCG TCAACGATCC CGGGGCGGCT TATGCCACCG TGGAGCGCGG GCTGTGCCAG
GGGCTCAATG AGCGCTTGTC GCGAAGCGAT CTGCGGCCGG GGGCGTATGG CAGGCGGTAT
GCGGCTGCGT CGGCGGCGAA AGCCGAGGGC GATAGCGTTC GGTATGCGGA AGTGCTGGGC
GATCCGGACA TGTATCTGTA TCAACGTGCG GAGACCGGCG GTACTCACAG CATGTTCCTG
CCGTTCTACT GGGGCTATCG GGCGTCCGAT GACGAGATCG CGAAGATCAG CCATCCGGGC
GAGGTCAAGA GCCGCGTGGC GGACAGCGAC GGCAACCTGA TGACGCGCGG CCAGTATCAG
GACATCCACG GCAACCGGCT CGACGCCCAT TTCGGCAAGG GCGGCGGGTT TTTCGCCAAC
GCGACGAACA ACATCCCGCA GATGTACAGC CCGGGCTTCG AACCGGACAA GCTCGAGCGC
ACGGTCATGC AAAACGCGCT GGCCGGCAAT ACGATCTTCG CCGGCAAGTC GCCCGAGCGC
CGTTACTTCG TGCTGGCCGC GGCTCGCCTG GCGAATCTCA TCAAGACCAT TCGGACCATT
CAGCCCTCCG CGCTCGCGCT CGAACATGGA ATGGACCCTC AGCACGAAAC GATCACGGTG
ATGGGGCACA GCCAGGGCAC GATCATCACG CTGCTCGCGC AGGCGATGCT CAGGCAGCAA
GGGCAGCGCT GCGTCGATTG CATCATCATG GTCGATACGC CGTACAGCCT GCAGTTCACC
CAGGACGGCA GCCAGCAAAC CGGGCACGCG AAGCTCAAGA CGCTGGTCGA CATCGTCAAC
GCGGTGACGA GCGAGCCGCA TACGATTCCG GAGCTGGCCG AACTGATGAT CGATTCGGCC
CATTCGTGCG GCCGGGCCGG ACAGAACTGG AGCAAGACGC AGGGAAAGCG GCCGGACAAG
GGCGGCAAGC ACTGGATCAC GTTCGACGAG CGCGACAACC GCGGCAAGGT CTATCTGTAC
TTTTGCCCGG AAGATACCGT GGTCGGCCTC GACAAGGTGC GGGGGATCGG CACGTTCGGC
GTGCCCGACG AGGTGCCGGC CGATGGCGCG GCGGCCAGCC GGGGCAAAAC GATGCCCGCG
ATGACGGTGC TTGAGCCGAA GCGCTTCTTT CAGCGCATGT GGACGCGGCT CGAGCGCGAC
CAGGACGGCA GGGGGAAGCG CTCGAAGGTC GCCGTCGGCA CGCCGCCCGC GCGCGTGCCG
GTTCGCGACC CGTTTCAGCG ATTGACGCCG GGCCCCGACA CGGACGGCAC GATGCTTGGC
ACCTTGGTCG AATCCGGCAA GAACATGGCG CTGCAGGCAT CGTTCAAGCG CAACGACATT
CGCTTCATCA ACGGCGAGCA ACTGAAACCG GCGTACGAGC CCGATCTGTA CGGGGGCGAG
GTTCAGAAAG GCGGACAGGT GCCCGGGCAC GCCGATGTCG CCGGGTTGAT GCGTCCGGAC
GACGTGACGA AAAACGTCGC GCTCGGCAAC CAGTACGCGA AGTTCAAATG GAAAGACGTC
GCGACGACGG ACGATCCGGG CGCCGGCATC GAGCCGCACA AGCAGGCATT CAATCGCGGC
CGTCCGGTCG ACGAGCAGTC GCATAACTGG CGCATCGTGC CGAGCCGGTC GCTCGGCTCG
ATGCTGTCGG CGGCCGCCAC GGGCGGACGG TATCAGACGT ACGTGATCCA GCGCGAGGAG
ACGCCGGACG AGGTTCGCAA GCGCATGCGT ACCGATGCCG ATCAACTGGA AGCGAACAAC
TATCATTCGG GCGTACTGCT CAGTTCGGAG AACCATCGCT GGGTCACGGC GATGGATGTG
GCGATCGGTC AGGCGGTGAC GCTGGATGAT CCGGACTGGC GGCAGTTGTT GCTTTTGATG
GCGGATTGGA AGATGACGCC GGACGTGTAT CGCAATATCC AGAAATGCAA GAACTTTGAG
CGCTTGGACG AACACACGCG CGAATTCGTG AAAGCGTGTG TCGACTACTA CAAGACGGGC
CGATTTCCCG ACGAGAAGTA TGTACCTCTC ACCATGCCAC CGCTCGTGAC CAGCGAACTG
AAGGTTGAGA GCAAAACATG A
 
Protein sequence
MNGINDQCFT AARGAGVTMA NRPGDRPVPI PRDLPGVVIF IHGVNDPGAA YATVERGLCQ 
GLNERLSRSD LRPGAYGRRY AAASAAKAEG DSVRYAEVLG DPDMYLYQRA ETGGTHSMFL
PFYWGYRASD DEIAKISHPG EVKSRVADSD GNLMTRGQYQ DIHGNRLDAH FGKGGGFFAN
ATNNIPQMYS PGFEPDKLER TVMQNALAGN TIFAGKSPER RYFVLAAARL ANLIKTIRTI
QPSALALEHG MDPQHETITV MGHSQGTIIT LLAQAMLRQQ GQRCVDCIIM VDTPYSLQFT
QDGSQQTGHA KLKTLVDIVN AVTSEPHTIP ELAELMIDSA HSCGRAGQNW SKTQGKRPDK
GGKHWITFDE RDNRGKVYLY FCPEDTVVGL DKVRGIGTFG VPDEVPADGA AASRGKTMPA
MTVLEPKRFF QRMWTRLERD QDGRGKRSKV AVGTPPARVP VRDPFQRLTP GPDTDGTMLG
TLVESGKNMA LQASFKRNDI RFINGEQLKP AYEPDLYGGE VQKGGQVPGH ADVAGLMRPD
DVTKNVALGN QYAKFKWKDV ATTDDPGAGI EPHKQAFNRG RPVDEQSHNW RIVPSRSLGS
MLSAAATGGR YQTYVIQREE TPDEVRKRMR TDADQLEANN YHSGVLLSSE NHRWVTAMDV
AIGQAVTLDD PDWRQLLLLM ADWKMTPDVY RNIQKCKNFE RLDEHTREFV KACVDYYKTG
RFPDEKYVPL TMPPLVTSEL KVESKT
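
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence. This is a hedged sketch rather than the method used to produce the record: it assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino-acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAACGGAA TCAACGATCA GTGTTTCACC GCCGCCCGCG GGGCGGGCGT GACGATGGCG"
first_peptide = translate(first_row)
print(first_peptide)  # MNGINDQCFTAARGAGVTMA
```

The 20 residues printed match the first two blocks of the protein sequence (MNGINDQCFT AARGAGVTMA).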