Gene BURPS1106A_A0900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0900 
Symbol 
ID4904764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp883495 
End bp885432 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content57% 
IMG OID640144006 
Productcollagenase 
Protein accessionYP_001074936 
Protein GI126456492 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT CCCACAATGT CGTCAATCGC TTTATTGTCG CCGCTTCTAT TATCATTGGA 
GTCGTTCTTT ACAGTTCCGC TTGGGCAAAT CCGCAGCCCA TGCATACGAA GCAGGCACGT
ATGCCGCGTA TCCCGCAGAA TCTCCCGCTT TCACCAGACC AAGCCAAATA CGACCTGCCG
CTCAGCAAGT ATGACCGCGC AACGCTGATG GAGCCGTTGC GGCGGAAGCA ATCAGCGAAA
CCCGACAGGC GCACCCGGCC TGGAGCAGAT TGCCGCGACA TGTCAATAAT GACGCAATAT
CACGGTACGG CGCTTGCTGA TTACATAGCA AACCTCCCCG ATTATGAGTG CCACTACGGA
CTATTCTCGA TTGACAGGGC GATGGCCGCG CAGATTTTCA ATTCTGAAAA CGTGTGGGCT
GTTGCCAGCC GTCTCACTCA AGAAATCAAT CGTTACGACG CAACAAATAT TACATTGGTA
AATTTGCTTA TTTATCTGAG AGCCGCTTAT TTCCAATATG ACGCAGCCCA GCTTGCTGAT
CCGATTCCCG GTCTCGTAGT CTGGCTGCGT CCGTATATTT TGCAGAGCCT CTCTGGCGAC
GCGCTTTACC TCGAGAATTC ACGCGCGCCG AGCACCGCCA ACGAGCTGAT GATCCTAATC
ACAAACATGA AGGACGAGGC GTACTACCTG CCAACGCTGA AGGACCGAAT CGCGTTCTAC
ACCGCGAGCG CGACCAACCC TCAGGCTGCG GCGCCGCTAC TGCAGCGAAG CGCGGCGGGT
GGCTTCACCG GCTTGCTCAC GGTGTTCTTC TACGCGCATC AGCGCAGCGG CGCTCAGCCG
ATGCTCGATA GCGATGCGAC TCTGCCGGAG ACGCTCAACC GCTTCGTCAC GGCGAACCGC
GCATACCTGT CGAACACCAG TGCCGCCTAT CAGCTCGCCG ATGCGGCGCG CGAAACGTAC
CGCTTTCTCC GCTATCCGTC GCAGAAGCCG CGGGTGAAGA AAATGATTCA GGATATGCTC
GCGTCGACTA CCATGACGGG CCCGGACAAC GACCTGTGGC TCGCGGCAGC GGAAGCAGCC
GATTACGGCG ATCCCGGCAA CTGCGCAGAT TACGGCACGT GCGACTATCA GAAGCGGCTC
ATCGAGGCAG TGCTCACGCA TCGGTACTCA TGCAATGCGA ACGTACGAAT TCTCGCGCAG
GACATGACGG TGCCGCAATT CCAGTCGGCA TGCCAATCGG TCGCCCAGGA GGAGGACTAT
TTCCACCGGA TGATGAAGAC AGGGCACGTA CCGGTCGCGA ACGATCACAA TGACACGATC
GAAATAGTCG TATTCGGCGA CTACGACAAT TATCGGAAGT ACGCTTCGGT GATCTACGGA
ATTAGCACCG ATAACGGCGG CATGTACGTT GAAGGCGATC CGTCGGCACC CGGCAATCAG
GCGCGCTTCA TCGCGCACGA GGCTTCGTGG CTACGGCCGG AGTTCAAGGT CTGGAACCTT
GAGCACGAGT TTACGCACTA TCTCGACGGC CGTTACGACA TGGCGGGCGA CTTCGCGGCG
AGCACGGCGA AGCCCACCGT GTGGTGGATC GAAGGTCTTG CCGAATATAT CTCCAGAAAG
AACGATGACC AGGAATCGAT CGACGCGGTG CGCACGAACG CATATCGGCT CTCGGACGTG
CTTCAGACGA CTTATTCGTC CGGCGACTAT GTCACGCGTG CGTATCGATG GGGTTATATG
GCGACGCGCT TCATGTTTGA ACGTCATCGC GCGGACGTCG ACGCGATCGT GTCACGTTTT
CGCGTGGGCG ATTACGACGG TTACGCGGAC TATGTCGCGT ACATGGGCAA CCGCTATGAC
AGCGAGTTTG TTGACTGGGC ACGCGGCGCG ACAACAACCG GTGAGCCGCC GTTGCCGCCA
ACGAAAGCGG GGCATTGA
 
Protein sequence
MKNSHNVVNR FIVAASIIIG VVLYSSAWAN PQPMHTKQAR MPRIPQNLPL SPDQAKYDLP 
LSKYDRATLM EPLRRKQSAK PDRRTRPGAD CRDMSIMTQY HGTALADYIA NLPDYECHYG
LFSIDRAMAA QIFNSENVWA VASRLTQEIN RYDATNITLV NLLIYLRAAY FQYDAAQLAD
PIPGLVVWLR PYILQSLSGD ALYLENSRAP STANELMILI TNMKDEAYYL PTLKDRIAFY
TASATNPQAA APLLQRSAAG GFTGLLTVFF YAHQRSGAQP MLDSDATLPE TLNRFVTANR
AYLSNTSAAY QLADAARETY RFLRYPSQKP RVKKMIQDML ASTTMTGPDN DLWLAAAEAA
DYGDPGNCAD YGTCDYQKRL IEAVLTHRYS CNANVRILAQ DMTVPQFQSA CQSVAQEEDY
FHRMMKTGHV PVANDHNDTI EIVVFGDYDN YRKYASVIYG ISTDNGGMYV EGDPSAPGNQ
ARFIAHEASW LRPEFKVWNL EHEFTHYLDG RYDMAGDFAA STAKPTVWWI EGLAEYISRK
NDDQESIDAV RTNAYRLSDV LQTTYSSGDY VTRAYRWGYM ATRFMFERHR ADVDAIVSRF
RVGDYDGYAD YVAYMGNRYD SEFVDWARGA TTTGEPPLPP TKAGH