Gene BURPS668_A0988 details

Gene Information

Locus tag: BURPS668_A0988
Symbol:
ID: 4887219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 957620
End bp: 959605
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 57%
IMG OID: 640130928
Product: collagenase
Protein accession: YP_001061987
Protein GI: 126442357
COG category:
COG ID:
TIGRFAM ID:
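
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 957620
END_BP = 959605


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1986
print(protein_length(length_bp))  # 661
```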


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.244986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACTATAC ATGATCATCA ATCATATTCA CGACAACAAA GGGCAAATAT GAAAAATTCC 
CGCAATGTCG TCAATCGTTT TATTGTCGCC GCTTCTATTA TCATTGGAGT CGTTCTTTAC
AGTTCCGCTT GGGCAAATCC GCAGCCCATG CATACGAAGC AGGCACGTAT GCCGCGTATC
CCGCAGAATC TCCCGCTTTC ACCAGACCAA GCCAAATACG ACCTGCCGCT CAGCAAGTAT
GACCGCGCAA CGCTGATGGA GCCGTTGCGG CGGAAGCAAT CAGCGAAACC CGACAGGCGC
ACCCGGCCTG GAGCAGATTG CCGCGACATG TCAATAATGA CGCAATATCA CGGCACGGCG
CTTGCTGATT ACATAGCAAA CCTCCCGGAT TATGAGTGCC ACTACGGACT ATTCTCGATT
GACAGGGCGA TGGCCGCGCA GATTTTCAAT TCTGAAAACG TGTGGGCTGT TGCCAGCCGT
CTCACTCAAG AAATCAATCG TTACGACGCA ACAAATATTA CATTGGTAAA TTTGCTTATT
TATCTGAGAG CCGCTTATTT CCAATATGAC GCAGCCCAGC TTGCTGATCC GGTTCCCGGT
CTCGTAGTCT GGCTGCGTCC GTATATTTTG CAGAGCCTCT CTGGCGACGC GCTTTACCTC
GAGAATTCAC GCGCGCCGAG CACCGCCAAC GAGCTGATGA TCCTAATCAC AAACATGAAG
GACGAGGCGT ACTACCTGCC AACGCTGAAG GACCGAATCG CGTTCTACAC CGCGAGCGCG
ACCAACCCTC AGGCTGCGGC GCCGCTACTG CAGCGAAGCG CGGCGGGTGG CTTCACCGGC
TTGCTCACGG TGTTCTTCTA CGCGCATCAG CGCAGCGGCG CTCAGCCGAT GCTCGATAGC
GATGCGACTC TGCCGGAGAC GCTCAACCGC TTCGTCACGG CGAACCGCGC ATACCTGTCG
AACACCAGTG CCGCCTATCA GCTCGCCGAT GCGGCGCGCG AAACGTACCG CTTTCTCCGC
TATCCGTCGC AGAAGCCGCG GGTGAAGAAA ATGATTCAGG ATATGCTCGC GTCGACTACC
ATGACGGGCC CGGACAACGA CCTGTGGCTC GCGGCAGCGG AAGCAGCCGA TTACGGCGAT
CCCGGCAACT GCGCAGATTA CGGCACGTGC GACTATCAGA AGCGGCTCAT CGAGGCAGTG
CTCACGCATC GGTACTCATG CAATGCGAAC GTACGAATTC TCGCGCAGGA CATGACGGTG
CCGCAATTCC AGTCGGCATG CCAATCGGTC GCCCAGGAGG AGGACTATTT CCACAGGATG
ATGAAGACAG GGCACGTACC GGTCGCGAAC GATCACAATG ACACGATCGA AATAGTCGTA
TTCGGCGACT ACGACAATTA TCGGAAGTAC GCTTCGGTGA TCTACGGAAT TAGCACCGAT
AACGGCGGCA TGTACGTTGA AGGCGATCCG TCGGCACCCG GCAATCAGGC GCGCTTCATC
GCGCACGAGG CTTCGTGGCT ACGGCCGGAG TTCAAGGTCT GGAACCTTGA GCACGAGTTT
ACGCACTATC TCGACGGCCG TTACGACATG GCGGGCGACT TCGCGGCGAG CACGGCGAAG
CCCACCGTGT GGTGGATCGA GGGTCTTGCC GAATATATCT CCAGAAAGAA CGATGACCAG
GAATCGATCG ACGCGGTGCG CACGAACGCA TATCGGCTCT CGGACGTGCT TCAGACGACT
TATTCGTCCG GCGACTATGT CACGCGCGCG TATCGATGGG GTTATATGGC GACGCGCTTC
ATGTTTGAAC GTCATCGCAC GGACGTCGAC GCGATCGTGT CACGTTTTCG CGTGGGCGAT
TACGACGGTT ACGCGGACTA TGTCGCGTAC ATGGGCAACC GCTATGACAG CGAGTTTGTT
GACTGGGCAC GCGGCGCGAC AACAACCGGT GAGCCGCCGT TGCCGCCAAC GAAAGCGGGG
CATTGA
 
Protein sequence
MTIHDHQSYS RQQRANMKNS RNVVNRFIVA ASIIIGVVLY SSAWANPQPM HTKQARMPRI 
PQNLPLSPDQ AKYDLPLSKY DRATLMEPLR RKQSAKPDRR TRPGADCRDM SIMTQYHGTA
LADYIANLPD YECHYGLFSI DRAMAAQIFN SENVWAVASR LTQEINRYDA TNITLVNLLI
YLRAAYFQYD AAQLADPVPG LVVWLRPYIL QSLSGDALYL ENSRAPSTAN ELMILITNMK
DEAYYLPTLK DRIAFYTASA TNPQAAAPLL QRSAAGGFTG LLTVFFYAHQ RSGAQPMLDS
DATLPETLNR FVTANRAYLS NTSAAYQLAD AARETYRFLR YPSQKPRVKK MIQDMLASTT
MTGPDNDLWL AAAEAADYGD PGNCADYGTC DYQKRLIEAV LTHRYSCNAN VRILAQDMTV
PQFQSACQSV AQEEDYFHRM MKTGHVPVAN DHNDTIEIVV FGDYDNYRKY ASVIYGISTD
NGGMYVEGDP SAPGNQARFI AHEASWLRPE FKVWNLEHEF THYLDGRYDM AGDFAASTAK
PTVWWIEGLA EYISRKNDDQ ESIDAVRTNA YRLSDVLQTT YSSGDYVTRA YRWGYMATRF
MFERHRTDVD AIVSRFRVGD YDGYADYVAY MGNRYDSEFV DWARGATTTG EPPLPPTKAG
H
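
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that translation, not the database's own pipeline. The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative, and the start-codon set is the one I understand NCBI to list for table 11.

```python
from itertools import product

# Amino-acid assignments of translation table 11, in the conventional
# TCAG ordering (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon always encodes methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_60 = "TTGACTATAC ATGATCATCA ATCATATTCA CGACAACAAA GGGCAAATAT GAAAAATTCC"
print(translate(first_60))  # MTIHDHQSYSRQQRANMKNS
```

The output matches the first 20 residues of the protein sequence listed in the record.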