Gene BTH_II1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1578 
Symbol 
ID3844676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1849092 
End bp1850900 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content64% 
IMG OID637838879 
Productcollagenase, putative 
Protein accessionYP_439773 
Protein GI83717827 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.285851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCGCA TGCCGCAGAA TCTGCCGGTT TCACCCGAGC AGGCCGAGTA CAACCTGCCG 
CTCAGCGAGC AGGACCGGGC GGCGTTGACC AAGCCTTCGC AGCTCAAGCA GCAGGCCAAG
CGCAGCAAGC GCAGCGCGCC GGGCGCCGAT TGCCGCGACA TGTCGGCGAT GACGCGGTAT
CGCGGCGCGG CACTCGCCGA TTACATCGCG AATCTTCCCG ATTACGAATG CCATTACGGC
TTGTTCTCAG TCGATAAAAC GCTGGCTCAG CAGATTTTCA ATGCCGAAAA CGTGCATGCC
GTCGCGAGCC GTTTTGTGCA GGAAGTCTAT CGCTATGATG CGAGCAATTT GATTCTGGTC
AATTTGCTGA TTTATCTGCG TTCCGCTTAT TACCAATATG ATGTATCGGG CATTGCCGAT
CCGATTCCGG ATCTCGCCGT GTGGCTGCGT CCGTATATCA AGCAAAGCCT CGAAGGCGAG
GCGCTCTATC GCGAGAACGA CCGCGCGCCG AGCACGGCGA ACGAGCTGAT GAAGCTCATC
ACGAACATGA AGGACGAGGC GTACTATCTG CCGACGCTGA AGAACCGCAT CGCGTCCTAC
ACGGCGAGCG CGACGAATCC GCAGGCGGCG GCGCCGCTGT TGCAGCGCAG CGCGGCGGGC
GGCTTCACCG GCTTGCTCAC GGTGTTCTTC TACGCGCATC AGCGCAGCGG CGCGCGGCAG
ATGCTCGACA GCGACGCGAC GCTGCCGGAG ACGCTCAACC GCTTCGTCAC CGCGAACCGA
GCGAGCCTGT CGAATACGAG CGCCGCGTAT CAGCTTGCCG ACGCCGCACG CGAGACGTTT
CGCTTTCTCC GTTATCCGAC GCAGAAGCCG CGCGTGAAGA AGATGATCCA GGACATCCTG
GCATCGACGA GCCTGACGGG GGCGGACAAC GATCTGTGGC TCGCGGCGGC GGAAGCGGTC
GACTACGGCG ACGCGGGCAA CTGCGCGGAC TACGGCACGT GCGACTACAA GAAGCGGCTG
ACCGATGCGG TGCTCACGCA TCGTCACGCA TGCAACGCGA GCGTGCGCAT TCTCGCGCAG
GACATGACGG CGCCGCAGTT GCAGTCGGTC TGCGCGGCGG TCGCGCAGCA GGACGATTAC
TTCCACCGGA TGATGAAGAC CGGGCGCAAG CCGGTCGCGG GCGATCGCAA CGACACGATC
GAACTCGTCA TCTTCGACGA CTACGCGAAC TATCGCAAAT ACGCTTCGGT GATCTACGGC
ATCAGCACCG ACAACGGCGG CATGTACCTC GAAGGCGATC CGTCCGCGCC CGGCAACCAG
GCGCGCTTCA TTGCCCATGA GGCGTCGTGG CTGCGGCCCG AGTTCAAGGT CTGGAACCTC
GAGCACGAGT TCACGCACTA TCTGGACGGC CGCTACGACA TGGCGGGCGA CTTCGCGGCG
AGCACCGCGA AGCCCACCGT CTGGTGGATC GAAGGCGTCG CCGAATATCT GTCGAGAAAG
AACGGCAACC AGGAGTCGAT CGACGCGGCG CGCACGGGCG CGTACCGGTT CGCGGACGTG
CTCGGCACGC TGTATTCGTC GAGCGACTAC GTCGCGCGCG CATACCGGTG GGGCTACATG
GCGACGCGCT TCATGTTCGA GCGCCACCGC GCGGACGTCG ACACGATCGT GTCGCGCTTC
CGGGCGGGCG ATTACGACGG CTACGCGAAC TACGTCGCGT ACATCGGCAA CCGCTACGAC
AACGAGTTCG TCGACTGGGC GCGCAACGCG ACGACGGCGG GCGAGCCGCC GCTGCCGACG
CAGCGCTGA
 
Protein sequence
MPRMPQNLPV SPEQAEYNLP LSEQDRAALT KPSQLKQQAK RSKRSAPGAD CRDMSAMTRY 
RGAALADYIA NLPDYECHYG LFSVDKTLAQ QIFNAENVHA VASRFVQEVY RYDASNLILV
NLLIYLRSAY YQYDVSGIAD PIPDLAVWLR PYIKQSLEGE ALYRENDRAP STANELMKLI
TNMKDEAYYL PTLKNRIASY TASATNPQAA APLLQRSAAG GFTGLLTVFF YAHQRSGARQ
MLDSDATLPE TLNRFVTANR ASLSNTSAAY QLADAARETF RFLRYPTQKP RVKKMIQDIL
ASTSLTGADN DLWLAAAEAV DYGDAGNCAD YGTCDYKKRL TDAVLTHRHA CNASVRILAQ
DMTAPQLQSV CAAVAQQDDY FHRMMKTGRK PVAGDRNDTI ELVIFDDYAN YRKYASVIYG
ISTDNGGMYL EGDPSAPGNQ ARFIAHEASW LRPEFKVWNL EHEFTHYLDG RYDMAGDFAA
STAKPTVWWI EGVAEYLSRK NGNQESIDAA RTGAYRFADV LGTLYSSSDY VARAYRWGYM
ATRFMFERHR ADVDTIVSRF RAGDYDGYAN YVAYIGNRYD NEFVDWARNA TTAGEPPLPT
QR