Gene BTH_I1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I1804 
Symbol 
ID3848939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp2022908 
End bp2023942 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content68% 
IMG OID637841473 
ProductU32 family peptidase 
Protein accessionYP_442336 
Protein GI83720783 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAA GCAGCCACTT CGCGACGGGC GCCGCGCCGA TCGAACTCGT GTGCCCGGCG 
GGCAGCCTGC CCGCGCTGAA GGCCGCGGTC GACAACGGCG CGGACTGCGT GTATCTCGGT
TTTCGCGACG CGACGAACGC GCGCAACTTC GCCGGCCTGA ACTTCGACGC GCAGGCGATC
GACGCCGGCA TTCGCTACGC GCGCGAGCGC GGCCGCAAGG TGCTCGTCGC GCTCAACACG
TATCCGCAGC CGGACGGCTG GGCCGCATGG CGGGAAGCGG TCGGCCGCGC GGCCGACGCG
GGCGTCGACG CGATCATCGT CGCCGATCCG GGGCTCATGC GTTTCGCGCG CGAGCGCTAT
CCGGACCTGC GGCTGCACCT GTCGGTGCAG GGCTCGGCGA CGAACTACGA GGCGATCAAC
TTCTATCACG AGCACTTCGG CATATCGCGC GCGGTGCTGC CGCGCGTGCT GTCGCTCGCG
CAGGTCGAGC AGGTGACCGA AAACACGCCG GTCGAAATCG AGGTGTTCGG CTTCGGCAGT
CTGTGCGTGA TGGTCGAGGG GCGCTGCGCG CTGTCGTCGT ACGCGACGGG CGAATCGCCG
AACACGCGCG GCGTGTGCTC GCCCGCGAAG GCGGTGCGCT GGCAGAAGAC GCCGGACGGC
CTCGAATCGC GGCTGAACGG CGTGCTGATC GACCGTTACG AAGACGGCGA GAACGCCGGC
TATCCGACGC TCTGCAAGGG GCGCTTCACG GTGGCCGACG AGAGCTACTA CGCGATCGAG
GAGCCGACGA GCCTGAACAC GCTCGAACTG CTGCCGAAGC TGATGCAGAT CGGCATACGG
GCGATCAAGA TCGAAGGCCG TCAGCGCAGC CCCGCGTACG TCGCGCAGGT GACGCGCGTC
TGGCGCGACG CGATCGATCA GTGCGCATCG AACCTCGCGC GCTACTACGT GAAGCCCGCG
TGGATGACGG AACTGAACAA GGTCGCGGAA GGGCAGCAGC ATACGCTCGG CGCCTACCAC
CGGCCGTGGA AATGA
 
Protein sequence
MTQSSHFATG AAPIELVCPA GSLPALKAAV DNGADCVYLG FRDATNARNF AGLNFDAQAI 
DAGIRYARER GRKVLVALNT YPQPDGWAAW REAVGRAADA GVDAIIVADP GLMRFARERY
PDLRLHLSVQ GSATNYEAIN FYHEHFGISR AVLPRVLSLA QVEQVTENTP VEIEVFGFGS
LCVMVEGRCA LSSYATGESP NTRGVCSPAK AVRWQKTPDG LESRLNGVLI DRYEDGENAG
YPTLCKGRFT VADESYYAIE EPTSLNTLEL LPKLMQIGIR AIKIEGRQRS PAYVAQVTRV
WRDAIDQCAS NLARYYVKPA WMTELNKVAE GQQHTLGAYH RPWK