Gene BTH_II1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1417 
Symbol 
ID3844789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1670588 
End bp1672270 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content70% 
IMG OID637838719 
Productserine metalloprotease MrpA 
Protein accessionYP_439613 
Protein GI83717882 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.620558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AACATAGCAA TCCTGAACAA ACGCGGCGCG GCCAGGTCCG CGCCGTCGCC 
GGCATTCTCT CCATGTCCGT GCTCGTTCCG CTCGCCGGTT GCGGCGGCGG CGGCGACGGA
GGCGCAAGCG GCACGCCGTC CGCAGCCGCC CAGCCGACGG CGCCCACGCC GGCTCCGGCG
CCCGCGCCAA GTTCGGGCTC GTCGCAATCC GCGAATTCGT CCACGTCGAC GGCGGCCTGC
CCCGTCACGC AGGCCGCATC GACCGCCGCA AGCGAATCGC TCGCCCCCAG CACCGTCGCG
TACGACGCGC CCGTCGACCA TCTGATCGTC AAGCTGCAAA GCGCACCGGC GACGAGCGCA
TCGGGCGCGC GCATCATGGC CGCGACGAAC GACGCGGCCC GACTCGATTC GCTGATTCAG
CGGGTGATGT CGCAATGGAA CGCGAAGAGC GGCAACGTTC GCTCGTATGC GCAGAACGTC
GCGCCGATGA ACGCGGTGCA GGTCGAGCGG ACGATGTCGG ACGGCGCCGC GCTGCTCGCG
CTCGGACAAA AGATGAGCGC GGATAACGCC GGCGCTCTCG CGCAAACGTT CGCGGCCGAT
CCGGACGTCG CGTACGCGGA GCCCGACCGG CGCGTGTTCG CCCGCACGGT GGCGACCGAT
CCGAGCTACT CGCAGCAATG GAATTACTTC GATCCGACGG CCGGCATCGA CCTGCCGAAC
GCGTGGGACG TGACGACCGG CCTGCCGAGC GTCGTCACCG CGGTGCTCGA CACCGGCTAC
CGTCCGCATC CGGACATCAT CGCGAACCTG CTGCCCGGCT ACGACTTCAT CTCCGACATC
AACACCGGCA ACAACGGCCA TACCCGCAGC CCGGACGCGA CCGACCCGGG CGACTGGGTC
ACGCAGCAGG AACTGACCGA TCCGTCGAGC CCGTTCTACC AATGCGCGAG CGCGCCGTCG
AACAGCAGCT GGCACGGCAC GCAGGTCGCC GGCATCATCG GCGCCGCCGC GAACAACGGC
ATCGGCATCG CGGGCGTCAA CTGGTACGGC AAGATCCTGC CCGTGCGCGT GCTCGGCAAG
TGCGGCGGCA CGACGAGCGA TATCGCCGAC GCGATGCGCT GGGCGGCGGG CATCCCTGTC
GCGGGCGCCC CGACGAACCT CACGCCCGCG AAGGTGATCA ACCTGAGCCT CGGCGGCACC
GGCCCGTGCG GCGATACGTT CCAGCAAGCG ATCAACGACG TGATCGCGCG CGGCGCCACC
GTCGTCGTCT CGGCCGGCAA CGACGGCCAG GCGACGACGC TGGACCGCCC GGCGAACTGC
AAGGGCGTGA TCTCGGTGGG CGCGACCGAC AGCACGGGCC AGCGGGCGTG GTATAGCAAC
TTCGGCTCGG ACATCACGTT GAGCGCGCCG GGCTCGAACA TCCTGTCGAC GACCAATGCG
GGCACGACGG TGCCGACCAC CGACGCGTAC GGCACCCACA GCGGCACGAG CCTCGCCGCG
CCGCAGGTGG CGGGCGTCGC CGCGCTGATG CTCTCGGCCA ACCCGAACCT GACGCCCGCG
CAGATCGCGC AGAAGCTCGC GAGCACCGCG CGGCCATCGC CGGCGACGAC GTCCTGCCTC
GCCCGCGCAC CGGGTGCGGG CATCCTCGAT GCCGGCACGG TGGTTGCGTC CGCAACGAAA
TAG
 
Protein sequence
MNKKHSNPEQ TRRGQVRAVA GILSMSVLVP LAGCGGGGDG GASGTPSAAA QPTAPTPAPA 
PAPSSGSSQS ANSSTSTAAC PVTQAASTAA SESLAPSTVA YDAPVDHLIV KLQSAPATSA
SGARIMAATN DAARLDSLIQ RVMSQWNAKS GNVRSYAQNV APMNAVQVER TMSDGAALLA
LGQKMSADNA GALAQTFAAD PDVAYAEPDR RVFARTVATD PSYSQQWNYF DPTAGIDLPN
AWDVTTGLPS VVTAVLDTGY RPHPDIIANL LPGYDFISDI NTGNNGHTRS PDATDPGDWV
TQQELTDPSS PFYQCASAPS NSSWHGTQVA GIIGAAANNG IGIAGVNWYG KILPVRVLGK
CGGTTSDIAD AMRWAAGIPV AGAPTNLTPA KVINLSLGGT GPCGDTFQQA INDVIARGAT
VVVSAGNDGQ ATTLDRPANC KGVISVGATD STGQRAWYSN FGSDITLSAP GSNILSTTNA
GTTVPTTDAY GTHSGTSLAA PQVAGVAALM LSANPNLTPA QIAQKLASTA RPSPATTSCL
ARAPGAGILD AGTVVASATK