Gene Bind_2235 details

Gene Information

Locus tag: Bind_2235
Symbol:
ID: 6201155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2562957
End bp: 2564546
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 63%
IMG OID: 641706224
Product: protease Do
Protein accession: YP_001833342
Protein GI: 182679196
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
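
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2562957
end_bp = 2564546
reported_gene_length = 1590      # bp
reported_protein_length = 529    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1590 0 529
```

Under these assumptions the computed values agree with the reported gene length and protein length.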


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.355037
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAAT TGGAATTTCC CCCGCAGGAT AAAGGCGCCG CACGGAACGC CGCCACGCCA 
CGCTCGCAGC GTCGGCCGGG GATGCGCGCC ATCCTGCTCG GCGCCACAGC GGCCGTGGCC
CTGACCGGTG CCTTCACGCA TTCCGTCCTG CTGCCGCAAG CGGCCAATGC CGAAACACCC
ACCCTGAACG TGCCGGTGAA TGCGCCGAAT AGCAGCCCAG TCGTCGGACC GGTCTCTTTT
GCCGATGTGG TCGACCATGT GCGGGGGGCT GTCGTTTCGG TCAAGGTCAA GATTACCGAA
ACCGCCGATA ATGAAGAGGC CAATACCGGA AACGACATGC CTCAATTCGC CCCGGGCGAT
CCCCTGGAGC GTTTCTTCCG CCGCTTTGGC GAACAAGGAG GCGTCCCCTT CAACAAACAT
AGCGGCAAGC CACGGACCGG CCAGGCGCAG GGTTCGGGAT TCATCATTTC GAGCGATGGC
TATGTCGTCA CCAATAATCA TGTCGTCGAA AACGCGACAG AGGTCAGCCT GACGACCGAT
GGCGGTCAGA CCTTGACCGC GAGCGTGGTT GGCACCGACA AGAAGACTGA TCTCGCTCTC
TTGAAGATCA ATGGCTCGGG CACCTATCCC TTCGTCAAAT TCTCCAACGA GACACCCCGT
GTCGGCGAAT GGGTCATCGC TGTCGGCAAT CCTTTCGGTC TCGGCGGCAC GGTGACGGCA
GGCATTATTT CAGCGCGCGG CCGCGATATC GGCGCCGGCC CCTATGACGA CTTCCTGCAG
GTCGACGCCC CGGTCAATCG CGGCAATTCC GGTGGCCCGA CCTTCAACGC CAAGGGCGAC
GTGGTCGGCG TCAATACGGC GATCTTCTCA CCGTCCGGCG GCAGCGTCGG CATCGGCTTC
GCCATTCCCG CGGAGGTTGC GCAAAACGTC ATCACCTCCT TGCGGGAGAA AGGCACGGTC
GCGCGCGGTT GGATCGGCGT CCAGATTCAG CCTGTGACAG CGGAAATCGC CGATAGTCTC
GGCCTGAAAA CCAGCAAGGG CGCCCTGGTT GCCGAGGCAC AGCCGAATTC TCCCGCGCTC
TCGGCCGGTA TCCGCTCCGG TGACGTGATC CTCGGCGTCG ATGGCGAACG CATCGATGGT
CCGCGCGAAC TGGCCCGCAA GATAGCGGCG CTCGGCCCTG GCAAGAGCAC CAATCTCATG
TATTGGCACG ATGGCTCGGA AAAGACCGTC GCGGTGAAAC TCGGCAATCT GCCAAATGAC
AAGGAAGCCA AGGCGGACAT CACGACACGC CCCGATAAAA ACGTCCTCGG CGATCTCGGT
CTGACGCTCG CCCCGGCGGC GCAGGTCCCC GGCGCCGGCG ATGAAGGTGT AGTCGTCTCC
GACATCGATC CCGATGGCGT TGCCGCACAA AAGGGTTTGC GTGTCGGTGA TGTCATTCTC
GAAGCCGGTG GGCACGCTGT CAGCCGTCCG GCCGAAATCG GCGCGACCTT GAGCACCGCC
AAGAAAGATG GCCGCAAGGC CGTGCTCATG CGTGTCAAGA ATCGGGAAGG CACCCGCTAC
GTCGCGCTTG CGACCACTCC GGCTTCCTGA
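
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the record) strip the whitespace and compute a GC percentage, which is one way to compare a sequence against the reported GC content of 63%. The sketch assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example: the first line of the gene sequence listing.
first_line = "ATGCCGCAAT TGGAATTTCC CCCGCAGGAT AAAGGCGCCG CACGGAACGC CGCCACGCCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` gives the value for the whole gene.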
 
Protein sequence
MPQLEFPPQD KGAARNAATP RSQRRPGMRA ILLGATAAVA LTGAFTHSVL LPQAANAETP 
TLNVPVNAPN SSPVVGPVSF ADVVDHVRGA VVSVKVKITE TADNEEANTG NDMPQFAPGD
PLERFFRRFG EQGGVPFNKH SGKPRTGQAQ GSGFIISSDG YVVTNNHVVE NATEVSLTTD
GGQTLTASVV GTDKKTDLAL LKINGSGTYP FVKFSNETPR VGEWVIAVGN PFGLGGTVTA
GIISARGRDI GAGPYDDFLQ VDAPVNRGNS GGPTFNAKGD VVGVNTAIFS PSGGSVGIGF
AIPAEVAQNV ITSLREKGTV ARGWIGVQIQ PVTAEIADSL GLKTSKGALV AEAQPNSPAL
SAGIRSGDVI LGVDGERIDG PRELARKIAA LGPGKSTNLM YWHDGSEKTV AVKLGNLPND
KEAKADITTR PDKNVLGDLG LTLAPAAQVP GAGDEGVVVS DIDPDGVAAQ KGLRVGDVIL
EAGGHAVSRP AEIGATLSTA KKDGRKAVLM RVKNREGTRY VALATTPAS
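
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The function name `translate` is illustrative, and the example covers only the first 30 bases of the gene.

```python
from itertools import product

# Codon table in TCAG order. The amino acid assignments are those of the
# standard genetic code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listing.
print(translate("ATGCCGCAAT TGGAATTTCC CCCGCAGGAT"))  # prints: MPQLEFPPQD
```

The output matches the first ten residues of the protein sequence above. The same function applied to the last three blocks of the gene, `GTCGCGCTTG CGACCACTCC GGCTTCCTGA`, yields `VALATTPAS`, the end of the protein.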