Gene Bind_2041 details

Gene Information

Locus tag: Bind_2041
Symbol:
ID: 6198883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2332174
End bp: 2333616
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 60%
IMG OID: 641706028
Product: protease Do
Protein accession: YP_001833152
Protein GI: 182679006
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
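
The Start bp, End bp, Gene Length, and Protein Length fields above can be checked against each other with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2332174
end_bp = 2333616
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the coordinate span is 1443 bp, which is 481 codons, leaving 480 residues once the stop codon is excluded.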


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTT GCGGCCGTTC CCCTTTTTCT CCCCTTCGCC TCCTCGTTGC CCTCGGTCTG 
GCCACCGGTC TGCTCTGTGA CTTTGGTCCT TCCGCCTCGG CCGAGACGCG AGAAATCCCG
GTTTCTCATG ATGACGTTCT GCTCTCCTTC GCCGGACCGG TGAAAAAGGC GCAGCCTGCC
GTTGTCAACG TCTATGCCTC GCGAACAGAG CGGCAGCCGC GCAACGTTCT CCTCGACGAT
CCGGTTTTCC GGCGTTTCTT TGGCGACGGC AATGGCCGCC GGCCTGGCGG TCCCACGGCG
CAATCCCTCG GATCCGGCGT GCTCGTTGAT CCGTCAGGCC TTGTCGTCAC CAATTTTCAC
GTCATCGAAG GCATGACCGA CGTCAAGGTT GCCTTGACCG ACAAGCGGGA ATTCGAGGCA
ACGATCGTTT TACGGGATCA GCGCACTGAT CTGGCCGTCT TGCGTCTGAA GGGGGGCGAT
GGGGCCTTCC CATCCATGGA AGTGGGTGAT TCGGACACGC TCCAGGTCGG CGATCTCGTG
CTCGCCATCG GCAATCCCTT CGGCGTCGGC CAGACCGTTA CCCAAGGCAT TGTGTCGGCG
CTTGCGCGGA CACAGGTGGG CATTTCCGAT TATGGTTTCT TCATTCAGAC GGATGCGGCG
ATCAATCCCG GTAATTCCGG CGGCGCCTTG ATCGACATGA AGGCCCGCCT CGTCGGCATC
AATTCCGCCA TCTTCTCCCA GACCGGCAGT TCGATTGGGA TCGGTTTCGC CATTCCGGTC
AATATGGTGA AAGTGGTCGT CGCCGCCGCC AAAAGTGGCG GCCATCAGGT GCATCGGCCC
TGGCTCGGCG CAAGCCTCCA GGGTGTTTCC CGCGAAATCG CTGATTCCCT TGGCCTTGAT
CGCCCTTCGG GTGCCCTCAT CGTCGAGGTG GCGAGCCAGA GCCCTGCTGC GGAAGCGGGC
CTCAAACGGG GTGATCTCAT CACACGCATC GACGGGCAGA CGCTGGAGGA CCCGGAATCT
TTCGGCTATC GTCTCGCGAC GCGCCCACTT GGCGGCAAGG CCCAATTGAC GGTCTTACGC
AACGGCAAGC CGATCGACGC GACATTGAAT CTTTCCGCCG CTCCGGAACA ACCCCCGCGC
GATCCGGTCA AACTCAACGG CCATTCGCCC CTCACCGGCT TGAGTGTCGT CAATCTTTCT
CCCGCCGTCA CTGAGGAATT CTCAATTCAG GGCGCATTCG AAGGCGTGGT GATCAATGAC
ATTGACGAAA ACTCTCCCGC GGCCAATGTC AATTTTCAAC GTGGCGATGT GATCATCGCC
GTCAACGGTG CCAAGATCAC CTCGACTCAT CAGCTTGAAA AAGCGATGAG CGAACAGCAT
TATTATTGGA AGGTAACGGT CGGTCGTGGC CATGATATTT TGACGACGGT GCTCGGCGGC
TGA
 
Protein sequence
MTSCGRSPFS PLRLLVALGL ATGLLCDFGP SASAETREIP VSHDDVLLSF AGPVKKAQPA 
VVNVYASRTE RQPRNVLLDD PVFRRFFGDG NGRRPGGPTA QSLGSGVLVD PSGLVVTNFH
VIEGMTDVKV ALTDKREFEA TIVLRDQRTD LAVLRLKGGD GAFPSMEVGD SDTLQVGDLV
LAIGNPFGVG QTVTQGIVSA LARTQVGISD YGFFIQTDAA INPGNSGGAL IDMKARLVGI
NSAIFSQTGS SIGIGFAIPV NMVKVVVAAA KSGGHQVHRP WLGASLQGVS REIADSLGLD
RPSGALIVEV ASQSPAAEAG LKRGDLITRI DGQTLEDPES FGYRLATRPL GGKAQLTVLR
NGKPIDATLN LSAAPEQPPR DPVKLNGHSP LTGLSVVNLS PAVTEEFSIQ GAFEGVVIND
IDENSPAANV NFQRGDVIIA VNGAKITSTH QLEKAMSEQH YYWKVTVGRG HDILTTVLGG