Gene BBta_1107 details


Gene Information

Locus tag: BBta_1107
Symbol:
ID: 5152322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 1179069
End bp: 1180040
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 64%
IMG OID: 640556096
Product: putative serine protease
Protein accession: YP_001237263
Protein GI: 148252678
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
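
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the record does not state either convention, but both agree with the values it reports. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1179069, 1180040)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the result matches the listed Gene Length (972 bp) and Protein Length (323 aa).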


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0917977
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCTT TGACCGAATG GAAAGTGCCG TTCGCCTTCC AGCCTCGCGC CGAAGATTAC 
CAATATGATC TTGATCACGC CTTGTCCTCG ATCGTGGGCC TGCACGCGAT CATTCCCGCC
GACGCGTTCA GCGCGGAGAC GCTCGGCACC GAACGCGCCG GCAATGCGGT CGTGATCGAC
GAAGGCCTGG TGCTCACGAT CGGTTATCTC ATCACCGAGG CCGAATCCGT CTGGCTGCAC
CTCAACGATG GCCGGGTGGT CGAAGGCCAC GTGCTGGGGT TCGATTTCGC CACCGGCTTC
GGTCTGGTGC AGGCGCTCGG GCAGCTTGAC CTGCCGGCAT TGCCGCTCGG CTCCTCGGAG
TCTGCCAAGA TTGGCGATCA GGTGGTGCTC GGTGGCGCCG GCGGCCGCAC GCGATCCGTC
GCCAGCCAGA TCATCGCCAA GCAGGAATTC GCCGGCTATT GGGAATATCT GCTGGACGAG
GCGATCTTCA CCCATCCGGC GCATCCGAAT TGGGGCGGCA CTGCCCTGCT CTCGGCCAAG
GGCGAGCTGA TCGGCATCGG GTCGCTGCAG CTCGAACGTG AGCGCGACAA CAAATCTGAG
CACGTCAACA TGATCGTGCC GATCGATCTG CTGAAACCGA TTATCGACGA TCTCAAGCGG
TTCGGCCGCG TCAACAAGCC GGCCCGGCCC TGGCTCGGCC TCTACGCCAC CGAGGTCGAC
GATCGCGTCG TGGTGATCGG CGTCTCGAAC AACGGCCCCG CCGCCCGCGC CGAATTGAAG
GCCGGCGACG TCATCCTCGG CATCAATGGC GACAAGGTGA CGAGCCAGAG CGAATTCTAC
CGCAAGCTCT GGGCGCTCGG CGACGCCGGC GTGGACGTGC CACTCACGGT GCATCACGAG
GGCGTCACCT TCGACGTCAC CGTGGCCTCG ACCGATCGCG CCAAGCTGCT CAAGGCGCCG
CGGCTGCACT GA
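
The GC content field in this record can be compared against the gene sequence above. The sketch below uses the conventional definition (G plus C bases divided by total bases); how the database itself computed or rounded its figure is not stated, so treat this as an assumption. The function name `gc_fraction` is illustrative.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this record)
    is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Small illustration on a toy sequence, not on the gene itself.
print(gc_fraction("GGCC AATT"))
```

Passing the full gene sequence as a single string gives a fraction that can be multiplied by 100 and compared with the reported GC content.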
 
Protein sequence
MPSLTEWKVP FAFQPRAEDY QYDLDHALSS IVGLHAIIPA DAFSAETLGT ERAGNAVVID 
EGLVLTIGYL ITEAESVWLH LNDGRVVEGH VLGFDFATGF GLVQALGQLD LPALPLGSSE
SAKIGDQVVL GGAGGRTRSV ASQIIAKQEF AGYWEYLLDE AIFTHPAHPN WGGTALLSAK
GELIGIGSLQ LERERDNKSE HVNMIVPIDL LKPIIDDLKR FGRVNKPARP WLGLYATEVD
DRVVVIGVSN NGPAARAELK AGDVILGING DKVTSQSEFY RKLWALGDAG VDVPLTVHHE
GVTFDVTVAS TDRAKLLKAP RLH
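
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon table from the NCBI base ordering (TCAG); table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Since this gene begins with ATG, alternative starts are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the 10-base grouping and newlines
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: no residue is added
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCTTCTT TGACCGAATG GAAAGTGCCG"))  # MPSLTEWKVP
```

The output for the first 30 bases matches the first ten residues of the protein sequence, and the final 12 bases (CGGCTGCACT GA) translate to RLH followed by the TGA stop codon.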