Gene BBta_3032 details

Gene Information

Locus tag: BBta_3032
Symbol: degP
ID: 5156163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 3168096
End bp: 3169193
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 68%
IMG OID: 640557904
Product: Serine protease do-like precursor
Protein accession: YP_001239058
Protein GI: 148254473
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
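
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3168096, 3169193)
print(length, protein_length(length))  # 1098 365
```

Under these assumptions the result matches the listed gene length of 1098 bp and protein length of 365 aa.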


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.722688
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAGATTC TGCGTTGTGT CCGGCCTTTG GTTCTGCTGC CGGCGATCGT GCTCTCCCTC 
CTGGCTGCCC CTGCCCTCGC CCAGATCCCG GACCTCAAGC TCGGCCGCGT GCCGACCTTG
GCGCCACTGG TCAAGGAGGT CACCCCGGCC GTCGTCAACA TCTCGGTCGA AGGCAAGGTC
CGGCAGGACA ATCCGCTGTA CCAGGACCCG CTGTTCCGCG AGTTCTTCGA CGTTCCGAAA
CAGGTCGAGA AGCAGATCAG CGCCACCGGC TCCGGCGTCA TCGTCGATGC GCAGCGCGGC
TACGTGATGA CCGCCAATCA CGTCGTCGAG CATGTCAGCA CCGCACAAAT CCGGACCAAG
GACGGCCGCA AATTCTCCGC CCGCCTGGTC GGGCGCGATC CCGCCACCGA CATCGCGGTG
CTGCAGATCA AGGATCCGAC CGAGCTCAAG GCGATCGCGC TTGGCGACAG CGATGCGCTC
GAGGTCGGCG ACTTCGTGAT CGCGGTCGGC AACCCGTTCG GCCTCGGACA GACCGTCACC
TCCGGCCTCG TCAGCGCGCT CGGGCGAACC GGGCTCGGCA AGCAGGGCTA TGAGGATTTC
ATCCAGACCG ACGCCGCGAT CAATCCCGGC AATTCAGGCG GGGCGCTGAT CAACCTCCGC
GGCGAACTGG TCGGCATCAA CACCGCGATC ATCTCGCCGG GCGGCGGCAA TGTCGGCATC
GGCTTTGCCG TGCCGATCAA CATGGCGCGA CGGGTGATGG AGCAACTGGT CGCCAACGGC
CGCGTCGACC GCGGACGCAT CGGGGTCACC CTGCTCGATC TGGATTCGCC GGCCGATGGC
CGCGTCCAGG GCGCCCGCGT CGCCGATGTG ACCGTCGGCT CCCCGGCCGA GCGGGCCGGA
CTGCGCAAGG GCGACATCAT CGTGAAGGCG AACGACATGC CGGTGCGCAG CGCGACGCAG
GTTCGCAATC TCATCGGGCT GACGCCGGTC GGCCAACGCG TCCGCCTCGT GTTCGAGCGC
GACCGCGCGC TCGGCAATGC GACGGTCGAG GTCGCGCCGG TTACCGAAGA ACGCGCCCGC
GCGCGAAGCT CGGGCTGA
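
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. As a minimal sketch, the helpers below (hypothetical names, plain Python, no external library) strip the whitespace and compute the fraction of G and C bases, which can then be compared with the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCAGATTC TGCGTTGTGT CCGGCCTTTG GTTCTGCTGC CGGCGATCGT GCTCTCCCTC"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence text to gc_fraction gives the value for the whole gene rather than for a single line.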
 
Protein sequence
MQILRCVRPL VLLPAIVLSL LAAPALAQIP DLKLGRVPTL APLVKEVTPA VVNISVEGKV 
RQDNPLYQDP LFREFFDVPK QVEKQISATG SGVIVDAQRG YVMTANHVVE HVSTAQIRTK
DGRKFSARLV GRDPATDIAV LQIKDPTELK AIALGDSDAL EVGDFVIAVG NPFGLGQTVT
SGLVSALGRT GLGKQGYEDF IQTDAAINPG NSGGALINLR GELVGINTAI ISPGGGNVGI
GFAVPINMAR RVMEQLVANG RVDRGRIGVT LLDLDSPADG RVQGARVADV TVGSPAERAG
LRKGDIIVKA NDMPVRSATQ VRNLIGLTPV GQRVRLVFER DRALGNATVE VAPVTEERAR
ARSSG
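
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration only: it builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which does not matter here because the gene begins with ATG). The translate function is a hypothetical helper, it stops at the first stop codon, and it will raise a KeyError on characters other than A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1 until the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First thirty bases of the gene sequence in the record.
print(translate("ATGCAGATTC TGCGTTGTGT CCGGCCTTTG"))  # MQILRCVRPL
```

The ten residues produced from the first thirty bases agree with the start of the protein sequence, and the final codons of the gene (GCG CGA AGC TCG GGC TGA) give ARSSG followed by a stop, which agrees with its end.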