Gene BBta_6094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_6094 
Symbol 
ID5153276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp6322104 
End bp6323600 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content67% 
IMG OID640560808 
Productputative Serine protease do-like precursor 
Protein accessionYP_001241927 
Protein GI148257342 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA CCCTTGCTCT CAGCCGCTTG CGCCCGATCG TGACCGCGGC GGTTTTCGCC 
GTCGGCTGGG CGCTCGCGCC GGCGCCGGCT TCGGCACGGG GGCCCGAGGG GATTGCCGAC
GTCGCCGAAA AGGTGATCGA CGCGGTGGTC AACATCTCGA CCTCGCAGAC CGTCGAGGCC
AAGGGCGGCG GAGAGGGGCG CGGCGCCAAT CCGCAGGTGC CGCCGGGCTC GCCGTTCGAG
GAGTTCTTCG ACGACTTCTT CAAGAACCGC CGCGGCGGCC CAGGCGGCCG GGGCGGGGCC
GACAACTCGC CACGCCGCAC CAATTCGCTG GGCTCGGGCT TCATCGTCGA CGATTCCGGC
GTGGTCGTCA CCAACAACCA CGTCATCGCC GATGCTGACG AGATCAACGT GATCATGAAC
GACGGCACCA AGATCAAGGC CGAGCTGGTC GGCGTCGACA AGAAGACGGA CCTCGCGGTG
CTCAAGTTCA AGCCGCCGCG GCCGCTGACC GTGGTGAAGT TCGGCGATTC CGACAAGCTT
CGGCTGGGCG ACTGGGTGGT CGCGATCGGC AACCCGTTCA GCCTCGGCGG CACCGTCACC
GCGGGCATCG TCTCGGCCAA GAACCGCGAC ATCTCGTCGG GGCCCTATGA CAGCTACATC
CAGACCGACG CCGCGATCAA TCGCGGCAAT TCCGGCGGTC CGCTGTTCAA CCTCGACGGC
GAGGTCATCG GCGTCAACAC CCTGATCATC TCGCCGTCGG GCGGCTCGAT CGGCATCGGC
TTCGCGGTGC CGTCGAAGAC GGTGGCCGGC GTCGTCGACC AGCTGCGCCA GTTCGGCGAA
CTGCGCCGCG GCTGGCTTGG TGTGCGCATC CAGGGCGTCA CCGACGAGAT CGCGGAGAGC
CTCAACATCA AGCCGGCGCG CGGCGCGCTG GTCGCAGGCG TCGACGACAA GGGACCGGCC
AAGCCGGCGG GCATCGAGCC GGGCGACGTG GTCGTCAAGT TCGACGGTCA TGACATCAAG
GAGCCGAAGG ACCTGTCGCG GATCGTCGCC GACACCGCGG TCGGCAAGGA AGTCGATGTC
ATCGTCATCC GCAAGGGCCA GGAGCAGACG CTCAAGGTCA AGCTCGGCCG GCTCGAGGAC
AACGAGAAAG TGCAGCAGGC GGCGATCAAG AAGGACGAGC CGGCCGAGAA GCCCCAGACG
CAGAAGGCGC TCGGGCTCGA TCTCGCCGCG CTGTCGAAGG ATCTGCGCAC GCGCTACAAG
ATCAAGGACA GCGTCAAGGG CGTCATCGTC ACCGGCGTCG ACCAGGGCTC CGATGCCGCC
GAGAAGCGCC TGTCGGCCGG CGACGTCATC GTCGAGGTTG CGCAGGAGGC CGTGACCTCC
GGCGCCGATA TTAAGAAGCG GGTCGACCAG CTCAAGAAGG ACGGCAAGAA GTCGGTGCTG
CTTTTGGTCT CCAACGCCGA CGGCGAACTC CGCTTCGTGG CGCTCAGCCT GCAGTAA
 
Protein sequence
MTATLALSRL RPIVTAAVFA VGWALAPAPA SARGPEGIAD VAEKVIDAVV NISTSQTVEA 
KGGGEGRGAN PQVPPGSPFE EFFDDFFKNR RGGPGGRGGA DNSPRRTNSL GSGFIVDDSG
VVVTNNHVIA DADEINVIMN DGTKIKAELV GVDKKTDLAV LKFKPPRPLT VVKFGDSDKL
RLGDWVVAIG NPFSLGGTVT AGIVSAKNRD ISSGPYDSYI QTDAAINRGN SGGPLFNLDG
EVIGVNTLII SPSGGSIGIG FAVPSKTVAG VVDQLRQFGE LRRGWLGVRI QGVTDEIAES
LNIKPARGAL VAGVDDKGPA KPAGIEPGDV VVKFDGHDIK EPKDLSRIVA DTAVGKEVDV
IVIRKGQEQT LKVKLGRLED NEKVQQAAIK KDEPAEKPQT QKALGLDLAA LSKDLRTRYK
IKDSVKGVIV TGVDQGSDAA EKRLSAGDVI VEVAQEAVTS GADIKKRVDQ LKKDGKKSVL
LLVSNADGEL RFVALSLQ