Gene BBta_7794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_7794 
Symbollon 
ID5149485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp8180940 
End bp8183300 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content61% 
IMG OID640562420 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_001243527 
Protein GI148258942 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0810999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAC AAGTCTCGCC ATCAAGCCAG AGCAACACCA AGCTCCCGCC GGGCGCACTG 
ATCTTGCTGC CAGTGCGCAA TATGGTGTTG TTTCCCGGCG TCGTGATGCC GCTGACGGTG
GGGCGGCCAC GCTCCCAGGC TGCCGCCCAG GAAGCATTGC GCGGTGAGCG GCCGATCGGC
ATCGTGTTGC AAACCGATCC GACAGTCGAC GAACCGGGCG ATGAGCAGCT TCATCGTATC
GGCACAGTCG CGGAAATTCT GCGCTACGTG ACGGCGCCGG ACGGCACCCA TCATCTGATC
GTCCGCGGCA CTCGCCGGTT TCGCATTCAA AGCTTCTTGC CAGGATATCC CTTTCTGACC
GCGCAGGTGG AGGAAATCGG GGAATCCGAA GTTCTGACGC CGGAAATCGA GGGCCGGGTG
CAATTGCTGC GACAGCGGGC GCACGAGGCC ATGCAGCTCT TGCCGAATGT CCCGGCCGAG
CTTGTCGCAG GACTTGACGG GGTTCAGTCG GCCTCGGCGC TGGCCGACTT CGTCGCCAAC
CTGATGGACA TCAAGCCGTC TGACAAGCAG GACATTCTCG AAACATTCGA CGTCAAAACG
CGGCTTGAAA AAACGATTCG GTTCCTGACC GAGCGTATTC AGGTGCTCCG TATCAGCAAA
GAGATTGGCG AGCAGACGCA GGAGACACTC AGTACGCAGC AACGCGAGCA CATCCTGCGC
GAGCAGATGC GGCAGATCCA GCGCCAGCTC GGCGAGTCCG ACGATCGTTC CGCCGAGCTC
GAAGAACTGA AGCGCGCTAT CGAATCGGCA GGCATGCCGA AGGAGGTCGA GGATCAGGCC
AAGAAGGAGC TGCGGCGTCT CGAACGGATG CCAGATGCCG CCGGCGAGTA TTCCATGATC
CGCACCTATC TCGACTGGCT CATCGAATTG CCCTGGTCAA AACTGGCCGA CGAGAGGATC
GACATTCAAG AAGCGCGCCG CATCCTCGAC GAAGACCACT ACGATCTCGA CAAGGTCAAG
AAGCGCATTC TCGAATATCT TGCGGTGCGC AAGCTTAATC CGGATGGCAA AGCGCCGATT
CTCTGTTTCG TCGGCCCGCC GGGCGTCGGC AAGACCTCGC TAGGTCAGAG CATCGCCCGG
GCGACCGGAC GTCCCTTCGC GCGCCTGAGC CTCGGCGGCG TTCACGACGA GGCGGAGATT
CGCGGGCATC GCCGAACCTA TATCGGTGCG CTTCCTGGGA ACATCATTCA GGCTATTCGC
AAAGCGGGCG CGCGCAATCC GGTCCTGATG CTCGATGAGA TGGACAAGCT TGGCGCCGGG
TTTCACGGCG ACCCTTCGTC GGCCCTGCTC GAAGTGCTCG ATCCCGAGCA GAACCGCACT
TTCCGCGATA ACTATCTGGG CCTACCCTTC GACCTATCGA AAGTGCTGTT CATCGGAACC
GCCAATATTC CGGACAGCAT TCCCGGCCCC CTGCGCGATC GCATGGAGAT GATCTCGGTC
CCCGGTTATA CGGAAGATGA AAAACTGCAG ATCGCCAAGC GCTACCTCAT CAAGCGACAG
CTCGATGCTG CCGGCTTGAA GGCCGAGCAA TGCGACATTT CCGACGAGGC GCTCCGGGGG
ATCATCCGGT ATTACACGCG CGAGGCCGGC TGCCGCAACC TGGAGAGAGA AATCGGCGCG
CTCTGCCGCC ACGCCGCGAT GCGTATTGCC GAGGGCAAAG ACACCGCGGT CAAGATCGGC
GAGGCGGACC TGCCAACGAT TCTCGGGCCC CATCGCTTCG AGGACGAGGT CGCGATGCGC
ACCAGCGTCC CGGGCGTCGC CACCGGACTT GCCTGGACGC CGACCGGCGG CGACATCCTG
TTCATTGAGG CAGCGCGCGT GCCCGGAAGC GGCAAGCTCA TTCTGACGGG GCAACTTGGT
GACGTGATGA AGGAGAGCGC GCAAGCCGCG CTCAGTCTTG TCAAATCGCG CGCCGAGAGC
CTCGGCATCG ATCCTGCCCA GTTCGAAAAG TCGGATATTC ACGTTCATGT GCCGGCAGGC
GCGATTCCGA AGGACGGCCC GAGCGCGGGC GTTGCCATGT TCATCGCCTT GACCTCGCTT
CTGACCGCCC GTACGGCACG AGGCGACACC GCGATGACAG GCGAGATTTC GCTCCGTGGA
CTGGTTCTGC CGATCGGCGG CGTCAAGGAG AAGGTCCTCG CTGCGGTGCG GGGCGGCATT
GAAACGGTCA TGCTTCCCGA ACGAAACAGA AAGGATCTCG AAGATATCCC TCCTGATGCG
CGACAACGGA TCAAATTCGT TTGGATGCGG ACCGTCGATG ACGCAATCGC GGCGGCGCTT
GAACCGCCTA AGGGGAAATA G
 
Protein sequence
MNEQVSPSSQ SNTKLPPGAL ILLPVRNMVL FPGVVMPLTV GRPRSQAAAQ EALRGERPIG 
IVLQTDPTVD EPGDEQLHRI GTVAEILRYV TAPDGTHHLI VRGTRRFRIQ SFLPGYPFLT
AQVEEIGESE VLTPEIEGRV QLLRQRAHEA MQLLPNVPAE LVAGLDGVQS ASALADFVAN
LMDIKPSDKQ DILETFDVKT RLEKTIRFLT ERIQVLRISK EIGEQTQETL STQQREHILR
EQMRQIQRQL GESDDRSAEL EELKRAIESA GMPKEVEDQA KKELRRLERM PDAAGEYSMI
RTYLDWLIEL PWSKLADERI DIQEARRILD EDHYDLDKVK KRILEYLAVR KLNPDGKAPI
LCFVGPPGVG KTSLGQSIAR ATGRPFARLS LGGVHDEAEI RGHRRTYIGA LPGNIIQAIR
KAGARNPVLM LDEMDKLGAG FHGDPSSALL EVLDPEQNRT FRDNYLGLPF DLSKVLFIGT
ANIPDSIPGP LRDRMEMISV PGYTEDEKLQ IAKRYLIKRQ LDAAGLKAEQ CDISDEALRG
IIRYYTREAG CRNLEREIGA LCRHAAMRIA EGKDTAVKIG EADLPTILGP HRFEDEVAMR
TSVPGVATGL AWTPTGGDIL FIEAARVPGS GKLILTGQLG DVMKESAQAA LSLVKSRAES
LGIDPAQFEK SDIHVHVPAG AIPKDGPSAG VAMFIALTSL LTARTARGDT AMTGEISLRG
LVLPIGGVKE KVLAAVRGGI ETVMLPERNR KDLEDIPPDA RQRIKFVWMR TVDDAIAAAL
EPPKGK