Gene BBta_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4801 
Symbol 
ID5154272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5028327 
End bp5034830 
Gene Length6504 bp 
Protein Length2167 aa 
Translation table11 
GC content68% 
IMG OID640559600 
Producthypothetical protein 
Protein accessionYP_001240731 
Protein GI148256146 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACG TTCCAGCCAC ATCATCGGCC GCCACGACGT CCGGCGTGCA GCCGCAGTTC 
TGTTCACTGG CCGATGCCCT CGCCCATTAC GGACGGGTCC AACCGGATCG CCTCGCCATT
CTGGCGCCCG ATCGAATCGC ACTGACCTAT GGCGGCCTGT GGCAGCGGAC GACCGAGATC
ATCGCCGAGC TGCGCGGGTT CGGTCTCGGC GCCAGGGATC GCGTGGCCGT GGTGTTGCCG
AACGGCGCCG ATGCCGCCGT GGCGACGGTC GCCGTCGCCT GCGGGGCCGT CTGCGTGCCG
CTCCATGCGG GTTTTTCGTC GGACGAAGTG CGGCGCGCGC TGAGCGACCT CGAGATCACC
GCGCTGCTGA CCTGTCCGGG GATCGAATCG GTCAGCCGCA GCGTGGCCTA TGCCATGGCG
ATCCCGGTCA TCGATCTGTC ATTCCGCGCC GACGCGGCGA TCGGATCCTT CGATCTCACC
TGCCCGGCGC CGCGGCCGGC CGTAACCTGC GACATGCCGC AACCGTCGGA CGACGCCTTC
GTGCTCCTGA CATCCGGCAG CACGGCGCAG CCGAAGCTGG TGCCTTTGAC ACAGGCCGGC
ATTTGCCACT CGGCCTATAG CGCGGGCGTT GCTCTGGCGC TCGCGCCGCA CGACCGGCTG
ATCAACGTCC AGCCGCTGGT GCATGCGCAC GGTCTGATCT CCGGCCTGCT GACGGCGCTT
GCATCGGGAT CGAGCGTGGT CTGTCCGCCG GAATTCGACG CCGCGGCGTT CCTGGATTGG
CTGGCGGCGT TCGAGGCCAG CTGGTACACG GCCGTGCCGC CGATCCACCG CGCGTTGATC
GCAGCGGCGC ACCGGCGCAA GGACGCCGTC AAGACCCGCC TGCGGCTGAT CCGATCCGCA
TCCTCTTCGC TGCCCACCAG CGTCCTCGAC GAGCTGGAAA GCCTGTTCGG CGTTCCCGTC
ATCGAGACCT ATGGCATGAC CGAAGCCGCA AGCCAGATCG CTGCCAATCC CCTCGAACGG
CGCAAGCCCG GCTCGGTCGG CAAGCCCGCC GGCGCCGCCA TCGCGATCAT GGATGACCAG
GGCCGCGTGC TCGCGGCCGG GCAACGCGGC GAGGTCGTGC TCCAGGGGCC CGCGATCACC
CGCGGCTACT ACAAGAACGA GACCGCGACA CGCGCAGCGT TCCGCGACGG CTGGTTCAGG
ACCGGCGATC TCGGTTACCT CGACTCCGAT GGCTATCTCT TCCTGCTCGG CCGCATCAAC
AAGGCCGACA TCATCAATCG CGGTGGACAG AAGGTCTCGC CGAGGGAGGT GGAGAATGCC
TTGATGCGCC ACCCCGACGT GGCCGAAGCG GTGGTGTTTC CGATTCCGCA TACGCGGCTC
GGTGAGGACG TCGCCGCCGC GGTTATTGCG AGGCCGCAGC ACAAGATCGA CATCAAGAAG
CTCCGGCGCT TCGCCAGCGA GCGGCTGGCG CGGTTCAAGG TCCCCGGCCT GATCCGCGTC
GTCACGGCTT TCCCGAAAGA CGCCGACGGC AGGGTTGTTC GCGGTGAGCT GGCCGGCCAG
CTGTCGATCG CGGCGCCACG CTCGCACATC CATCGCGGCG GACAGCTGGT GCCGGCGCGC
TCCGAAACCG AGTGGCAGCT CGCCAGCATG TGGGCCGACC TGCTCGGGCT CAATGAAATC
GGTGTGAATG AGGACGTCTT TGCGCTCGGC GCGGATTCGC TCACGATCAC CCAGCTGATC
GCCCGCCTCC GGGCGCGCTA TGACGCCGAG ATCTCGTTCA AGGACATTTT CGACGCGCCG
ACGGTGGCCG CGCTCGCTGA ACGTGTCGAG ATGCTGCGAG ACCATGCGGG TGCGCAGCGC
CTGCAGACGG CCAGCGCTCC GGAGCAGAGC GGCCCCCTGT CGCTCCAGCA GCAGCGCATC
CATCTGCTCG CCGCGATCGA TCCCGATCCA TCCCGATACC ACGTGATCTC GGGACTCATG
CTGACCGGCC CGCTCGACAT CGGTGCGCTC GACGCCAGCA TCGCGTCGGT CTGCGACCGC
CACGAGACGC TGCGTTCGAT CTTTCCCGAC CAGCAGGGAG AGGTACGGCA GACGGTAACC
CCCCGGCACC CTTCGATCGA GCGGTGCGAT CTTCGCGCCG TCCCTCAATC CGGGCAGATG
GCGGCCGTCC GGACTCACAT GCTCGACCTG CTGCGGTCCC CGTTCGAGAT CGAGACCACC
CCGCCGGTGC ACATCCAGCT TCTGCAGCTC GGTGATCAAC AGCATGTCCT CCTGGTCAAG
CTGCATCATC TGATCACCGA TGGTTGGTCG CACCGGCTGT TCTTCGACGA GCTCGAACGA
CTCTACAATG GCTATTGCCA CGGGAAGCCG TGCCGCCTCG ACGAGCTGCC GCAGCAATAT
CGGCACTTCG TGCAGTGGCA GCGCGCGTGG CTCGCCACGC CGGCGGCCGA GGCGCAATTG
AGCCATTGGC GCCGCCGGCT GGAGGGCCTG ACGGAACTGC CGCTGCGAAC CGACCGGCCG
CGGCCCGAGC AGTGGACCGG CCGCGGCGCC CGCATCCCGG TTCGTTTGTC GGCAAAGCTG
TCGGAGCGGC TGCGCAGCTT CAGCCGCGCC AACCATGCCA GCCTGTTCAT GACACTGCTG
GGCACGTTTC AATGCCTGCT CTGCCGCTAC ACCGACCATC ATGACATCGC CGTGGGCTCG
CTGATCGCCA ATCGCAGTCA GCTGGAGATC GAACGGCTGA TCGGCATGTT CGCCAACGCG
ATCGTGCTGC GGACCGATCT GACCGGCGAT CCGACCTTCC GCGAAGTGCT GAAGCGGGTC
CGCGACGTGA CGCTGGAGGC CTATCGCCAC CAGGAGCTGC CGATCGAGGA GCTCCTGCGC
ACGCTGCGTC TGCCGCGCAG GCTGGACCGC AACCCGCTGT TCCGGGTGAT GTTCATCCTG
CAGAAGGCGG CGAAGCCGCT CGCGCTCGAG AACCTGTCGG CGCGCGCCAT CGATCCCGAC
CCGGGCATCG CGCGATCCGA CCTGGTGTTG GAGCTGATCG ACGATGGCGG AGCGCTGGGC
GGCTGGCTCG AATATAGCAG CGAGTTGTTC GAAGCGACGA CGATCGAGCG GATGGTCGGC
CATTTCCGCA CCCTGTTGGA GCAGGTCATC GCGGCTCCCG ATCTGCCATT CTCGGAGCTG
CCCTTGCTGT CACCGGCGGA GCAACGGCAC CTCGTCGAGG GCTGGACCCC CACCTCGGCG
ACGCCGGCAA ACTCAGACGA TGTCCTCACG CGCTTCGCGC GCCAGGTCGA GCGCGCACCT
GCAGCTCCGG CCGTCTCCTG CGGCGAGACG AAGCTCAGCT ATGCCGGCCT GGCGCAGCGC
GCCGAGGCCA TTGCAGGCGG GCTGCAACGC ACGCCAATCA GCGACGGCGA CATCGTCGTG
CTGTTCGCCG AGCGCAGCGT CGACTATGTC GCGGCGCTGA TCGCCGTGCA GCAGACCGGG
GCGGCGTTCC TGCCGCTCGA TCCCAGCTTG CCGGCACTCC GGCTGACGAA AATTCTCCGG
CACAGCGCCG CGCGGATCGT CCTCGCCACG CAGCGGAGCG CCGCGGCGCT ACGAGCCGCA
CTCGCTGACC TGCCGCGCAC GGCGCAGCCC GACGTGCTGC TGCTGGATGA CATCGCGCCC
CCGAAGACGA CGCGGGCGGT CCCGGCTTCA CCGCGATCGC CCGCCTCGCT CGCCTGCGTC
ATCTACACGT CAGGCTCCAC CGGCGAACCA AAGGGGGCGA TGATCGCGCA GCGCGGCATG
GTCAATCATC TCCTGTCCAA GATCGCCGAT CTTGGCCTGT CATCCGGCGA CGTGGTCGCG
CAGACCTCGC CGCAGAGCTT CGTCATCGCA ATCTGGCAAT GTCTCGCGCC GCTGATGGTC
GGCGCCCAGG TCCATATCAT CGGTGATCAC GATGTTCAGG ACCAGGCGCG GCTGGTGCAC
GAGATGGCGC GCGAGGGCAC GACGGTGCTG GAGATCGTGC CGAGCCAATT GCGCGCTTTC
CTGCAGCCGG CTCCGGATGC CGCCACGACG CGCGCACTCG GCCAGCTGCG CGCACTGATC
GCGACCGGCG AGAGCCTGGC GCCGGATCTG TGCGAGGACT GGTTCAGGCA CTTCCCGCAG
GTGCCGCTCA TCAATGCCTA TGGCGCCACC GAATGCTCCG ACGACGTGGC GACCCATCGC
ATGGTCGCGC CACCGTCTGC ATCAAGCACG GTGCCGATCG GCCGTCCCAT CGCCAATGTA
CGGCTGCACG TCCTCGATCG GCACCTGCAG CCTGTGCCGA TCGGGATTGC CGGAGAGCTC
TATGTCGGCG GCGTCGCCGT CGGGCTCGGC TATCTCAACG ATCCCGGCCA GACACGCAGC
CGCTTCCTGC CTGACCCGTA CTCGCCGGAC AGCAGCGCGC GCCTGTATCG CACCGGCGAC
CTGGCACGTT GGCGCGCAGA CGGCACGCTG GAATGCTTCG GCCGGGTCGA TCAGCAGGTG
AAGGTCCGTG GCTGCCGGGT TGAGCTCGAA GAGATCGAGC ACGCTTTGGC GCAGCATCCG
GCCGTGCGAG CCGCGGCCAT CCTGGCGCGC GACACGCGCT ATGGCGACAC GCAGCTGACC
GCCTATATCG TCGCCGCCGA CGGCCAGCCG CCAGCGGTCG ATGACCTCAA CGGCTTCGCC
CGGAGCAGGC TTCCGGCACA TATGATTCCA GCCGGCTATG TCATGCTGGA TCAGCTGCCG
GTGACCGCGC ATGGCAAGCT CGACCGCACC GCCCTGGCGG CGCTGGGATC GCTCCCGGCA
ACGACAGGCG GTGACCACGT TGCGCCGCGG ACCCCGACCG AGCAACTGCT CGCCGGCATC
TGGGTCGACG TGCTCGGATG CGGGGAATGC GGGGTGACCA GCAACTTCTT CGATCTTGGG
GGACATTCGC TGCTCGCAGG CCGCGTCCTG GCGCGCGTGG CGAGCACACT CGGCGTCTCC
CTGCCGATCC GAACCTTGTT CGAGGCCAGC ACGATCGAAG CGCTGGCGCT ACGCGTCGAC
CAGGCCCGCG CCACGGCCGC ACCCCAAACG CTGCAGGTGC AATGGCGATC CGATCAGCGT
ACCGTCATCT CGATCCAGCA GGATCAGATC GTCCGTACCG AGCGTGACCT GCCGGGCCTG
CCGCAATTCA ACCTGCCTTA TGCCTACCGG CTGCGCGGCC CGCTCGACGC CCGGGCGCTC
GCAGCCAGCC TCGTGGACGT CATCCGCAGG CACCCGTCCC TGCGCACCAG TTTCCACTCG
GTTGACGCCG CCCCGGAGCC GCAAGTGCTG GACAGCGCGG CGATCACGTC GGTGCTCGAG
ACCGAGACCA TCGCGATCCC CGCGCCTCCG GGAGACAAGC GAGCCCGGAC GCTCATGCTG
AAAAAGGCGG AGCTGCGCAT CGCGCAGGAG GCCTGGCGTC CGTTCGACCT CGCGCGCGCC
CCGCTGCTGC GGGCGCGGCT GCTGCGGTTC GGTCCGGACG ACCATGTCCT GGTCCTGATC
ATGCATCATG TGATCGTGGA CGGTTGGTCG ATGGGTGTCC TGTTCGAGGA GATCGCCGCG
CTCTATGCAG CTCACATCGG CGGTCGCGAG GCATCGCTGC CGACCCCGGT TGTCGCATTC
TCCGAATTTG CTGCCTGGCA GGCGAAATGG TGCATCAGCG CGGCCGCTGC CCGGCAACTT
GCTGATTGGA CGCATCGCCT GCGCGGCGCC TCGCCCCTCT TCAACGCGCC CGCCGGCCAG
GCCGGAGAGC AGCCGAGCCC GCGTGTCGCC ACTGCAGCGT TTCATCTGCC GCGGCGGTTG
ATCGACCGGC TGACCAGGCT GAGCCATGGC GACGGCGCAA CCATATTCAT GGCGCTGATG
GCGGGCTTCA AGGCGATGCT GATGGCCCGC ACGGGCCGCG GCGACATCTG CGTCGGCACC
ACCATGGCCA ACCGCTTCGA GCGCTGGACG GAGCGCATCG TGGGCCCCGT CGAGAACACG
ACGATCGTGC GGACCCAGCT CGAACCCGAG CTGTCGTTCC GCGAGATGCT TCGGCGCGTC
CGCGACAGCC TGCTGGACGC GCATGCCCGG CAGGAGCTGC CGTTCGAAAC CCTGGTCGCG
GAGTTTTCCG GCGCGGCCGA GCTCGACCTG GCGGCGCTCA CGCAAGTGTA TTTCATCTTC
CAGAACGCGA TTCCGAAGCC GTTCGAGCTG TCCGGACTCG ACGTGCAGCG TTTCGACAGC
GCAACGGCCG AGGGCCGGCC CGTGCTGCCC GTCGACAGCG CCTACCTGAC CATCATGGTC
GAGGAGTTGC CGACCGGGCT CACCGGCGCC TGCATCTACA AGCCGGACCA GCTCAGCGAA
AAAAACGTGA CGTCCTGGCT CGATGACTAT TTGGCGATCC TGGCAGACGG CGCCGATCGG
CCGGACACTC CGCTTCACCT CCTGCCAGCC TCGCGCTTGG TTGGCCAAGC CGACCATCAC
ACCAGGCAAA GCCAGGCCGA ATAG
 
Protein sequence
MSNVPATSSA ATTSGVQPQF CSLADALAHY GRVQPDRLAI LAPDRIALTY GGLWQRTTEI 
IAELRGFGLG ARDRVAVVLP NGADAAVATV AVACGAVCVP LHAGFSSDEV RRALSDLEIT
ALLTCPGIES VSRSVAYAMA IPVIDLSFRA DAAIGSFDLT CPAPRPAVTC DMPQPSDDAF
VLLTSGSTAQ PKLVPLTQAG ICHSAYSAGV ALALAPHDRL INVQPLVHAH GLISGLLTAL
ASGSSVVCPP EFDAAAFLDW LAAFEASWYT AVPPIHRALI AAAHRRKDAV KTRLRLIRSA
SSSLPTSVLD ELESLFGVPV IETYGMTEAA SQIAANPLER RKPGSVGKPA GAAIAIMDDQ
GRVLAAGQRG EVVLQGPAIT RGYYKNETAT RAAFRDGWFR TGDLGYLDSD GYLFLLGRIN
KADIINRGGQ KVSPREVENA LMRHPDVAEA VVFPIPHTRL GEDVAAAVIA RPQHKIDIKK
LRRFASERLA RFKVPGLIRV VTAFPKDADG RVVRGELAGQ LSIAAPRSHI HRGGQLVPAR
SETEWQLASM WADLLGLNEI GVNEDVFALG ADSLTITQLI ARLRARYDAE ISFKDIFDAP
TVAALAERVE MLRDHAGAQR LQTASAPEQS GPLSLQQQRI HLLAAIDPDP SRYHVISGLM
LTGPLDIGAL DASIASVCDR HETLRSIFPD QQGEVRQTVT PRHPSIERCD LRAVPQSGQM
AAVRTHMLDL LRSPFEIETT PPVHIQLLQL GDQQHVLLVK LHHLITDGWS HRLFFDELER
LYNGYCHGKP CRLDELPQQY RHFVQWQRAW LATPAAEAQL SHWRRRLEGL TELPLRTDRP
RPEQWTGRGA RIPVRLSAKL SERLRSFSRA NHASLFMTLL GTFQCLLCRY TDHHDIAVGS
LIANRSQLEI ERLIGMFANA IVLRTDLTGD PTFREVLKRV RDVTLEAYRH QELPIEELLR
TLRLPRRLDR NPLFRVMFIL QKAAKPLALE NLSARAIDPD PGIARSDLVL ELIDDGGALG
GWLEYSSELF EATTIERMVG HFRTLLEQVI AAPDLPFSEL PLLSPAEQRH LVEGWTPTSA
TPANSDDVLT RFARQVERAP AAPAVSCGET KLSYAGLAQR AEAIAGGLQR TPISDGDIVV
LFAERSVDYV AALIAVQQTG AAFLPLDPSL PALRLTKILR HSAARIVLAT QRSAAALRAA
LADLPRTAQP DVLLLDDIAP PKTTRAVPAS PRSPASLACV IYTSGSTGEP KGAMIAQRGM
VNHLLSKIAD LGLSSGDVVA QTSPQSFVIA IWQCLAPLMV GAQVHIIGDH DVQDQARLVH
EMAREGTTVL EIVPSQLRAF LQPAPDAATT RALGQLRALI ATGESLAPDL CEDWFRHFPQ
VPLINAYGAT ECSDDVATHR MVAPPSASST VPIGRPIANV RLHVLDRHLQ PVPIGIAGEL
YVGGVAVGLG YLNDPGQTRS RFLPDPYSPD SSARLYRTGD LARWRADGTL ECFGRVDQQV
KVRGCRVELE EIEHALAQHP AVRAAAILAR DTRYGDTQLT AYIVAADGQP PAVDDLNGFA
RSRLPAHMIP AGYVMLDQLP VTAHGKLDRT ALAALGSLPA TTGGDHVAPR TPTEQLLAGI
WVDVLGCGEC GVTSNFFDLG GHSLLAGRVL ARVASTLGVS LPIRTLFEAS TIEALALRVD
QARATAAPQT LQVQWRSDQR TVISIQQDQI VRTERDLPGL PQFNLPYAYR LRGPLDARAL
AASLVDVIRR HPSLRTSFHS VDAAPEPQVL DSAAITSVLE TETIAIPAPP GDKRARTLML
KKAELRIAQE AWRPFDLARA PLLRARLLRF GPDDHVLVLI MHHVIVDGWS MGVLFEEIAA
LYAAHIGGRE ASLPTPVVAF SEFAAWQAKW CISAAAARQL ADWTHRLRGA SPLFNAPAGQ
AGEQPSPRVA TAAFHLPRRL IDRLTRLSHG DGATIFMALM AGFKAMLMAR TGRGDICVGT
TMANRFERWT ERIVGPVENT TIVRTQLEPE LSFREMLRRV RDSLLDAHAR QELPFETLVA
EFSGAAELDL AALTQVYFIF QNAIPKPFEL SGLDVQRFDS ATAEGRPVLP VDSAYLTIMV
EELPTGLTGA CIYKPDQLSE KNVTSWLDDY LAILADGADR PDTPLHLLPA SRLVGQADHH
TRQSQAE