Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_1158 |
Symbol | |
ID | 5153853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 1229626 |
End bp | 1232541 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640556146 |
Product | hypothetical protein |
Protein accession | YP_001237313 |
Protein GI | 148252728 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0746072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCAAG CGGTTCCATT TCAGACACGC GCACGCACGA TCGACCACCT TGGACGGGGC CAAATCGCCG ACGCTCCGAC CGCAGTCAGC GAACTTTGGA AGAATGCCTA CGATGCCTAC GCGAAAAACG TAGCTCTCCA TATTTTTGGT GGAAGCCCGG AAGTAGCAGC TGTTTTCGAT GACGGAATCG GAATGAGCCG CGAGGATGTA ATACAACGCT GGCTCGTAAT CGGAACAGAG AGCAAGTTTG ACAATGCGGA TACGACGCCA CCCGAGACGT TCGGTCTGCC GCCTCGCCCG CGACAAGGCG AGAAGGGCAT TGGGCGTCTC TCCGTTGCCT TTCTTGCACC AGGAACCATC CTCATCACTA AGCGGGCCAA CTCCGAATTC GTTGTTGTCG TAGTCGACTG GCGTCTGTTC GAAAATCCCT TTGTCGCGTT GGACGATATC CAGCTTCCCG TCGAAACCTT TCCGAACGCC GACGCATTGG CAAATGGACT ACCCACACTC TTGGATACAC TTAAGTCCAA TCTCGGAAAT GCGTCCGACG ACCGCGGCAA GCGACTCGCC GATAGCTGGG GACGATTTTC TGATTACGAG AAGGAGAAAG GCTTTTCTTC AACCCTTGAG GCGATCAAGT CGTCATGGGT CAAAATGCCG TTGACAAAGC GGCATCTGGA GGAGTGGCCA GTCTTTTTGG ATCTGGTCGA CCACGGCACC GCAATGTTCA TGATCTCGCT CAACAACGAG TTGGGCGTCT GGGTGCGCCC AGCCGAAACG GGTGAGGAAG TTGAAGCAGT CAAGCGGCGC CTCAAGCAAA CACTTACCGC ATTTACCGAC CCTTACTCGG AAGCGCCTCC ACAGTTCGAC TATGAAGTCT ACCTTCATCG CGGCAACGGC GACGAAAGAG TGATAGGAGC CTCCGAAGTC TTCGGCATCG ACGCGCTTCA CGATCTTGAG CACTACATTG ACGGCCGCTT TGACGAAAAA GGTACATTTA CGGGACAGGT CGTTGCATTT GGAAAAGACC TTGGTACGCG GACATTCACA CCGGCTCGCG CGCTTCCGAC TAAGGGGCGC GATCGTTTAG GACCCTTTAC GTTTGCAATT GGGACTTTTG AATTTGACGA GCGCAGGTCT ACTCACGACG AGAACCAGCA TGCGCATCTT ATGGAACAAG CCAAAAACTC TTCCGGTATC TTTGTCTATC GCGACCTCTT GCGCGTCATG CCCTATGGAC GTCCGGACGC AGATTTTCTG CAGCTAGAAG AACGCCGAAG CATGCATGCC GGACGCGAAT TCTGGGCGCA TCGACGAAGC TTTGGCCGTA TTGGCTTTAC GCGAGCAGAT AACCCGGCCC TTAAAGACAA GGCCGGTCGC GAAGGACTGG TCGACAATCG TGCCTTCCGC GAAATGTGCC TTCTGGTTAT TGACTTTTTG ATGGATGCCG CACGCAAGTA TTACGGAACG GATGCGCCTC TCCGAGAAGA GCTGTTGCCG GGCATTATGG AACGCAAAGC GCTTCAGAAA GAAGCTGCTG ATAAAGCGCG AGCGCGACGC CGTAGGGGCA TGAGGCAGTT TCTGAGGGAG CAATCCTCAC CTTTGGAGGA TGCTCTCCAA CGAGCCGAAT CGTTGATGGC GCTTGCAAAG GACACTCTGA CGAAGAACGA TAAGGTGCAG GCAACCGTCT TGGCGGCCCG GGTGCGCGAC ATTCGCGCTC TCGGCGAGAC CCTGCGGCCA CCGACGCCGC CTTCAAGGCT TGGTGATCTT GAGGTCGAAT GGCGCACCTA TCGTGATAAT TATAACGCGT TTCTCGGCAA ACTCAAGACG GTCGCAGGAC TTGCTGCCGA AGTTGAAACT GCAATGGAGG CAGAAAAGCC AAAAGACGTT TTGGCGTCGC ACTACGGTGA GCAACGTGAG ATTCTAGTCA ATCAGCTCAA AGATTTTTCG GTCTCAATCG ACGAGCGACT AAAGCGGCTT CGGTTAAAAT GGCAAGACCA ACAAAAGGCC GACGAAGCTG AAATAGAGAA TCGCATAGGT TACTTGCTTG AAACCAAAGT GAACGCTGCC AATCTTTTGG CGATGTTGAA TCTGATCGAC ACGAACCGAG CTGAATTGTC CGACGTCTTC GCCGCACGAT ATCAATCATT CATCGGTGCG CTCGATCAAT TGATCGAAGG TATCGATCTG CAAGGCGCGG TCGACATTGT TAACGAACGG CAAGAGGAAC TTGAAGAGCA GCTTCGGGAT ATTCGCGCCG TCGCGCAGAT TGGGATAACC GTTGAAATCA TCGGTCATGA GTTCGAGACG CTGGAATCTG AAGTCAGACG TAACCTGGGA AAACTTCCGT CAGACGTTCG CGAGACCGCG TCATACAAGC AGGCACTTCG CTCGCATCAG GCTCTCGCTG ATCGACTCCG GTTTCTCGCG CCCTTGAAGA TCGGTGGATA TCGTACGCGC GAAACGATAT CCGGACAACA GATTGCCGAT TACATTTCGG AATTCTTCTC CAAGATGTTC CTGGATCAGC GGATCGATTT TCTGGCTACT GCGGCGTTCA GAAGAATCTC GATTGTGGAC ATTCCGTCTC GGATCTTCCC GGTTTTCATC AACCTCATCA ATAATTCAGT TTATTGGGTC AGCCAGTCGG CGGAGCGCTA CATTCGTCTC GATTTCAAGA ACGGCCTCAC CGTCGTTGCT GATAGCGGCA AAGGCGTCGA TCCCGAAGAT GTTCCTCGGC TATTCAACAT CTTCTTCACC CGTCGCCGTT CGGGTCGGGG CGTCGGCCTC TATCTCAGTC GCGCAAACCT TGCCGTCGCG GGGCATAAGA TCAGATATGC AACCAACGCC GATCCCCACA TACTCGAGGG CGCGAATTTC ATAATTGACT TTAAAGGTGT GCGAACCGAT GCCTGA
|
Protein sequence | MIQAVPFQTR ARTIDHLGRG QIADAPTAVS ELWKNAYDAY AKNVALHIFG GSPEVAAVFD DGIGMSREDV IQRWLVIGTE SKFDNADTTP PETFGLPPRP RQGEKGIGRL SVAFLAPGTI LITKRANSEF VVVVVDWRLF ENPFVALDDI QLPVETFPNA DALANGLPTL LDTLKSNLGN ASDDRGKRLA DSWGRFSDYE KEKGFSSTLE AIKSSWVKMP LTKRHLEEWP VFLDLVDHGT AMFMISLNNE LGVWVRPAET GEEVEAVKRR LKQTLTAFTD PYSEAPPQFD YEVYLHRGNG DERVIGASEV FGIDALHDLE HYIDGRFDEK GTFTGQVVAF GKDLGTRTFT PARALPTKGR DRLGPFTFAI GTFEFDERRS THDENQHAHL MEQAKNSSGI FVYRDLLRVM PYGRPDADFL QLEERRSMHA GREFWAHRRS FGRIGFTRAD NPALKDKAGR EGLVDNRAFR EMCLLVIDFL MDAARKYYGT DAPLREELLP GIMERKALQK EAADKARARR RRGMRQFLRE QSSPLEDALQ RAESLMALAK DTLTKNDKVQ ATVLAARVRD IRALGETLRP PTPPSRLGDL EVEWRTYRDN YNAFLGKLKT VAGLAAEVET AMEAEKPKDV LASHYGEQRE ILVNQLKDFS VSIDERLKRL RLKWQDQQKA DEAEIENRIG YLLETKVNAA NLLAMLNLID TNRAELSDVF AARYQSFIGA LDQLIEGIDL QGAVDIVNER QEELEEQLRD IRAVAQIGIT VEIIGHEFET LESEVRRNLG KLPSDVRETA SYKQALRSHQ ALADRLRFLA PLKIGGYRTR ETISGQQIAD YISEFFSKMF LDQRIDFLAT AAFRRISIVD IPSRIFPVFI NLINNSVYWV SQSAERYIRL DFKNGLTVVA DSGKGVDPED VPRLFNIFFT RRRSGRGVGL YLSRANLAVA GHKIRYATNA DPHILEGANF IIDFKGVRTD A
|
| |