Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_3952 |
Symbol | |
ID | 5151538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 4149455 |
End bp | 4152649 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640558787 |
Product | hypothetical protein |
Protein accession | YP_001239928 |
Protein GI | 148255343 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.376568 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAA CGAGTTTACG GGCCTACCGT CTGGGCGTGG ATCTCGGCGC CAATTCGCTG GGATGGTTCG TGGTCTGGCT CGACGATCAC GGACAGCCCG AGGGCCTTGG CCCGGGCGGC GTCAGGATTT TCCCCGACGG TCGTAACCCG CAATCCAAGC AATCCAATGC GGCCGGTCGC CGCCTCGCAC GCAGTGCACG ACGACGACGA GACCGCTATC TGCAGCGACG CGGAAAGCTG ATGGGCTTGC TGGTCAAGCA CGGCTTGATG CCCGCCGATG AGCCGGCCCG AAAGCGATTG GAATGCCTCG ATCCCTATGG TCTCCGCGCG AAAGCGCTCG ATGAAGTGCT GCCTTTGCAT CATGTCGGCC GGGCGCTGTT TCACCTCAAC CAGCGGCGCG GCCTGTTTGC CAATCGAGCG ATCGAGCAAG GCGACAAGGA CGCCAGCGCG ATCAAGGCCG CGGCCGGCAG ACTGCAGACA TCGATGCAGG CGTGCGGCGC GCGCACGCTC GGCGAATTCC TCAACCGCCG TCATCAGCTC CGCGCCACAG TGCGCGCCCG CAGCCCTGTC GGCGGCGACG TCCAGGCGCG GTATGAATTC TATCCGACAC GCGCGATGGT TGATGCGGAG TTCGAAGCCA TCTGGGCGGC ACAGGCACCG CATCACCCAA CGATGACGGC CGAAGCGCAT GACACGATCC GCGAGGCGAT CTTCTCTCAA CGCGCGATGA AGCGGCCGTC GATCGGGAAA TGCTCGCTCG ACCCCGCCAC CAGCCAGGAC GACGTCGACG GCTTTCGCTG CGCCTGGTCG CATCCCCTGG CGCAGCGTTT CCGCATCTGG CAGGACGTCC GCAATCTAGC CGTGGTGGAG ACTGGCCCCA CGTCTTCCAG GCTTGGCAAG GAGGATCAGG ACAAGGTCGC ACGGGCACTG CTACAGACCG ACCAACTCAG CTTCGATGAG ATCCGCGGCC TTCTCGGATT GCCGTCGGAC GCGCGGTTCA ACCTTGAAAG CGACCGGCGT GATCACCTCA AGGGCGACGC GACCGGCGCG ATCCTGTCCG CCAGGAGGCA TTTTGGCCCG GCATGGCATG ACCGGTCCCT GGATCGTCAG ATCGACATCG TCGCGCTGCT GGAGAGCGCG CTCGATGAAG CAGCGATCAT CGCCTCGCTC GGGACAACTC ACAGCCTTGA TGAAGCAGCT GCGCAGCGGG CGTTGTCCGC CTTGCTGCCT GACGGATATT GCAGGCTTGG ACTGAGGGCG ATCAAGCGGG TCCTGCCGCT CATGGAAGCT GGCAGGACCT ACGCGGAGGC CGCCAGCGCG GCCGGCTATG ATCACGCTCT GCTGCCGGGC GGCAAGCTCT CTCCCACCGG CTACCTGCCC TATTATGGAC AATGGCTGCA GAACGATGTC GTGGGCTCGG ACGATGAGCG CGACACCAAC GAACGGCGCT GGGGCCGCTT GCCGAATCCC ACCGTTCACA TCGGGATCGG CCAGTTGCGA CGCGTCGTCA ATGAGCTCAT CAGATGGCAT GGACCGCCGG CCGAGATCAC CGTCGAGTTG ACGCGTGACC TGAAGCTGTC GCCCCGACGG CTGGCGGAGC TCGAACGCGA GCAGGCCGAG AACCAGCGCA AGAACGACAA GCGTACCTCC CTATTGCGCA AGCTCGGGCT CCCCGCGAGC ACGCACAATC TCCTCAAGCT TCGGCTCTGG GACGAGCAAG GCGATGTTGC AAGCGAATGC CCCTATACGG GCGAGGCGAT CGGCCTCGAA CGTCTGGTCT CTGATGATGT GGATATCGAT CACCTCATCC CATTCTCGAT CAGCTGGGAC GACAGCGCGG CCAACAAAGT GGTCTGCATG CGCTACGCCA ATCGTGAGAA GGGCAATCGA ACGCCGTTCG AGGCCTTTGG CCATCGCCAA GGCAGGCCTT ACGATTGGGC GGACATTGCA GAACGCGCAG CGCGCCTGCC GCGCGGCAAG CGCTGGCGCT TCGGTCCAGG CGCGCGGGCG CAATTCGAGG AGCTCGGCGA CTTTCAGGCA CGCCTGCTCA ACGAGACCAG CTGGCTGGCG CGCGTCGCCA AGCAATATCT CGCAGCGGTC ACCCACCCGC ACAGGATCCA CGTTCTGCCG GGCCGGCTGA CAGCGCTGCT CCGCGCAACA TGGGAGCTCA ACGATTTGCT GCCCGGAAGC GACGACAGAG CCGCGAAGAG CCGCAAGGAC CACCGTCATC ATGCCATCGA CGCGCTGGTG GCGGCACTGA CAGACCAGGC GCTGCTGCGC CGCATGGCGA ACGCGCATGA CGATACGCGA CGGAAGATCG AAGTTCTCCT GCCCTGGCCG ACGTTCCGGA TCGATCTCGA GACCAGGCTG AAGGCGATGC TCGTATCGCA CAAGCCCGAT CACGGCCTCC AGGCCCGCCT GCATGAAGAC ACCGCCTATG GGACCGTCGA ACACCCCGAA ACCGAGGATG GTGCAAATCT GGTCTATCGG AAGACCTTCG TGGACATCAG CGAAAAGGAG ATCGACCGCA TTCGCGATCG CCGCTTGCGT GACCTCGTCA GAGCCCATGT GGCCGGCGAA AGGCAGCAGG GCAAGACGCT CAAAGCGGCG GTGCTGTCAT TCGCGCAGCG CAGGGACATT GCTGGTCACC CGAATGGCAT TCGCCATGTC CGCCTGACCA AATCGATCAA GCCGGACTAT CTGGTACCGA TCCGCGACAA AGCCGGCCGC ATCTACAAGT CCTACAATGC AGGCGAGAAT GCCTTCGTCG ACATCCTGCA AGCCGAGAGT GGCCGATGGA TCGCGCGGGC CACGACCGTC TTTCAGGCCA ATCAAGCCAA TGAGTCGCAT GACGCGCCAG CGGCGCAACC GATCATGCGG GTCTTCAAGG GCGACATGCT GCGCATCGAT CACGCTGGCG CGGAGAAGTT CGTGAAGATC GTCAGGCTTT CGCCCTCGAA CAACCTGCTC TACCTCGTCG AACATCATCA GGCGGGCGTG TTTCAGACCC GCCATGACGA CCCGGAAGAT TCCTTTCGGT GGCTCTTCGC CAGTTTTGAC AAGCTTCGCG AATGGAACGC CGAGCTTGTC CGGATCGATA CGCTGGGACA GCCCTGGCGG CGCAAGCGCG GCCTTGAAAC AGGAAGCGAG GACGCCACTC GCATCGGCTG GACGCGACCA AAAAAATGGC CCTGA
|
Protein sequence | MKRTSLRAYR LGVDLGANSL GWFVVWLDDH GQPEGLGPGG VRIFPDGRNP QSKQSNAAGR RLARSARRRR DRYLQRRGKL MGLLVKHGLM PADEPARKRL ECLDPYGLRA KALDEVLPLH HVGRALFHLN QRRGLFANRA IEQGDKDASA IKAAAGRLQT SMQACGARTL GEFLNRRHQL RATVRARSPV GGDVQARYEF YPTRAMVDAE FEAIWAAQAP HHPTMTAEAH DTIREAIFSQ RAMKRPSIGK CSLDPATSQD DVDGFRCAWS HPLAQRFRIW QDVRNLAVVE TGPTSSRLGK EDQDKVARAL LQTDQLSFDE IRGLLGLPSD ARFNLESDRR DHLKGDATGA ILSARRHFGP AWHDRSLDRQ IDIVALLESA LDEAAIIASL GTTHSLDEAA AQRALSALLP DGYCRLGLRA IKRVLPLMEA GRTYAEAASA AGYDHALLPG GKLSPTGYLP YYGQWLQNDV VGSDDERDTN ERRWGRLPNP TVHIGIGQLR RVVNELIRWH GPPAEITVEL TRDLKLSPRR LAELEREQAE NQRKNDKRTS LLRKLGLPAS THNLLKLRLW DEQGDVASEC PYTGEAIGLE RLVSDDVDID HLIPFSISWD DSAANKVVCM RYANREKGNR TPFEAFGHRQ GRPYDWADIA ERAARLPRGK RWRFGPGARA QFEELGDFQA RLLNETSWLA RVAKQYLAAV THPHRIHVLP GRLTALLRAT WELNDLLPGS DDRAAKSRKD HRHHAIDALV AALTDQALLR RMANAHDDTR RKIEVLLPWP TFRIDLETRL KAMLVSHKPD HGLQARLHED TAYGTVEHPE TEDGANLVYR KTFVDISEKE IDRIRDRRLR DLVRAHVAGE RQQGKTLKAA VLSFAQRRDI AGHPNGIRHV RLTKSIKPDY LVPIRDKAGR IYKSYNAGEN AFVDILQAES GRWIARATTV FQANQANESH DAPAAQPIMR VFKGDMLRID HAGAEKFVKI VRLSPSNNLL YLVEHHQAGV FQTRHDDPED SFRWLFASFD KLREWNAELV RIDTLGQPWR RKRGLETGSE DATRIGWTRP KKWP
|
| |