Gene BBta_3952 details


Gene Information

Locus tag: BBta_3952
Symbol:
ID: 5151538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4149455
End bp: 4152649
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 63%
IMG OID: 640558787
Product: hypothetical protein
Protein accession: YP_001239928
Protein GI: 148255343
COG category: [S] Function unknown
COG ID: [COG3513] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01865] CRISPR-associated protein, Csn1 family
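
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4149455, 4152649)
print(length)                              # 3195
print(expected_protein_length_aa(length))  # 1064
```

Under these assumptions the computed values agree with the listed Gene Length (3195 bp) and Protein Length (1064 aa).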


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.376568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGAA CGAGTTTACG GGCCTACCGT CTGGGCGTGG ATCTCGGCGC CAATTCGCTG 
GGATGGTTCG TGGTCTGGCT CGACGATCAC GGACAGCCCG AGGGCCTTGG CCCGGGCGGC
GTCAGGATTT TCCCCGACGG TCGTAACCCG CAATCCAAGC AATCCAATGC GGCCGGTCGC
CGCCTCGCAC GCAGTGCACG ACGACGACGA GACCGCTATC TGCAGCGACG CGGAAAGCTG
ATGGGCTTGC TGGTCAAGCA CGGCTTGATG CCCGCCGATG AGCCGGCCCG AAAGCGATTG
GAATGCCTCG ATCCCTATGG TCTCCGCGCG AAAGCGCTCG ATGAAGTGCT GCCTTTGCAT
CATGTCGGCC GGGCGCTGTT TCACCTCAAC CAGCGGCGCG GCCTGTTTGC CAATCGAGCG
ATCGAGCAAG GCGACAAGGA CGCCAGCGCG ATCAAGGCCG CGGCCGGCAG ACTGCAGACA
TCGATGCAGG CGTGCGGCGC GCGCACGCTC GGCGAATTCC TCAACCGCCG TCATCAGCTC
CGCGCCACAG TGCGCGCCCG CAGCCCTGTC GGCGGCGACG TCCAGGCGCG GTATGAATTC
TATCCGACAC GCGCGATGGT TGATGCGGAG TTCGAAGCCA TCTGGGCGGC ACAGGCACCG
CATCACCCAA CGATGACGGC CGAAGCGCAT GACACGATCC GCGAGGCGAT CTTCTCTCAA
CGCGCGATGA AGCGGCCGTC GATCGGGAAA TGCTCGCTCG ACCCCGCCAC CAGCCAGGAC
GACGTCGACG GCTTTCGCTG CGCCTGGTCG CATCCCCTGG CGCAGCGTTT CCGCATCTGG
CAGGACGTCC GCAATCTAGC CGTGGTGGAG ACTGGCCCCA CGTCTTCCAG GCTTGGCAAG
GAGGATCAGG ACAAGGTCGC ACGGGCACTG CTACAGACCG ACCAACTCAG CTTCGATGAG
ATCCGCGGCC TTCTCGGATT GCCGTCGGAC GCGCGGTTCA ACCTTGAAAG CGACCGGCGT
GATCACCTCA AGGGCGACGC GACCGGCGCG ATCCTGTCCG CCAGGAGGCA TTTTGGCCCG
GCATGGCATG ACCGGTCCCT GGATCGTCAG ATCGACATCG TCGCGCTGCT GGAGAGCGCG
CTCGATGAAG CAGCGATCAT CGCCTCGCTC GGGACAACTC ACAGCCTTGA TGAAGCAGCT
GCGCAGCGGG CGTTGTCCGC CTTGCTGCCT GACGGATATT GCAGGCTTGG ACTGAGGGCG
ATCAAGCGGG TCCTGCCGCT CATGGAAGCT GGCAGGACCT ACGCGGAGGC CGCCAGCGCG
GCCGGCTATG ATCACGCTCT GCTGCCGGGC GGCAAGCTCT CTCCCACCGG CTACCTGCCC
TATTATGGAC AATGGCTGCA GAACGATGTC GTGGGCTCGG ACGATGAGCG CGACACCAAC
GAACGGCGCT GGGGCCGCTT GCCGAATCCC ACCGTTCACA TCGGGATCGG CCAGTTGCGA
CGCGTCGTCA ATGAGCTCAT CAGATGGCAT GGACCGCCGG CCGAGATCAC CGTCGAGTTG
ACGCGTGACC TGAAGCTGTC GCCCCGACGG CTGGCGGAGC TCGAACGCGA GCAGGCCGAG
AACCAGCGCA AGAACGACAA GCGTACCTCC CTATTGCGCA AGCTCGGGCT CCCCGCGAGC
ACGCACAATC TCCTCAAGCT TCGGCTCTGG GACGAGCAAG GCGATGTTGC AAGCGAATGC
CCCTATACGG GCGAGGCGAT CGGCCTCGAA CGTCTGGTCT CTGATGATGT GGATATCGAT
CACCTCATCC CATTCTCGAT CAGCTGGGAC GACAGCGCGG CCAACAAAGT GGTCTGCATG
CGCTACGCCA ATCGTGAGAA GGGCAATCGA ACGCCGTTCG AGGCCTTTGG CCATCGCCAA
GGCAGGCCTT ACGATTGGGC GGACATTGCA GAACGCGCAG CGCGCCTGCC GCGCGGCAAG
CGCTGGCGCT TCGGTCCAGG CGCGCGGGCG CAATTCGAGG AGCTCGGCGA CTTTCAGGCA
CGCCTGCTCA ACGAGACCAG CTGGCTGGCG CGCGTCGCCA AGCAATATCT CGCAGCGGTC
ACCCACCCGC ACAGGATCCA CGTTCTGCCG GGCCGGCTGA CAGCGCTGCT CCGCGCAACA
TGGGAGCTCA ACGATTTGCT GCCCGGAAGC GACGACAGAG CCGCGAAGAG CCGCAAGGAC
CACCGTCATC ATGCCATCGA CGCGCTGGTG GCGGCACTGA CAGACCAGGC GCTGCTGCGC
CGCATGGCGA ACGCGCATGA CGATACGCGA CGGAAGATCG AAGTTCTCCT GCCCTGGCCG
ACGTTCCGGA TCGATCTCGA GACCAGGCTG AAGGCGATGC TCGTATCGCA CAAGCCCGAT
CACGGCCTCC AGGCCCGCCT GCATGAAGAC ACCGCCTATG GGACCGTCGA ACACCCCGAA
ACCGAGGATG GTGCAAATCT GGTCTATCGG AAGACCTTCG TGGACATCAG CGAAAAGGAG
ATCGACCGCA TTCGCGATCG CCGCTTGCGT GACCTCGTCA GAGCCCATGT GGCCGGCGAA
AGGCAGCAGG GCAAGACGCT CAAAGCGGCG GTGCTGTCAT TCGCGCAGCG CAGGGACATT
GCTGGTCACC CGAATGGCAT TCGCCATGTC CGCCTGACCA AATCGATCAA GCCGGACTAT
CTGGTACCGA TCCGCGACAA AGCCGGCCGC ATCTACAAGT CCTACAATGC AGGCGAGAAT
GCCTTCGTCG ACATCCTGCA AGCCGAGAGT GGCCGATGGA TCGCGCGGGC CACGACCGTC
TTTCAGGCCA ATCAAGCCAA TGAGTCGCAT GACGCGCCAG CGGCGCAACC GATCATGCGG
GTCTTCAAGG GCGACATGCT GCGCATCGAT CACGCTGGCG CGGAGAAGTT CGTGAAGATC
GTCAGGCTTT CGCCCTCGAA CAACCTGCTC TACCTCGTCG AACATCATCA GGCGGGCGTG
TTTCAGACCC GCCATGACGA CCCGGAAGAT TCCTTTCGGT GGCTCTTCGC CAGTTTTGAC
AAGCTTCGCG AATGGAACGC CGAGCTTGTC CGGATCGATA CGCTGGGACA GCCCTGGCGG
CGCAAGCGCG GCCTTGAAAC AGGAAGCGAG GACGCCACTC GCATCGGCTG GACGCGACCA
AAAAAATGGC CCTGA
 
Protein sequence
MKRTSLRAYR LGVDLGANSL GWFVVWLDDH GQPEGLGPGG VRIFPDGRNP QSKQSNAAGR 
RLARSARRRR DRYLQRRGKL MGLLVKHGLM PADEPARKRL ECLDPYGLRA KALDEVLPLH
HVGRALFHLN QRRGLFANRA IEQGDKDASA IKAAAGRLQT SMQACGARTL GEFLNRRHQL
RATVRARSPV GGDVQARYEF YPTRAMVDAE FEAIWAAQAP HHPTMTAEAH DTIREAIFSQ
RAMKRPSIGK CSLDPATSQD DVDGFRCAWS HPLAQRFRIW QDVRNLAVVE TGPTSSRLGK
EDQDKVARAL LQTDQLSFDE IRGLLGLPSD ARFNLESDRR DHLKGDATGA ILSARRHFGP
AWHDRSLDRQ IDIVALLESA LDEAAIIASL GTTHSLDEAA AQRALSALLP DGYCRLGLRA
IKRVLPLMEA GRTYAEAASA AGYDHALLPG GKLSPTGYLP YYGQWLQNDV VGSDDERDTN
ERRWGRLPNP TVHIGIGQLR RVVNELIRWH GPPAEITVEL TRDLKLSPRR LAELEREQAE
NQRKNDKRTS LLRKLGLPAS THNLLKLRLW DEQGDVASEC PYTGEAIGLE RLVSDDVDID
HLIPFSISWD DSAANKVVCM RYANREKGNR TPFEAFGHRQ GRPYDWADIA ERAARLPRGK
RWRFGPGARA QFEELGDFQA RLLNETSWLA RVAKQYLAAV THPHRIHVLP GRLTALLRAT
WELNDLLPGS DDRAAKSRKD HRHHAIDALV AALTDQALLR RMANAHDDTR RKIEVLLPWP
TFRIDLETRL KAMLVSHKPD HGLQARLHED TAYGTVEHPE TEDGANLVYR KTFVDISEKE
IDRIRDRRLR DLVRAHVAGE RQQGKTLKAA VLSFAQRRDI AGHPNGIRHV RLTKSIKPDY
LVPIRDKAGR IYKSYNAGEN AFVDILQAES GRWIARATTV FQANQANESH DAPAAQPIMR
VFKGDMLRID HAGAEKFVKI VRLSPSNNLL YLVEHHQAGV FQTRHDDPED SFRWLFASFD
KLREWNAELV RIDTLGQPWR RKRGLETGSE DATRIGWTRP KKWP
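
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares for elongation, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). The function names `clean`, `translate_cds`, and `gc_fraction` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence listed above.
print(translate_cds("ATGAAGAGAA CGAGTTTACG G"))  # MKRTSLR
```

The first seven codons give MKRTSLR, which matches the start of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.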