Gene BBta_4858 details


Gene Information

Locus tag: BBta_4858
Symbol:
ID: 5154914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 5099850
End bp: 5101445
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 63%
IMG OID: 640559656
Product: putative ABC transporter, substrate-binding protein
Protein accession: YP_001240787
Protein GI: 148256202
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
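
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5099850
END_BP = 5101445


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1596
print(protein_length(length))  # 531
```

Under these assumptions both results agree with the reported gene length (1596 bp) and protein length (531 aa).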


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.185299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.554017
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCG ATCGCCGCGA CTTTCACTTT GGCATGCTCG CGTCGGCTGC GGCGTTGCTG 
CTCCCGCAGC ATGGCCAGGC CGCCGCCGAC AAGCCAGTTT CGGGCGGAAC GCTCAATTGG
GTCTACTACC CCGATCCATC GGCCTTGATC GCGATCAACA CCTCGTCGGG CACCGGCCAG
ACGATCGGCA CCAAGGTCAA CGAAGGCCTG CTCGCCTACG ACTATGATCT CAATCCGAAG
CCGGTGCTCG CGACGTCCTG GTCGATCAGC GAGGACGGCA AGCGCTATAC GTTCAAGCTG
CGGCCGAACG TGAAATGGTC GGATGGTCAG CCGTTCACCT CGGTCGACGT GGCGTTCTCG
GTCGAACGCC TCAAGATCGC CCATCCGCGC GGCCGCATCA CTTTCGCCAA TGTCGAGGCA
GTCGAGACGC CGGATCCGCT GACGGCCGTC ATCGTGCTGG CGAAGCCCGC ACCGTTCCTG
ATTTCGGCGC TGGCCGGCGC GGAGTCGCCG ATCGTGCCGC GGCACGTCTA CTCGACGTTT
AAGCCGGACG AACAGCCGAA GCTGGAGCAG ATGATCGGCA CCGGCCCCTT TATTCTCCAC
GAATGGGTGC CGGGCAGCCA TCTGCTGTTC GTCCGCAATC CCAACTACTG GGATGCACCG
AAGCCCTATG TCGATCGCCT AGTGCTGAAG GTCGTGCTCG ACCCGGCCGC GCGTGCCGCG
GCGCTGGAGG CCGGGGAGGT CGATATCGGT GCGACGCCGG TTCCCTATGG CGACATCGAG
AGGTTCAAGG CCGACAAGAG GTTTGTCGTC GATACGACCA CCTATGCCTA TTCCGGACCG
CAGCAGCAGC TGTTCTTCAA TTTCGATACG CCGGCGCTGC AGGACCGGCG CGTCCGCAAG
GCCATCGCGC ATGCGATCGA CCTGAAGGCG CTGCTCGACG TTGTCTATTT CGGCTATGGT
CTGGTCTCGC CGTCGCCGAT CAGTACGGCG CTGCCGAAAT TCTACGATCC CAAGATCCAG
GGATGGCCCG TCGACCTCAA GGCGGCGGAG CGTCTGTTGG ACGAGGCGGG CCTGCCGCGC
GGCGCCGGCG GCATTCGCAC CAAGCTCCGG CTGACGCAGA ATCCGTTCCT GCCGCCGAGC
TTCGCCGATT TTCTTCGCAA CGCGTTGCGG CGCATTGGAC TCGATATCGA GATCCAGCGT
TTCGACCTCG CGACCTATCT CAACGTCGTC TATCGCGACC GTGCGTTCGA TCTGACGATC
GAGAGCCTGT CGAACACGTT CGATCCGACG CTTGGCGTGC AGCGGGCTTA TTGGTCGAAG
AACTTCCGGA TCGGCCTGCC ATTCTCCAAC GCCGCGCATT ACAGCAATCC GGAGATCGAC
CGGCTGCTGG AGGCCGCGGC GATCGAGACG GATGTCGAGA AGCGCCGTCA GCTGTGGTCG
TCATTCCAGT CCATCATCCA TGAGGACGTG GCGTCGGTCG ATCTGGTCGC TGCCGGCGGC
GTCATCATCG CCAACCGCAA GGTGCGCAAT TTCGCGCCCG GTGCTGAGGG GCTGAATGGG
AGCTTCGCCG ATCTCTGGAT CGACCCCTCG GCCTGA
 
Protein sequence
MKLDRRDFHF GMLASAAALL LPQHGQAAAD KPVSGGTLNW VYYPDPSALI AINTSSGTGQ 
TIGTKVNEGL LAYDYDLNPK PVLATSWSIS EDGKRYTFKL RPNVKWSDGQ PFTSVDVAFS
VERLKIAHPR GRITFANVEA VETPDPLTAV IVLAKPAPFL ISALAGAESP IVPRHVYSTF
KPDEQPKLEQ MIGTGPFILH EWVPGSHLLF VRNPNYWDAP KPYVDRLVLK VVLDPAARAA
ALEAGEVDIG ATPVPYGDIE RFKADKRFVV DTTTYAYSGP QQQLFFNFDT PALQDRRVRK
AIAHAIDLKA LLDVVYFGYG LVSPSPISTA LPKFYDPKIQ GWPVDLKAAE RLLDEAGLPR
GAGGIRTKLR LTQNPFLPPS FADFLRNALR RIGLDIEIQR FDLATYLNVV YRDRAFDLTI
ESLSNTFDPT LGVQRAYWSK NFRIGLPFSN AAHYSNPEID RLLEAAAIET DVEKRRQLWS
SFQSIIHEDV ASVDLVAAGG VIIANRKVRN FAPGAEGLNG SFADLWIDPS A
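
Both sequences are laid out in space-separated blocks of 10 characters, so the blocks have to be joined before lengths or base composition can be computed. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_percent` are hypothetical, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay a sequence out in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, used here only as a short sample.
sample = "ATGAAGCTCG ATCGCCGCGA CTTTCACTTT GGCATGCTCG CGTCGGCTGC GGCGTTGCTG"
dna = join_blocks(sample)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Applied to the complete gene sequence, `gc_percent` can be compared with the GC content reported in the Gene Information section, and `len(join_blocks(...))` applied to each full listing can be compared with the reported gene and protein lengths.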