Gene BBta_5253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5253 
Symbol 
ID5154631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5476598 
End bp5478211 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content63% 
IMG OID640560022 
Productputative ABC transporter (substrate-binding protein) 
Protein accessionYP_001241146 
Protein GI148256561 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0401744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCTC ATCCGCGCCG GGCTCATTCC GTGCTCGCAT CCTGGCTTGT TGCGTCCGCT 
GTTTTGCTAT CGCCCTGTCT TGCGCCGCAG GCGCTCGCCG CTGGCAAGAC AACCATCGCC
GCCGTGATGC ATTCCGACCT GCGGGTGATC GATCCCGGCT TCACCACCGC CTATATCACC
CGCGATCATG GCTACATGGT CTATGACACG CTGCTCGCCA CCGATGCTAA CTTCAAGATC
CAGCCGCAGA TGGCGGATTG GAAGGTCTCG GACGACAAGC TGACCTACAC CTTCACCTTG
CGCGATGGCT TGAAATGGCA TGACGGCACG CCGGTCACGG CGGAGGATTG CGTCGCCTCG
CTGAAGCGCT GGGGCCGCAA TGACGGCATG GGCCAGAAGC TGATGGACTT CACCGCGAGC
CTGGAAGCTC CGGACGCCAA GACCATCGTG CTGAAGCTGA AGGAGCCTTA TGGCCTGGTG
CTGGAGTCGA TCGGCAAGCC GTCGTCGCTG GTGCCGTTCA TGATGCCGAA GCGCCTCGCT
GAGACGCCGC AGGGCCAGCA GATCCCGGAG CAGATCGGCT CCGGCCCGTT CAAATTCGTC
AAGGCGGAAT TTCAGCCGGG CGTGAAGGCG GTCTACGAGA AGAACACCGA CTATGTGCCG
CGCAAGGAGC CGGCGAGCTG GACCTCCGGC GGCAAGGTGG TGAAGGTCGA TCGCGTCGAA
TGGATCACCA TGGCCGATGC GCAGACCGCG GTGAATGCGC TGCAATCGGG CGACATCGAT
TTCATGGAAG CGCCGCCCTA TGATCTGTTG CCGGTGCTGG AAGCGAACCC GGATCTGAAG
ATCGATGTTC TCAACAAGTT CGGCTTTCAG ACGCTGGGCC GGATGAACTT CCTCAATCCG
CCGTTCGACA ATGTGAAGGT GCGCCGCGCG GCGTTCCTGG CGCTGAACCA GAAGGACGTG
CTCGACGCGC TTGTCGGCAA TGCCAAATAT CAGAAGATCT GCGGCGCGGT GTTCGTCTGC
GGCACGCCGC TGGAGACCGA TGTCGGTGCG GAAACGTTGA TCAAGGGCAA CGGCATGGCC
GAGGCCAAGA AGGCACTGGC GGAATCCGGC TATGACGGCA CGCCGATCGC GATCATGGCG
CCCGGCGATG TCGGTGCACT GAAGCCGCAG CCGATCGTGG CCGCGCAGCT GCTGCGTGAG
GCCGGCTTCA AGGTCGACCT GCAGGCGACG GACTGGCAGA CCGTGGTGAC GCGCCGCGCC
AGCCAGAAGC CGGTCAAGGA CGGCGGCTGG AACATGTTCT TCACCAACTG GGTCGGTGCC
GACGTCGCCA ATCCCGTCGT CAACGTCTCT GTCGGCGGCC GGGGCACCAA GGGCGGCTGG
TTCGGCTGGG CCGAGGATGC CGAGATCGAG AAGCTGCGCG ACGCCTATGC GCGCGCGTCT
TCGCCGGACG AGCAGAAGAA GATCGCCGCC GAGATCCAGA AGCTGAACTA TGAGCGCGTG
ATCTACATCC CGCTCGGCCA GTACCTCCAG CCGAGTGTCT GGCGCAAATC CTTGACCGGC
GTGCTGGACG GTCCGGCGAC GCCGGTGTTC TGGAACATCG ACAAGAAGGA GTAG
 
Protein sequence
MSPHPRRAHS VLASWLVASA VLLSPCLAPQ ALAAGKTTIA AVMHSDLRVI DPGFTTAYIT 
RDHGYMVYDT LLATDANFKI QPQMADWKVS DDKLTYTFTL RDGLKWHDGT PVTAEDCVAS
LKRWGRNDGM GQKLMDFTAS LEAPDAKTIV LKLKEPYGLV LESIGKPSSL VPFMMPKRLA
ETPQGQQIPE QIGSGPFKFV KAEFQPGVKA VYEKNTDYVP RKEPASWTSG GKVVKVDRVE
WITMADAQTA VNALQSGDID FMEAPPYDLL PVLEANPDLK IDVLNKFGFQ TLGRMNFLNP
PFDNVKVRRA AFLALNQKDV LDALVGNAKY QKICGAVFVC GTPLETDVGA ETLIKGNGMA
EAKKALAESG YDGTPIAIMA PGDVGALKPQ PIVAAQLLRE AGFKVDLQAT DWQTVVTRRA
SQKPVKDGGW NMFFTNWVGA DVANPVVNVS VGGRGTKGGW FGWAEDAEIE KLRDAYARAS
SPDEQKKIAA EIQKLNYERV IYIPLGQYLQ PSVWRKSLTG VLDGPATPVF WNIDKKE