Gene BBta_5254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5254 
Symbol 
ID5154632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5478753 
End bp5480366 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content64% 
IMG OID640560023 
Productputative ABC transporter (substrate-binding protein) 
Protein accessionYP_001241147 
Protein GI148256562 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.534405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0632497 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCACA TCCCGCGCTG GAAAGGTCTT ACGCTCGCCA CGAGCCTGGC CGTTTCCGTG 
CTTTCGCTGT CCACGGCGCT GAGCACCCAG GCGCTTGCCG CCGCCAAGAA GACGATCACG
GCGGTGATGC ATTCCGACCT GCGCGTGATC GATCCCGGCT TCACCACCGC CTATATCACC
CGCGACCATG GCTACATGGT CTACGACACG TTGCTGGCGG TCGATTCGAA CTTCAAGATT
CAGCCGCAGA TGGCGGACTG GAAGGTCTCC GACGACAAGC TGACCTACAC CTTCACCCTG
CGCGATGGCT TGAAATGGCA TGACGGCACG CCGGTCACGG CGGAGGATTG CGTCGCCTCG
CTGAAGCGCT GGGGCCGCAA TGACGGCATG GGCCAGAAGC TGATGGACTT CACCGCGAGC
CTGGAAGCTC CGGACGCCAA GACCATCGTG CTGAAGCTGA AGGAGCCTTA TGGCCTGGTA
CTGGAGTCGA TCGGCAAGCC GTCGTCGCTG GTGCCGTTCA TGATGCCGAA GCGCCTCGCC
GAGACGCCGC AGGGCCAGCA GATCCCGGAG CAGATCGGCT CCGGCCCGTT CAAATTCGTC
AAGGCGGAAT TTCAGCCGGG CGTGAAGGCG GTCTACGAGA AGAACACCGA CTACGTGCCG
CGCAAGGAGC CGGCGAGCTG GACCTCCGGC GGCAAGGTGG TGAAGGTCGA CCGCGTCGAA
TGGATCACGA TGCCGGATGC GCAGACCGCG GTGAACGCGC TGCAGTCGGG CGACATCGAC
TTCATGGAGA ACCTTCCCTA CGACATGATG CCGGTGCTGG AAGCGAACCC GGATCTGAAG
ATCGATGTGC TGAACAAGTT CGGCTACCAG ACGCTGGGAC GGATGAACTT CCTCAATCCG
CCGTTCGACA ATGTGAAGGT GCGCCGCGCG GCGTTCCTGG CGCTGAACCA GAAGGACGTG
CTCGACGCGC TGGTCGGCAA TGCCAAATAT CAGAAGATCT GCGGCGCGGT GTTCGTCTGC
GGCACGCCGC TGGAGACCGA TGTCGGTGCG GAGACGCTGA TCAAGGGCAA CGGCATGGCC
GAGGCCAAGA AGGCGCTGGC CGAATCCGGC TATGACGGCA CGCCGATCGC GATCATGGCG
CCCGGCGACG TCACGACGTT GAAGGCGCAG CCGATCGTGG CCGCACAGCT CTTGCGCGAG
GCCGGCTTCA AGGTCGACCT GCAGGCGACG GATTGGCAGA CCGTGGTGAC GCGCCGCGCC
AGCCAGAAGC CGGTCAAGGA AGGCGGCTGG AACATGTTCT TCACCAACTG GGTCGGCGCC
GACGTCGCCA ATCCCGTGGT GAACGTGTCG ATCGGTGGCC GCGGCACCAA GGGCGGCTGG
TTCGGCTGGG CCGAGGATGC CGAGATCGAG AAGCTGCGCG ACGCCTATGC GCGCGCGTCT
TCGCCGGACG AGCAGAAGAA GATCGCGACC GAGATCCAGA AGCTGAACTA TGAGCGTGTG
ATCTACATCC CGCTCGGCCA ATATCTGCTG CCGAGCGGCT GGCGCAAATC GCTGACCGGC
GTGCTGGACG GCCCGGCGAC GCCGATCTTC TGGAACATCG ACAAGAGCGA GTAG
 
Protein sequence
MFHIPRWKGL TLATSLAVSV LSLSTALSTQ ALAAAKKTIT AVMHSDLRVI DPGFTTAYIT 
RDHGYMVYDT LLAVDSNFKI QPQMADWKVS DDKLTYTFTL RDGLKWHDGT PVTAEDCVAS
LKRWGRNDGM GQKLMDFTAS LEAPDAKTIV LKLKEPYGLV LESIGKPSSL VPFMMPKRLA
ETPQGQQIPE QIGSGPFKFV KAEFQPGVKA VYEKNTDYVP RKEPASWTSG GKVVKVDRVE
WITMPDAQTA VNALQSGDID FMENLPYDMM PVLEANPDLK IDVLNKFGYQ TLGRMNFLNP
PFDNVKVRRA AFLALNQKDV LDALVGNAKY QKICGAVFVC GTPLETDVGA ETLIKGNGMA
EAKKALAESG YDGTPIAIMA PGDVTTLKAQ PIVAAQLLRE AGFKVDLQAT DWQTVVTRRA
SQKPVKEGGW NMFFTNWVGA DVANPVVNVS IGGRGTKGGW FGWAEDAEIE KLRDAYARAS
SPDEQKKIAT EIQKLNYERV IYIPLGQYLL PSGWRKSLTG VLDGPATPIF WNIDKSE