Gene BBta_4859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4859 
Symbol 
ID5154537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5101740 
End bp5103338 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content65% 
IMG OID640559657 
Productputative ABC transporter, substrate-binding protein 
Protein accessionYP_001240788 
Protein GI148256203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.590645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCTCA CGCGTCGCAA TCTCAGCAAG ACCCTTCTAT TCGGAACGAT CACTGCCGCC 
GCCGGCCATG CGCGCACGTC GCAGGCCGCA GAGCCGCGGC GTGGCGGAAC GCTCAACCTG
GTCATTCAGC CCGAACCCCC GATTCTGGTG AGCCTCACTC ATACCGCAGG GCCGACGACG
CGGGTCAGCC CGAAGATCAC GGAAGGTCTT CTCACCTTTG ATCTCGACTT CAGGCCGCGG
CCGCAACTGG CGACGGACTG GCAAGTCAGC GACGACGGCC TGCGCTACAC CTTTGCTCTG
CGGCGCGGCG TGAAGTGGCA TGACGGCCGG GATTTCACCT CGGCCGACGT CGCCTACTCC
ATCAGCCTGC TCAAGCAGCA TCACCCGCGC GGCCGCGGCA CGCTGTCGTC CGTCCGCGAG
GTCGAGACAC CCGATGCGCA CACCGCCACA ATCGCGCTGG ACAAGCCGGC GCCCTATCTG
CTCGCCGCGT TGACGGCCTC GGAATCGCCG ATCGTGCCGC GCCATGTCTA TGAGGGCAGC
GATCCCCTGA ACAATCCGAA CGGGCGTGCG CCGATCGGTA CCGGTCCGTT CGTGTTCAAG
GAGTGGCAGC AGGGCAGCCA CATCATTCTC GAGCGCAATC CGACTTATTG GGACCCGGGC
AAGCCCTATC TCGACCGTAT CGTCGTCCGC TTCATCGCCG ATGCCAATGC ACGCGCCGTC
GCGCTGGAGA CCGGTGAGAT CCATCTGGCG CCCGACACGC CGGTTCCGCT CGGACAGATC
GAAGCGCTGA AGTCCAACCC GGCGCTGCTG ATCGAGACCC GCGGCTACGA TTACCAGCCG
ATCGTCTACC GCCTCGAATT CAATCTCGCC AATCCATACT TCGCCAAGCG CGAGGTGCGC
GCCGCCGTCG CCCACGCGAT CGATCGCGAC GCGATCACCC GCGTCGTGTT CTATGGCTGG
GGCCGGAATG CGCCGAGCGC AATCAGCCCT GCCCTGAAGC AGTTCCACAA CCCCGACATT
CCCCGCCACG ATTTCGATCC GAAGAAGGCC GAGGCGCTGC TCGATGCCGC CGGCTATCCA
CGCGGTCCAG ACGGCATCCG CTTCAAGGTG TTCCACGACT ACATGCCCTA CAGCGAGGCC
TACCAGCAGC TCGGCGCCTA CACGCGCCAG GCGCTCGCCA ATATCGGCAT TGGCGTGACC
TTGCGGGCGC AGGACGTCCC GACCTGGTTC AAACGGACCT ACACCAACCG CGATTTCGAT
TTCATGAGCA ACGGCATGAG CAACTCGTTC GATCCGACGG TCGGGGTGCA GCGGCTGTAT
TGGTCGAAGA ACTTCAAGCC CGGCGTGCCA TTCTCCAACG GCTCGGGCTA CAGCAATCCG
GAGGTCGACC GGCTGCTCGA AGCGGCCGCA GTCGAGAGCG ATCCGGCCAG GCGACGCGAA
CTGTTCAAGG CCTTCCAGGT CATCGTCGCC ACCGACCTGC CCGACGTCAA TCTCGTCACC
GGCGCCAACC TGACCATCGC CAATCGCAAG GTGCGCGACC ACACCACCAC GATCGACGGC
CCATCGGCGA ACTTCGCCGA TGTCTGGCTC GAGGCATAG
 
Protein sequence
MLLTRRNLSK TLLFGTITAA AGHARTSQAA EPRRGGTLNL VIQPEPPILV SLTHTAGPTT 
RVSPKITEGL LTFDLDFRPR PQLATDWQVS DDGLRYTFAL RRGVKWHDGR DFTSADVAYS
ISLLKQHHPR GRGTLSSVRE VETPDAHTAT IALDKPAPYL LAALTASESP IVPRHVYEGS
DPLNNPNGRA PIGTGPFVFK EWQQGSHIIL ERNPTYWDPG KPYLDRIVVR FIADANARAV
ALETGEIHLA PDTPVPLGQI EALKSNPALL IETRGYDYQP IVYRLEFNLA NPYFAKREVR
AAVAHAIDRD AITRVVFYGW GRNAPSAISP ALKQFHNPDI PRHDFDPKKA EALLDAAGYP
RGPDGIRFKV FHDYMPYSEA YQQLGAYTRQ ALANIGIGVT LRAQDVPTWF KRTYTNRDFD
FMSNGMSNSF DPTVGVQRLY WSKNFKPGVP FSNGSGYSNP EVDRLLEAAA VESDPARRRE
LFKAFQVIVA TDLPDVNLVT GANLTIANRK VRDHTTTIDG PSANFADVWL EA