Gene Information |
Locus tag | BBta_3225 |
Symbol | |
ID | 5152749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3377699 |
End bp | 3379138 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640558091 |
Product | putative sugar ABC transporter, periplasmic substrate binding protein |
Protein accession | YP_001239238 |
Protein GI | 148254653 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
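
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (as the reported gene length implies) and a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3377699, 3379138)
print(length)                  # 1440
print(protein_length(length))  # 479
```

Both results match the Gene Length (1440 bp) and Protein Length (479 aa) fields of this record.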
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0178955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGATC GCGAGAAGAA CCTGTTTGCA TCCTATGCCG GCAAGCGCAT CAGCCGGCGT GACCTGCTCG ACGGCGCCGC CAAGCTCGGC ATCGCCGGCG CCGCAGCCAA TGCTGCCTTT GCCTCCACGA TGTCGAAGGC CCTGGCGGCG GACTTCAATT GGAAGGCGCA GAGCGGCAAG ACCGTCAAGC TGCTCCTGAA CAAGCATCCT TATGTCGATG CGATGATCGG AAATATCGAG GCTTTCAAGA GCCTCACAGG CATGAACATC ACCTACGACA TCTTCCCGGA GGATGTCTAT TTCGACAAGG TCACGGCGGC GCTGTCGTCG AAGTCCGATC AGTACGACGC CTTCATGACC GGCGCCTACA TGACCTGGAC CTATGGTCCG GCCGGCTGGA TCGAGGACCT CAACACCTAC ATCAAGGATC CCGCCAAGAC CAATCCGGCG TTCGCCTGGG ATGATGTGCT GCCCGGCCTG CGCTCGTCGA CCGCGTGGGA CGGTGTCGCC GGCTCCGAGC TTGGCTCGGG CAAGGCCAAG CAATGGTGCA TCCCGTGGGG CTACGAGCTC AATAACGTCA CGTATAACCG CAACATCTTC AACAAGGTCG GCGTGAAGCC GCCGGGCAAT CTCGATGAGA TGCTCGAGGT CGCTGCCAAG ATCACCAAGG ATGCCGGTGG GCCCTATGGC GTCGGCGTGC GCGGCTCGCG GTCCTGGGCC ACCATCCATC CGGGCTTCCT GTCCGCCTAT GCCAATTTCG GGCAGAAGGA CTTCGTGATG GAGGGCGGCA AGCTCAAGGC CGCGATGAAC ACCAAGGCCT CGAAGGAGTT CCACGACAAA TGGGTCAAGA TGATCCAGGG ATCGGGCCCG AAGAATTGGT CGACCTACAC CTGGTACCAG GTCGGCACCG ATCTCGGCGC CGGCGCCTCC GGCATGATCT TCGATGCCGA CATCCTCGGC TACTTCATGA ACGGCGGCGA GAACAAGGAG CGCGGCAATC TCGGCTATGC GCCGTTCGCG GCCAATCCGG CCGCCAAGGC GCCGACGCCC AACGTCTGGA TCTGGTCGCT GGCGATGTCG AGCTTCTCCA AGCAGAAGGA TGCGGCCTGG CTGTTCATGC AGTGGGCGGC TTCGACCGAG CACGGCCTGT TCGGCGCCCG CAAGATGGAC TTCGTCAATC CCGTCCGCAC CTCGGTGTGG AAGGACAGCG AATTCCGCGA CCGCATCGCC AAATCCTATC CGGGCTATCT CGAGCAGCAT GACCTCTCGG CACCGGGGGC CAAGATCTAC TTCACGGCGC AGCCGCTGTT CTTCGATCTC ACCACGGAGT GGGCGGCCTC GCTGCAGAAG ATGGTGGCCA AGGAGGTCAG TGTCGATGAA GGTCTCGACA AGCTCGCCGA GAGCATCAAC CGCCAGCTGA AGCAGGCCGG CCTCGGTTGA
|
Protein sequence | MYDREKNLFA SYAGKRISRR DLLDGAAKLG IAGAAANAAF ASTMSKALAA DFNWKAQSGK TVKLLLNKHP YVDAMIGNIE AFKSLTGMNI TYDIFPEDVY FDKVTAALSS KSDQYDAFMT GAYMTWTYGP AGWIEDLNTY IKDPAKTNPA FAWDDVLPGL RSSTAWDGVA GSELGSGKAK QWCIPWGYEL NNVTYNRNIF NKVGVKPPGN LDEMLEVAAK ITKDAGGPYG VGVRGSRSWA TIHPGFLSAY ANFGQKDFVM EGGKLKAAMN TKASKEFHDK WVKMIQGSGP KNWSTYTWYQ VGTDLGAGAS GMIFDADILG YFMNGGENKE RGNLGYAPFA ANPAAKAPTP NVWIWSLAMS SFSKQKDAAW LFMQWAASTE HGLFGARKMD FVNPVRTSVW KDSEFRDRIA KSYPGYLEQH DLSAPGAKIY FTAQPLFFDL TTEWAASLQK MVAKEVSVDE GLDKLAESIN RQLKQAGLG
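
The sequences above are printed in blocks of 10 characters separated by spaces, so the whitespace has to be stripped before any computation. As a rough illustration, the sketch below computes GC content from such a block-formatted string. The helper name `gc_percent` is hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = "ATGTACGATC GCGAGAAGAA"
print(gc_percent(demo))  # 45.0
```

Passing the full gene sequence instead of `demo` should give a value that can be compared with the GC content field reported for this gene.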