Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_1922 |
Symbol | |
ID | 5151360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 1983414 |
End bp | 1985021 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640556866 |
Product | putative ABC transporter, periplasmic solute-binding protein |
Protein accession | YP_001238022 |
Protein GI | 148253437 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.92103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG GGATCACCAA CTGGACTTCG CGTGACGATG TGCTGGTCGA GGCGGCGATC CGGCGCGGCG CGTCGCGCCG GGATCTCCTG AAGATGATGT TAGCCAACGG TGTCGCGCTC GCTGCAGGCA GCACCATCCT CGGCCGCGCC GAGCGCGCTG ACGCGGCAAC GCCGAAGAAG GGCGGATCAA TCAAGGCGGC GGGATGGTCG ACCTCGACCG CCGACACGCT CGATCCGGCC AAGGCGTCGT TCTCGACCGA CTATGTCCGC TGCTGCTCGT TCTACAACCG GCTGACGACA CTCGACAAGA ACGGCGCGGC GCAGATGGAG CTCGCTGAAG CCGTGGAGTC GACGGATGCG CAGACCTGGA CGGTCAAGCT CAAGAAGGGC GTCACCTTCC ACGACGGCAA GCCGTTGACG GCTGACGACG TCGTGTTCTC GCTGAAGCGG CATCTCGACA AGGCGACCGG CTCCAAGGTG GCCAAGATCG CGGCGCAGAT GACCGGCTTC AAGGCGGTCG ACAAGACCAC CGTCGAGATC ACCCTGGCGA GTCCGAACGC TGACCTGCCG ATGATCCTCG CGCTGCATCA CTTCATGATC GTGGCCGACG GCACCACCGA CTTCTCCAAG GCGAACGGTA CCGGCGCCTT CGTCCGCGAG GCTTTCGAGC CGGGCGTGCG CTCGATCGGC GTGCGCAACA AGAACTATTT CAAGGACGGT CCCTATCTCG ACCAGATCGA GTTCATCGCG ATCACGGAAG AGAATGCGCG CGTCAACGCG CTGCTCTCCG GCGATATTCA GCTCGCGGCC AGCATCAATC CGCGCTCGCT CCGCCTGATC GAGGGCAAGG AGGGGATCGC GCTGTCGAAA TCGACGACCG GCAACTACAC CGATCTCAAC ATGCGGCTCG ATATGGCCCC GGGCAACAAG AAAGATTTCA TCGATGGCAT GAAGCACATC GTCAACCGCG AGCAGATCGT CAAATCGGTC CTGCGCGGCT TCGGGGTCGT CGGCAACGAC CAGCCGGTCT CGCCGGCCAA CTTCTTCCAC AATCCCGACC TCAAGCCGAA GCCGTTCGAT CCGGACAAGG CGAAATTCCT GTTCCAGAAG GCCGGCGTGC TCGGTACGCC GATCCCGGTG GTCGCATCGG AAGCTGCGAA CTCCGCGATC GAGATGGCGA TGGTGATCCA GGCCTCGGCC GCGGCGGTCG GACTGACGCT CGACGTGCAG CGCGTTCCGT CGGACGGCTA TTGGGACAAT TACTGGCTGA AGGCGCCGGT CCATTTCGGC AACATCAATC CGCGGCCGAC GCCGGACATC CTGTTTTCGC TGCTCTACGC CTCCGAGGCG CCTTGGAACG AGAGCCGCTA CAAGTCCGAG ACGTTCGACA AGATGATGCT CGAGGCGCGC GGGATGCTCG ACCAGGCCAA GCGCAAGCAG ATGTATGCCG AGATGCAGGT CAAGATCGCG GAGGAGGCGG GCACCATCAT TCCGGCCTAT ATCTCCAATA TTGATGCCAC CTCGTCCAAG CTGAAGGGCT TGGAGCCGAG CCCGCTCGGC GGCATGATGG GATATGCCTT TGCTGAATAT GTCTGGCTCG ATTCGTGA
|
Protein sequence | MSNGITNWTS RDDVLVEAAI RRGASRRDLL KMMLANGVAL AAGSTILGRA ERADAATPKK GGSIKAAGWS TSTADTLDPA KASFSTDYVR CCSFYNRLTT LDKNGAAQME LAEAVESTDA QTWTVKLKKG VTFHDGKPLT ADDVVFSLKR HLDKATGSKV AKIAAQMTGF KAVDKTTVEI TLASPNADLP MILALHHFMI VADGTTDFSK ANGTGAFVRE AFEPGVRSIG VRNKNYFKDG PYLDQIEFIA ITEENARVNA LLSGDIQLAA SINPRSLRLI EGKEGIALSK STTGNYTDLN MRLDMAPGNK KDFIDGMKHI VNREQIVKSV LRGFGVVGND QPVSPANFFH NPDLKPKPFD PDKAKFLFQK AGVLGTPIPV VASEAANSAI EMAMVIQASA AAVGLTLDVQ RVPSDGYWDN YWLKAPVHFG NINPRPTPDI LFSLLYASEA PWNESRYKSE TFDKMMLEAR GMLDQAKRKQ MYAEMQVKIA EEAGTIIPAY ISNIDATSSK LKGLEPSPLG GMMGYAFAEY VWLDS
|
| |