Gene Information |
Locus tag | Bind_2546 |
Symbol | |
ID | 6200380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | - |
Start bp | 2903497 |
End bp | 2905092 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641706523 |
Product | extracellular solute-binding protein |
Protein accession | YP_001833637 |
Protein GI | 182679491 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
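
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Bind_2546.
length = gene_length_bp(2903497, 2905092)
print(length)                     # 1596
print(protein_length_aa(length))  # 531
```

Under these assumptions both results agree with the listed gene length (1596 bp) and protein length (531 aa).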
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.145645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCG ACCGACGTCT GTTCAATCAG ATGCTGCTCC TTGCCGGAAG CAATCTGTCT TTCCCTCACC CTGCTTCCGC AGCAAATGAT CAGCCTGTCC GCGGAGGCAC GCTCAAGGTC GTTTACTATC CGGAGCCGAA CCAGGTCGTC GGCATTAATA CGAGCGCGGG TGGGCCCGCG ACCATTGGCC CAAAAATTTT CGATGGCCTC CTCACCTATG ATTACGCACT CAATCCACAA CCGGCGCTCG CCACAGAATG GTCGATCGCT CCGGACGGGC GGGTCTTGAC CTTTACACTC CGCAAAGGCG TGAAATTCAG CGATGGCCAT GATCTGACCT CGGAGGATGT CGCGTTTTCG ATCAGCCGCC TGCGTGAAGC GCATCCGCGC GGCCGCATCA CTTTCCAGAA TGTCACCGAG ATCGAGACGG CCGATCCTCA TATCGTCAAG ATCACATTGT CGAAGCCTTC GGCGCCGTTG CTCCTGGCGT TGGCTGCTTC GGAATCCCCC ATCGTGCCCA AGCATATTTT CGAACGCCTG AAGCCGAACG ATGATCCAGC CTTCGATCAG ATCATCGGCA GTGGCCCATT CCTACTCAAG GAATGGGTAA AAGGAAGCCA TATCCTTCTC GCGCGTAATC CTGCCTATTG GGATGCGCCG AAGCCTTATC TCGATCGTAT TATTTTCCGC TTCGTCTCAG ATCCGGCCGC GCGCGCCGCC GTGCTCGAAG CCGGTGAAGG CGACATCGGG CCGAACGCCG TTGCTTTCAG CGATCTCGAA CGTTTCGAGG CTCTGCCGCA ATTTACCGTG GATACAACTG TATTCGCCTA TGGCGGCCCG CTGCAACAAC TCATTCTCAA TCTCGACAAT GATTATCTGA AGAATTTGAA AGTCCGCCAG GCCATTGCGC ATGCAATCGA TCTCGAACAA CTCAATGCGA TCGTCTTTTA TGGCTATGGA CAAGTCTCTC CGACGCCGAT CAGTGTCGTA AACACCAAAT ATTTCGATCC CACGATCAAG GCGGCGACTT TCGACCCGAA GCTTGCGAAC CAATTACTTG ATGAAGCCGG ATACAAGCGG GGGGCCGGCG GCTTCCGCTT CAAGCTGCGG CTCACCAATA ACCCCTATAA TCCCTCCGCC TATTCCGATT TCCTGAAACA GGCTCTGGCC AGGATCGGCA TCGACGCGAC CATCCAGAAA TTCGATTTCG GATCTTATGT AAAGACGGTT TACACCGATG GCGCCTTCGA TCTTTCGACC GAATCTTGCG GTAATTTCTT TGACCCCAGC GCTGGCGCCC AGCGTCTCTA TTGGTCGAAG AACATCAAAA AGGGGCTGCC TTTTTCGAAC GGCGCCCATT ATGTCAATCC ACAACTCGAT CAGATTCTCG AAGATGCTGC CAGTACACTC GACGAGAGCC GACGACGGAC GCTCTTCTTC GAATTTCAGG AAATCATCGC GAGGGACCTG CCGATTATCA ATTTGATCGC GCCACCGACC ATCATCATCG CTAGAAAAAG CGTGAAGAAT TATGCAACCG GTGGCGATGG CTTCCTCGGC AATTTCGCCG AAACCTATAT CGATACGAAA AGTTAA
Protein sequence | MIIDRRLFNQ MLLLAGSNLS FPHPASAAND QPVRGGTLKV VYYPEPNQVV GINTSAGGPA TIGPKIFDGL LTYDYALNPQ PALATEWSIA PDGRVLTFTL RKGVKFSDGH DLTSEDVAFS ISRLREAHPR GRITFQNVTE IETADPHIVK ITLSKPSAPL LLALAASESP IVPKHIFERL KPNDDPAFDQ IIGSGPFLLK EWVKGSHILL ARNPAYWDAP KPYLDRIIFR FVSDPAARAA VLEAGEGDIG PNAVAFSDLE RFEALPQFTV DTTVFAYGGP LQQLILNLDN DYLKNLKVRQ AIAHAIDLEQ LNAIVFYGYG QVSPTPISVV NTKYFDPTIK AATFDPKLAN QLLDEAGYKR GAGGFRFKLR LTNNPYNPSA YSDFLKQALA RIGIDATIQK FDFGSYVKTV YTDGAFDLST ESCGNFFDPS AGAQRLYWSK NIKKGLPFSN GAHYVNPQLD QILEDAASTL DESRRRTLFF EFQEIIARDL PIINLIAPPT IIIARKSVKN YATGGDGFLG NFAETYIDTK S
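
The gene and protein sequences above can be related with a short script. This is a minimal sketch: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. Alternative start codons, which table 11 allows, are not handled because this gene starts with ATG. The helper names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGATCATCG ACCGACGTCT GTTCAATCAG"))  # MIIDRRLFNQ
```

Passing the full gene sequence to `translate` and `gc_percent` lets a reader compare the results with the listed protein sequence and GC content.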