Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_B0230 |
Symbol | |
ID | 7169953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011777 |
Strand | + |
Start bp | 187296 |
End bp | 188936 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643559608 |
Product | bacterial extracellular solute-binding protein, family 5 |
Protein accession | YP_002455112 |
Protein GI | 218848334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 2.2179599999999998e-27 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAAAA AGGTTACATC GGTTGTTGCT TCTGTTTTAT GTACTAGTTT TCTATTAACT GCTTGTAGTG GAGAGAAAGA GACTAAAAAT ACAGCAAAGG CAACTAACAA GAATGAAACA AAACAATCCA TTAATTTACC GTATATAGCG GAGATTCCAA CAATGGATGT AACAAAGGCA ACTGATAGTG AATCAATGAA TGTAATGCGT AATGTTTTTG AAGGTCTGTA CACACTAGGA GAAGATAATA AATTGATTCC TGGAGTGGCA CAATCTTATG ACGTAAGTGA GGATAAAAAA ACGTACACGT TTCATTTGAG AGAATCGAAA TGGTCAAACG GAACACCTGT AACAGCTGCT GACTTCGCGT TTTCTTGGAA ACGTGCTGTG AATCCCGATA CAGCAGCAGA ATATGCTTTT CTATTCTTTG ATATAAAGAA TGCAAAGAGA ATTAATAGTA AAGAGCTACC TGTAGATCAG TTAGGGGTAA AAGCAATAGA TGATAAAACA TTAGAGGTGC AATTAGAGCA GCCTGTCCCT TATTTTATCG AATTGACAAC GTTCGCAACG TTTTTACCCA TTAACGAAAG GTATTTTGAA TCACAAGGAA AGCAGTATGG CTTAGAAGCA AGCAAATTGG TTTATAACGG AGCGTTTACA TTGGATAATT GGAAGCATGA ACAAGGTTTT CAGTTGAAAA AGAATCCTAA CTATTGGGAT TCTAAAACAG TGAAATTAGA TGCGATTAAC TTTGATATTG TGAAAGATAA ATCAACAGAA ATAAATCTAT ATGAATCAGG TCAGCTTGAC CGTGTAGGAC TAACAGGTGA GTTTGTAGAT AAATATAGAA AGAATCCCGA CTTCAAAGAA CGTTCTGAAG TTACAGTGCA ATTCCTACGT ATGAATCAAA AGAATGAAGT ATTAAAAAAT AAAAATGCTC GCTTGGCAAT CAGTCAAGCA ATGAATAAGA AAGCCTTTGT AGATACAGTT TTGAATAATG GTTCTCTTCC TGCAGACGGT TTTATACCAG TCGATTTTGC TAAAAGTTCG GACGGAAAAG ACTTCCGAAA GGAAAATGGA AAATTAGTAA AAGATGATGT GAAAGCAGCG AAGGAAATCT GGAAGAAAGC AAAGCAAGAA CTTGGGAAAG AACAGGTAAC ATTAGAATTG CTTACAAGTG ATAATGTGTT TGCTAAAAAG AATGCGGAGT ACTTAAAAGG TGAATTAGAA AAGAATTTAG AGGGATTAAC GGTCAATGTG AAGCCGCAGC CACGCAAACA ACAAATTCAA TTGCTACTAA ATAGTAATTA TGATTTAGGT GTTGATGTAT GGGCTCCCGA TATTCCTGAT CCAATCACAT TTTTAGATAT ATTCGCGACA GATAGCAGCT ATAACTTTGA TAAGTATTCA AATCAGGCAT ATGATGAACT AATTCATCAA GTGAAAACAG ATTTAGCTGG TAATGAAACT GCTCGTTGGG AGGCAATGAA ACAAGCAGAA AAAATATTAT TAGAAGATGG AGCGGTCGCA CCATTTTATC AATCAGGCAG ATCATACTTA CAGCGTTCCT CTATTAACGG AATTGTAACA AACGACTTTG GTGGAGAATT TAACTATAAG TTTGCTGAAA TTAAAAAATA A
|
Protein sequence | MFKKVTSVVA SVLCTSFLLT ACSGEKETKN TAKATNKNET KQSINLPYIA EIPTMDVTKA TDSESMNVMR NVFEGLYTLG EDNKLIPGVA QSYDVSEDKK TYTFHLRESK WSNGTPVTAA DFAFSWKRAV NPDTAAEYAF LFFDIKNAKR INSKELPVDQ LGVKAIDDKT LEVQLEQPVP YFIELTTFAT FLPINERYFE SQGKQYGLEA SKLVYNGAFT LDNWKHEQGF QLKKNPNYWD SKTVKLDAIN FDIVKDKSTE INLYESGQLD RVGLTGEFVD KYRKNPDFKE RSEVTVQFLR MNQKNEVLKN KNARLAISQA MNKKAFVDTV LNNGSLPADG FIPVDFAKSS DGKDFRKENG KLVKDDVKAA KEIWKKAKQE LGKEQVTLEL LTSDNVFAKK NAEYLKGELE KNLEGLTVNV KPQPRKQQIQ LLLNSNYDLG VDVWAPDIPD PITFLDIFAT DSSYNFDKYS NQAYDELIHQ VKTDLAGNET ARWEAMKQAE KILLEDGAVA PFYQSGRSYL QRSSINGIVT NDFGGEFNYK FAEIKK
|
| |