Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BT9727_4344 |
Symbol | pepQ |
ID | 2858060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 4414304 |
End bp | 4415401 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637515759 |
Product | Xaa-Pro dipeptidase (proline dipeptidase) |
Protein accession | YP_038659 |
Protein GI | 49481350 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 68 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGCG TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAGCCTA TTTTAATTTG TCCTAAAATG GAAGAAGGTC AAGCGCGTAA CGCTGGATGG GCACATGAAA TTATCGGATT TACTGATACT GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGTATCAA TGCAAATGCA GTTGCAATTG AAAAAGAACA TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAACTATTC CCAAATGCAG CTTTCACATC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAT GAGAAAGAAC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT GGTGTAAATG CAATTAAAGA AAATCGTAGC GAACTAGAAG TATTAGCAAT TATTGAACAT GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAACCGCGG CGATTTCGTA CTATTTGATT TAGGCGTAAT CATTGATGGC TATTGCTCTG ACATTACACG TACAGTGGCA TTTGGCGAGA TTTCTGAAGA ACAAACTCGC ATTTACAACA CTGTACTTGC TGGACAACTA CAAGCAGTTG AAGCATGTAA ACCAGGTGTT ACACTTGGCG CAATTGACAA CGCTGCTCGT TCTGTTATCG CAGATGCAGG TTACGGCGAC TTCTTCCCGC ACCGCCTTGG TCACGGACTT GGAATTAGCG TACACGAATA TCCAGATGTA AAAGCTGGTA ACGAATCTCC ATTAAAAGAA GGTATGGTCT TCACAATTGA ACCAGGTATT TACGTACCAA ACGTAGGTGG CGTTCGTATT GAAGATGATA TTTATATCAC AAAAGACGGA TCAGAAATTT TAACGAAGTT CCCGAAAGAA TTACAATTTG TAAAATAA
|
Protein sequence | MNARLENLMQ WLKEKNVEAA FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGINANA VAIEKEHLNV ERYEELTKLF PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKENRS ELEVLAIIEH ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMNRGDFV LFDLGVIIDG YCSDITRTVA FGEISEEQTR IYNTVLAGQL QAVEACKPGV TLGAIDNAAR SVIADAGYGD FFPHRLGHGL GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE LQFVK
|
| |