Gene Information |
Locus tag | BT9727_3617 |
Symbol | pepQ |
ID | 2855039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 3716487 |
End bp | 3717557 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637515035 |
Product | proline dipeptidase |
Protein accession | YP_037937 |
Protein GI | 49478427 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.338254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTAA AAATTAATAA AATCCAAAAT CAACTACAGA ACTATGAAAT TGACGGGTTA CTCATTACAA AAAAAGAAAA TCGCCAATAT GCGACAGGCT TTACAGGTAG TGCTGGTGTT GTCTTAATCA CTGCGGATGC AGCTGTTTTT ATAACTGATT TTCGCTATGT AGACCAAGCG AATTCACAAA TAAAAAATGC TGAAATTATT ATGCATAAAG GAAATTTAGA AAAAGAAATT GCAAATCAAG TATCAAAATT AAACATTCAA AAACTTGGAA TTGAAGAAAA TAATATGACA TTGCAACAAT ATAAAAACTT ACAAAAATAT GTACATACAG AAATGGTTCA AGTGCGTGAA ATCATTGAAA ACATTCGTCT TATTAAAGAC ACTCATGAAA TAGAAACAAT GAAAATCGCA GCTCATATTG CGGACGAAGC ATTTCACCAC ATCATTACGT TTCTAAAACC AGGAATAAGT GAAAATACTG TACGAGATGA ATTAGAATTT TTCATGCGCA AAAAAGGTGC TGCATCTTCA TCATTTCAAA TTATTGTAGC TTCTGGTGTT CGTTCTTCTC TTCCACATGG AGTTGCATCA AATAAAATAA TTGAACGAGG CGACATCGTT ACATTAGATT TCGGTGCACT TTACGACGGA TATTGTTCCG ATATAACACG TACTGTAGCA ATTGGGGAAC CATCAGAAGA GTTCAAAAAA ATATACAATG TTGTACGCGA AGCATTAAAA CGCGGGACTG AAGCAATTAA GCCTGGAGAA ACTGCAAAAA GTATCGATGA TGTAACAAGA AACTACATTA CAGATTGTGG ATATGGTCAA TATTTTGGTC ACTCTACAGG GCACGGTCTT GGCTTAGAAA TACATGAACC TCTTCGCCTA TCCCAAGAAA GTAAAGCTAC ATTAGAAGAA GGTATGGTTG TTACCGTTGA ACCCGGTATT TACATACCAA ACTGGGGCGG TTGTAGAATT GAAGATGATA TCGTCATTAC AAAAGACGGA TATGAAGTTA TTACAAAATC AAATCGAGAA CTAATTGTTA TTCCTTGTTA A
|
Protein sequence | MTLKINKIQN QLQNYEIDGL LITKKENRQY ATGFTGSAGV VLITADAAVF ITDFRYVDQA NSQIKNAEII MHKGNLEKEI ANQVSKLNIQ KLGIEENNMT LQQYKNLQKY VHTEMVQVRE IIENIRLIKD THEIETMKIA AHIADEAFHH IITFLKPGIS ENTVRDELEF FMRKKGAASS SFQIIVASGV RSSLPHGVAS NKIIERGDIV TLDFGALYDG YCSDITRTVA IGEPSEEFKK IYNVVREALK RGTEAIKPGE TAKSIDDVTR NYITDCGYGQ YFGHSTGHGL GLEIHEPLRL SQESKATLEE GMVVTVEPGI YIPNWGGCRI EDDIVITKDG YEVITKSNRE LIVIPC
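The numeric fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. The helper names (`span_length`, `expected_protein_length`, `translate`) are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares; table-specific alternative start codons are not handled.

```python
# Values copied from the record above.
START_BP = 3716487
END_BP = 3717557
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356

# Standard amino-acid assignments in NCBI TCAG order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


def translate(dna: str) -> str:
    """Translate complete codons only; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    # First two 10-base blocks and the tail of the gene sequence above.
    print(translate("ATGACTTTAA AAATTAATAA"))
    print(translate("CTAATTGTTA TTCCTTGTTA A"))
```

Under these assumptions the coordinates, gene length, and protein length agree with one another, and the translated ends of the gene sequence match the start (`MTLKIN`) and end (`LIVIPC`, followed by a stop) of the listed protein sequence.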