Gene BT9727_4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBT9727_4344 
SymbolpepQ 
ID2858060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus thuringiensis serovar konkukian str. 97-27 
KingdomBacteria 
Replicon accessionNC_005957 
Strand
Start bp4414304 
End bp4415401 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content39% 
IMG OID637515759 
ProductXaa-Pro dipeptidase (proline dipeptidase) 
Protein accessionYP_038659 
Protein GI49481350 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones68 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGCG 
TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA
AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAGCCTA TTTTAATTTG TCCTAAAATG
GAAGAAGGTC AAGCGCGTAA CGCTGGATGG GCACATGAAA TTATCGGATT TACTGATACT
GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGTATCAA TGCAAATGCA
GTTGCAATTG AAAAAGAACA TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAACTATTC
CCAAATGCAG CTTTCACATC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAT
GAGAAAGAAC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT
GGTGTAAATG CAATTAAAGA AAATCGTAGC GAACTAGAAG TATTAGCAAT TATTGAACAT
GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA
AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAACCGCGG CGATTTCGTA
CTATTTGATT TAGGCGTAAT CATTGATGGC TATTGCTCTG ACATTACACG TACAGTGGCA
TTTGGCGAGA TTTCTGAAGA ACAAACTCGC ATTTACAACA CTGTACTTGC TGGACAACTA
CAAGCAGTTG AAGCATGTAA ACCAGGTGTT ACACTTGGCG CAATTGACAA CGCTGCTCGT
TCTGTTATCG CAGATGCAGG TTACGGCGAC TTCTTCCCGC ACCGCCTTGG TCACGGACTT
GGAATTAGCG TACACGAATA TCCAGATGTA AAAGCTGGTA ACGAATCTCC ATTAAAAGAA
GGTATGGTCT TCACAATTGA ACCAGGTATT TACGTACCAA ACGTAGGTGG CGTTCGTATT
GAAGATGATA TTTATATCAC AAAAGACGGA TCAGAAATTT TAACGAAGTT CCCGAAAGAA
TTACAATTTG TAAAATAA
 
Protein sequence
MNARLENLMQ WLKEKNVEAA FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM 
EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGINANA VAIEKEHLNV ERYEELTKLF
PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKENRS ELEVLAIIEH
ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMNRGDFV LFDLGVIIDG YCSDITRTVA
FGEISEEQTR IYNTVLAGQL QAVEACKPGV TLGAIDNAAR SVIADAGYGD FFPHRLGHGL
GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE
LQFVK