Gene B21_00397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00397 
SymbolppiD 
ID8113200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp430850 
End bp432721 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content52% 
IMG OID644846681 
Producthypothetical protein 
Protein accessionYP_002998254 
Protein GI251783950 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.955616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGACA GCTTACGCAC GGCTGCAAAC AGTCTCGTGC TCAAGATTAT TTTCGGTATC 
ATTATCGTGT CGTTCATATT GACCGGCGTG AGTGGTTACC TGATTGGCGG AGGCAATAAC
TACGCCGCAA AAGTGAATGA CCAGGAAATC AGCCGTGGGC AATTCGAGAA CGCCTTCAAC
AGCGAGCGTA ATCGCATGCA GCAACAGCTG GGCGATCAAT ACTCCGAGCT GGCAGCGAAC
GAAGGCTATA TGAAAACCCT GCGTCAACAG GTGCTGAATC GTCTGATCGA CGAGGCGCTG
CTGGATCAGT ACGCACGTGA GCTGAAACTG GGTATCAGCG ATGAGCAGGT TAAACAGGCG
ATTTTCGCGA CCCCAGCCTT CCAGGTTGAC GGCAAATTTG ATAACAGCCG CTATAACGGT
ATCCTCAACC AGATGGGGAT GACCGCCGAT CAGTACGCCC AGGCGCTGCG TAACCAGCTC
ACTACCCAAC AGCTGATTAA CGGCGTTGCC GGTACCGATT TTATGCTGAA AGGTGAAACC
GACGAGCTGG CGGCACTGGT CGCGCAACAA CGCGTGGTGC GTGAGGCGAC TATCGATGTT
AACGCGCTGG CGGCGAAGCA GCCTGTGACC GAACAGGAGA TTGCCAGCTA CTACGAACAA
AACAAAAACA ATTTCATGAC GCCGGAACAA TTCCGCGTGA GTTACATCAA GCTGGATGCC
GCAACGATGC AGCAACCGGT TAGCGATGCG GATATCCAGA GCTACTACGA CCAGCATCAG
GATCAATTCA CCCAGCCGCA GCGTACCCGC TACAGCATCA TCCAGACCAA AACTGAAGAT
GAAGCGAAAG CGGTACTTGA TGAGCTGAAT AAAGGCGGTG ATTTTGCTGC ATTAGCCAAA
GAAAAATCTG CCGATATTAT CTCTGCTCGT AACGGCGGCG ATATGGGTTG GTTAGAAGAT
GCCACTATCC CGGACGAACT GAAAAATGCT GGTCTGAAAG AAAAAGGCCA ACTGTCTGGT
GTCATCAAAT CTTCGGTCGG TTTCCTGATT GTACGTCTGG ACGACATTCA GCCAGCGAAA
GTGAAATCGT TAGACGAAGT ACGTGACGAC ATTGCGGCGA AAGTGAAACA CGAAAAAGCC
CTCGATGCGT ACTACGCGCT GCAGCAGAAA GTGAGCGATG CGGCAAGCAA CGACACCGAG
TCTCTGGCCG GTGCAGAGCA AGCTGCCGGC GTTAAAGCCA CTCAGACGGG TTGGTTCAGC
AAAGATAACC TGCCGGAAGA GTTGAACTTC AAGCCGGTTG CCGACGCTAT CTTTAACGGC
GGTCTGGTAG GTGAAAACGG CGCGCCGGGC ATCAACTCTG ACATCATCAC CGTAGACGGC
GACCGCGCAT TCGTGCTGCG CATCAGCGAG CACAAACCGG AAGCGGTGAA ACCGTTGGCA
GATGTTCAGG AACAAGTTAA GGCACTGGTT CAGCACAACA ACGCTGAACA ACAGGCGAAA
GTGGATGCTG AGAAACTGCT GGTTGATTTG AAAGCCGGCA AAGGTGCGGA AGCTATGCAG
GCTGCCGGTC TGAAATTTGG AGAGCCGAAA ACCTTAAGCC GTTCCGGTCG TGACCCGATT
AGCCAGGCGG CGTTTGCACT GCCACTGCCA GCGAAAGACA AACCGAGCTA CGGTATGGCG
ACCGATATGC AAGGCAATGT GGTTCTGCTG GCGCTGGATG AAGTGAAACA AGGTTCAATG
CCGGAAGATC AGAAAAAAGC GATGGTGCAG GGTATCACCC AGAACAACGC ACAAATCGTC
TTTGAAGCTC TGATGAGTAA CCTGCGTAAA GAGGCGAAAA TCAAAATTGG CGATGCGCTG
GAACAGCAAT AA
 
Protein sequence
MMDSLRTAAN SLVLKIIFGI IIVSFILTGV SGYLIGGGNN YAAKVNDQEI SRGQFENAFN 
SERNRMQQQL GDQYSELAAN EGYMKTLRQQ VLNRLIDEAL LDQYARELKL GISDEQVKQA
IFATPAFQVD GKFDNSRYNG ILNQMGMTAD QYAQALRNQL TTQQLINGVA GTDFMLKGET
DELAALVAQQ RVVREATIDV NALAAKQPVT EQEIASYYEQ NKNNFMTPEQ FRVSYIKLDA
ATMQQPVSDA DIQSYYDQHQ DQFTQPQRTR YSIIQTKTED EAKAVLDELN KGGDFAALAK
EKSADIISAR NGGDMGWLED ATIPDELKNA GLKEKGQLSG VIKSSVGFLI VRLDDIQPAK
VKSLDEVRDD IAAKVKHEKA LDAYYALQQK VSDAASNDTE SLAGAEQAAG VKATQTGWFS
KDNLPEELNF KPVADAIFNG GLVGENGAPG INSDIITVDG DRAFVLRISE HKPEAVKPLA
DVQEQVKALV QHNNAEQQAK VDAEKLLVDL KAGKGAEAMQ AAGLKFGEPK TLSRSGRDPI
SQAAFALPLP AKDKPSYGMA TDMQGNVVLL ALDEVKQGSM PEDQKKAMVQ GITQNNAQIV
FEALMSNLRK EAKIKIGDAL EQQ