Gene ECD_01735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01735 
SymbolsppA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1793213 
End bp1795069 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content52% 
IMG OID 
Productprotease IV (signal peptide peptidase) 
Protein accessionACT43589 
Protein GI253977919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0080246 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT 
TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG
ATTTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG
CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA
AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC
GTCAACACTA TTCGCCAGGC GAAGGACGAC CGCAATATCA CCGGTATTGT GATGGATCTG
AAAAACTTCG CAGGCGGCGA CCAACCGTCT ATGCAGTACA TCGGCAAAGC TCTGAAAGAG
TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT
TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT GGATTTGCAC
GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC
CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGACATG
TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG
AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG
TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG
CTGGTCGATG CACTGGCATC GAGTGCGGAA ATCGAAAAAG CACTGACCAA AGAGTTCGGC
TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG
CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC
GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC
CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC
GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT
GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC
ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC
GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG
GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCTGATGAT GCAGTTAAGC
ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG
GAGCAGATTG ATAAAATTGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC
GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTCG CCAAAGCAGC AGAGCTGGCA
AAAGTGAAAC AGTGGCATCT GGAATATTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG
ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA
CCTGCACCGC TGGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGTGACAA GCTGGCCGCG
TTTAATGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACAT GCGTTAA
 
Protein sequence
MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL 
LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQLMMQLS
IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
FNDPQNRYAF CLTCANMR