Gene Ent638_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3669 
Symbol 
ID5111917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3976124 
End bp3977491 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID640493874 
Productserine endoprotease 
Protein accessionYP_001178377 
Protein GI146313303 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0364754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.327931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AAAACCAGCT GTTAAGCGCG ATTGCGTTAA GTGTCGGGTT ATCTCTCACG 
GCGTCATGGC CTGCGGCTGC GGCCCTGCCT TCGCAAGTGC CTGGTGAGGC AGCCATTCCA
AGCCTTGCGC CAATGCTTGA AAAAGTGTTG CCGGCGGTGG TCAGCATTAA GGTTGAGGGC
ACGGCGGCAC AGAGCCAACG TATACCAGAA GAGCTAAAAA AATATTTTGG TGAAGAGGGA
CCCGATCAGC AAACCCAGCC GTTTGAAGGG CTTGGCTCCG GGGTAGTGAT TGATGCGGCG
AAAGGCTACG TGCTGACGAA CAATCACGTT ATCAGCCAGG CCGATAAAAT CAGCGTGCAA
ATGAATGATG GTCGCGAGTT CGATGCCAAA CTGATTGGCA GCGACGATCA AAGTGACATC
GCGCTACTGC AAATCCAGAA TCCCAGCAAG CTGACGCAAA TTGTTATCGC GGATTCCGAC
AAATTGCGTG TCGGAGATTT TGCCGTGGCC GTGGGTAACC CTTTCGGTCT GGGCCAGACG
GCAACCTCAG GCATTATCTC TGCGTTGGGT CGTAGCGGTC TAAATCTCGA AGGCCTGGAG
AACTTTATCC AGACTGACGC CTCCATTAAC CGCGGGAATT CCGGCGGCGC ACTGCTAAAC
CTCAACGGTG AGCTGATTGG TATTAACACC GCGATTCTTG CTCCCGGTGG CGGCAGCGTT
GGTATTGGCT TTGCGATCCC CAGCAATATG GCCAAAATCC TCTCGCAACA GCTGATTGAA
TCCGGCGAGG TAAAACGCGG TCTGCTGGGA ATTAAAGGCA TGGAAATGAG CGCTGATATC
GCAAAAGCCT TTAATCTCGA CGTGCAGCGT GGGGCGTTTG TCAGTGAAGT TCTGGCGAAC
TCCGGCTCCG CAAAAGCGGG TGTGAAATCG GGCGATATCA TCGTTAGCCT CAACGGTAAG
CCGCTGAGCA GCTTCGCTGA ATTACGCTCA CGTGTTGCCA CCACGGAACC TGGGACGAAG
GTGAAGCTTG GGTTGCTGCG CGATGGCAAA CCGCTGGAAG TCGAAGTTAC GCTGGATAAG
AGCACCTCAT CGTCGGCCAG TGCTGAACTG ATTGCCCCTG CACTACAAGG TGCGGCGCTC
AGCGATGGTC AGCTTAAAGA CGGTACAAAA GGTATTTCTA TCGAGAGCGT CGAAAAGAGC
AGTCCCGCCG CGCAGGCAGG CCTGCATCAG GATGACGTGA TTATCGGTGT TAACCGCAGC
CGTGTTCAGT CGATCGCCGA GATGCGCAAG GTCCTGGAAA GCAAGCCAGC AGTGATTGCG
CTGCAAATCA TCCGTGGCAA CGATACGCTC TACATTCTAT TACGCTAA
 
Protein sequence
MKKKNQLLSA IALSVGLSLT ASWPAAAALP SQVPGEAAIP SLAPMLEKVL PAVVSIKVEG 
TAAQSQRIPE ELKKYFGEEG PDQQTQPFEG LGSGVVIDAA KGYVLTNNHV ISQADKISVQ
MNDGREFDAK LIGSDDQSDI ALLQIQNPSK LTQIVIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSV
GIGFAIPSNM AKILSQQLIE SGEVKRGLLG IKGMEMSADI AKAFNLDVQR GAFVSEVLAN
SGSAKAGVKS GDIIVSLNGK PLSSFAELRS RVATTEPGTK VKLGLLRDGK PLEVEVTLDK
STSSSASAEL IAPALQGAAL SDGQLKDGTK GISIESVEKS SPAAQAGLHQ DDVIIGVNRS
RVQSIAEMRK VLESKPAVIA LQIIRGNDTL YILLR