Gene Ent638_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3670 
Symbol 
ID5111918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3977541 
End bp3978650 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID640493875 
Productserine endoprotease 
Protein accessionYP_001178378 
Protein GI146313304 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.932831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.311262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATGCT GCATCCGTCC CTTAATTAAC GACGCCCGCA TCATGTTTCT TAAGCTATTT 
CGTTCAGTTG CTATCGGTTT GGTCGTTGGC GGCCTGTTAT TGGCTGCAAT GCCCTCTTTA
CGTCAGTTAA ATCAGCTGGC GGCACCGCGT TTCGACAGCA CCGATGAGAC GCCTGTGAGC
TACAACCAGG CGGTTCGCCG TGCTGCGCCT GCTGTGGTTA ACGTCTATAA CCGCGGCCTC
AACAGTTCCG CGCATAATCA ACTTGAGATT CGTACTCTCG GCTCCGGCGT GATCATGGAT
GAGCGCGGCT ACATCATTAC CAACAAACAC GTCATTAACG ACGCCGATCA GATTATCGTC
GCCCTGCAGG ATGGCCGCGT GTTTGAAGCC TTATTGGTTG GCTCGGACAC GCTGACCGAT
CTCGCCGTCC TCAAAATCAA TGCCACGGGC GGGCTGCCGG TGATTCCGAT CAACCGTAAA
CGCACGCCGC ATATCGGCGA TGTGGTGATG GCAATTGGTA ACCCGTATAA CCTTGGGCAA
ACCATTACCC AGGGGATTAT CAGCGCCACC GGTCGTATCG GTTTGAATCC CTCCGGGCGA
CAAAACTTCC TGCAAACGGA CGCCTCCATT AACCACGGTA ACTCCGGCGG CGCGCTGGTG
AATTCGCTGG GAGAATTAAT GGGCATCAAT ACCCTGTCGT TCGATAAAAG TAACGATGGC
GAAACGCCTG AGGGCATCGG TTTTGCGATC CCGTTCCAGC TGGCCAACAA AATTATGGAC
AAGCTGATTC GCGATGGTCG CGTCATTCGC GGCTATATTG GCATCGGTGG CCGAGAAATT
GCGCCAATGC ACACTCAGGG CGGCGGCATC GATCAGATTC AGGGTATCGT GGTCAATGAA
GTGACACCGG GTGGTCCGGC GGCTAACGCG GGGCTACAGG TGAACGACGT GATTGTTTCG
GTCAATGGCA CGCCTGCCGT ATCCGCACTA GAAACAATGG ATCAGGTAGC AGAGATTCGT
CCTGGCTCGA TTATTCCGGT CGAAGTCATG CGTAATGACA AAAAACTGAC GCTTCAGGTG
ACGATTCAGG AATATCCCGC CACTAACTAA
 
Protein sequence
MLCCIRPLIN DARIMFLKLF RSVAIGLVVG GLLLAAMPSL RQLNQLAAPR FDSTDETPVS 
YNQAVRRAAP AVVNVYNRGL NSSAHNQLEI RTLGSGVIMD ERGYIITNKH VINDADQIIV
ALQDGRVFEA LLVGSDTLTD LAVLKINATG GLPVIPINRK RTPHIGDVVM AIGNPYNLGQ
TITQGIISAT GRIGLNPSGR QNFLQTDASI NHGNSGGALV NSLGELMGIN TLSFDKSNDG
ETPEGIGFAI PFQLANKIMD KLIRDGRVIR GYIGIGGREI APMHTQGGGI DQIQGIVVNE
VTPGGPAANA GLQVNDVIVS VNGTPAVSAL ETMDQVAEIR PGSIIPVEVM RNDKKLTLQV
TIQEYPATN