Gene Veis_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1784 
Symbol 
ID4693290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1991548 
End bp1992705 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID639849550 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_996556 
Protein GI121608749 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.98033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC TCTGGCTGTT GTTTTCCCAG GCCGTGACGG TGTTCGTTGC GGCCTATTTC 
GTCGTCGCCA CGTTGCAGCC CGAATGGCTG GGCCGGGGCG GTACGCGCTC CGCAGCCGGC
GTTGCCTTGC TGCAGGCGCC GGCCACGCCC GGCGCGCAGC CGGCGAACGG CAGCTTCAGC
GCGGCAGCGC ACCGGGCGGC GCCGGCGGTG GTCAGCATCA ATACCAGCAA GGAAGTGCGG
AGTTTGCGCA GCAACGACCC GTGGTTCCAG TTCTTCTTTG GCGACCAGGG CGGACAGGGC
AGCCAGTCGC AGGTCGGGCT GGGCAGCGGC GTGATCGTCA GTCCGGATGG CTATATCCTC
ACCAACAACC ATGTGGTCGA GGGGGCCGAC GAGATCGAGG TCACATTGAC CGATGGCCGC
CGCGCCCGTG CCCGCGTGAT CGGCACCGAT CCCGACACCG ACCTGGCGAT CCTGAAGGTC
GCGCTGGACA AACTGCCCGT GATCGTGCTG GGCAACTCCG ATACGCTCGA TGTGGGCGAC
CGGGTGCTGG CCATTGGCAA TCCGTTCGGC GTGGGCCAGA CCGTGACCAG CGGCATCGTC
AGTGCGCTGG GCCGCAACCA GTTGGGTATC AACACCTTCG AGAACTTCAT TCAGACCGAC
GCGGCCATCA ACCCCGGCAA TTCGGGCGGC GCGCTGGTCG ATGTGAGCGG CAATCTGTTG
GGCATCAACA CCGCGATCTA TTCGCGCTCG GGCGGCAGCA TGGGGATAGG CTTTGCGATC
CCGGTGTCCA CGGCCCGGCT GGTGCTCGAC AGCATCGTCA GGGATGGCAA GGTCACGCGG
GGCTGGATTG GCGTGGAGCC CAGTGTGCTG TCGCCCGAAC TGGCCGAAGC CTTTGGCGTG
AAGAAGACCA CCAGGGGCGT GATCGTCATT GGTGTCGCGC AGAATGGCCC CGCAGCCCAG
GCCGGCATGC GCCCGGGCGA TGTGGTGCTG CGCGTCGATG GCAAGAGCGT GGTCAGCGCG
CCCGAGTTGC TCAGCGCCGT GGCGGCACTC AAGCCCGGCA CGGACTCGGT CTTTCAGGTG
CAGCGCGGGG ATCGACTGGT GGAATTGCAC GTCAATCCCG GCGTGCGCCC CCGGCCACAG
CGCAACGTGC GGCGCTGA
 
Protein sequence
MKRLWLLFSQ AVTVFVAAYF VVATLQPEWL GRGGTRSAAG VALLQAPATP GAQPANGSFS 
AAAHRAAPAV VSINTSKEVR SLRSNDPWFQ FFFGDQGGQG SQSQVGLGSG VIVSPDGYIL
TNNHVVEGAD EIEVTLTDGR RARARVIGTD PDTDLAILKV ALDKLPVIVL GNSDTLDVGD
RVLAIGNPFG VGQTVTSGIV SALGRNQLGI NTFENFIQTD AAINPGNSGG ALVDVSGNLL
GINTAIYSRS GGSMGIGFAI PVSTARLVLD SIVRDGKVTR GWIGVEPSVL SPELAEAFGV
KKTTRGVIVI GVAQNGPAAQ AGMRPGDVVL RVDGKSVVSA PELLSAVAAL KPGTDSVFQV
QRGDRLVELH VNPGVRPRPQ RNVRR