Gene Acid345_1591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1591 
Symbol 
ID4069029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1938975 
End bp1939964 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content57% 
IMG OID637983600 
Producttype 4 prepilin peptidase 1. Aspartic peptidase. MEROPS family A24A 
Protein accessionYP_590667 
Protein GI94968619 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.395261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCACCGT TCGTGGACAT TCTCTTCGCG ACATTCGCCT TTCTCTTCGG CCTAGTCTTC 
GGCAGCTTTT TGAATGTCTG CATTTACCGG CTGCCGCGCG GATTATCCGT AGTCACGCCA
CGTTCTGCTT GTCCCAACTG CCATAAACCT ATCGCTGCTT ACGACAATAT TCCCGTCCTG
AGCTGGATCA TCCTTGGCGG CAAGTGCCGC AACTGCAAGA CTCCGATCAC GCCCCGCTAT
GCCATCGTGG AGCTGGTCTG TGGGCTGCTG GTGCTGGCGT GCTACCTGGA ATATGGGCCA
AAACTCGGGC CCAGCATGAT GATCGACGGC ACTGTCTTCA AAGTACTCCT GCTGCCGTAT
CTGCTTAGAT TCCTGAAGTA CGCCATTCTC GCTTATCTGC TACTCGGCCT CATCTTCACG
GATGCCGAGA CTCAACTCCT GCCCGACAAG ATGACCCTGC CGGGCCTAGT GATTGGACTG
ATTTTTAGCG TTTTGGTACC CATGCAGGAC CTGGTGGTCA TGCTTTTCCT GGAATTTGTA
CCGTTTCCGA TTCCACATGG ACATCAGGTG CTTTGGATTT CAGTGATCAG CTCCGTCACC
GGCGCGATCG TGGGCGGAGG CTTCATCTGG GGCGTAGGCG CATTGTGGAA GCTCGCGCGC
GGCTATGAGG GAATGGGCTT TGGCGACGTG AAGCTCATGG CGATGGTCGG CGCGTACCTT
GGCGCGGCCA CGACGTTGAT CGTGATTTTC ACAGCATCGA TCATGGGATC GGTTATCGGG
CTGGCGACGA TTGGAATCGT CTATCTACAG CGCCGCGGCA GGTACGTGAA AAAGCTCGGA
CCGGCGCAAG GCGCTAGCCG CGCTCGCAAG GCAGCGTTTG TTGCCTACCG CTACCTGCCA
ATGCCCTTTG GCGTTTCCCT TGGCGCCATG GCGCTGGTCG CGGTGTATTT CAGCCGGTAT
ATCTACCATT GGTGGTTAGG CACGCCGTGA
 
Protein sequence
MPPFVDILFA TFAFLFGLVF GSFLNVCIYR LPRGLSVVTP RSACPNCHKP IAAYDNIPVL 
SWIILGGKCR NCKTPITPRY AIVELVCGLL VLACYLEYGP KLGPSMMIDG TVFKVLLLPY
LLRFLKYAIL AYLLLGLIFT DAETQLLPDK MTLPGLVIGL IFSVLVPMQD LVVMLFLEFV
PFPIPHGHQV LWISVISSVT GAIVGGGFIW GVGALWKLAR GYEGMGFGDV KLMAMVGAYL
GAATTLIVIF TASIMGSVIG LATIGIVYLQ RRGRYVKKLG PAQGASRARK AAFVAYRYLP
MPFGVSLGAM ALVAVYFSRY IYHWWLGTP