Gene Acid345_0808 details

Gene Information

Locus tag: Acid345_0808
Symbol:
ID: 4068687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1000440
End bp: 1001900
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 59%
IMG OID: 637982815
Product: TPR repeat-containing protein
Protein accession: YP_589887
Protein GI: 94967839
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
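
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1000440, 1001900)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```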


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.224492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.93988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGTT TGATGACCAA GCTTATGTGC TATGCCGTGC TCGCGGCACC GTGCGCGTTT 
GCTACCGGAT CAGGCCAGGA GGCCACTCCG GAGAAAGACA TTCCACTCAC CACTAGTTCG
GACGTAGCGC GGCGCGCGTT CCAGGGCGGA CTCGAGAATA TCGAGAACCA GCAGTCCAAG
CGTGCCCATG TCGACTTCCG GTCCGCAGTG CGCGCTGACA GCAACTTTGC GCTGGCGCAC
TTGTTCCTGG CATACGACAA CGGAAATCCG GCGGAAGAGA AGGCGGAGCT TCAGAAGGCG
CGCACGCTGG CCGCGAACGC CTCGAAGCCC GAACAGATGC TGGTGGAATG GATGGCGGGA
TCGCGTGAGG GGAAGATGGT TCCGGCGATC TCTGCACTGA ACGATGTGAC GTCCGCGTAT
CCCGAAGACA AGTTCCTGCT ATTTCTTGCC GGTCGCTGGA TGGTGCAACA GCAGAATTAC
GAAGGTGCGC AGCGGTTCCT CGAGCGGGCA GTGACGATCG ATCCGAATTA TCCGGCGGCC
CTGAACGAAC TGGCGTATGC CTACGCCGGC AACAGGATCT TCGACAAGGC ATTCGAGGCA
CTCGACAAGT ACGCGAAGCT GCTTCCAGGT GAACCGAATA CGCAGGACTC CTACGGGGAG
ATCAGCCTGA AGGCGGGGCG ATTCGAGCAG GCGGTTGAAC ACTACAAGAA AGCGCTGGAA
TACGATTCGA CGTTCGTGTG GTCGCAGGTT GGCCTGGGCG ACAGTTACAT GCTGATGGGG
AAAGAGCAGC AGGCCCGTGC AGAATATGCG AAGGCGGCTG CGATGGCACC GTCGGATGGC
GATCGGCTTA CGTGGCAACT GCAGTCGGCA TTGACCTATA TGTTCGCGCA TCAGCATGAA
TCCGCCGATG ATGCGTTCAC GAAGGTGGCT GAAGAAGCGA GTTCGTTGCA TATCGGCAAG
CAGGAAGCCA TGGCGCACAG GCTTATGTCG GCTTACGACC CGGATTTGCC GGGGTTCCTG
CGGCACACTG CTGACGCTGA AAGCGCGTTG AAGACACGCT CGGATATATC GAAGAGCGAT
CGCGACCAGG AGATGGCGCT GCTGTTAAAA TCCCGCGCTG TGCGGGCGGC GGAATTCGGC
CGCACGGACA TGGCGAACGA GTCCTTGAAG AAGCTCAGCC AGATGGCGGA GAGCATCCCG
GACAACGTAA TTCAACAGGC GAACGAAGGC GCGCAGGGCG GCGTATTGTG GGTCCAGAAG
AAATACGCCG AGGCCATACC TCATTTGGAA GAGGACCAGG GGAATCCGTT GAGTGCGGCG
CGCCTGTTGC AGGCGTATCG CGAGAGCGGA GACTCGTATC GTGCCGACTC TCGGGCGATA
CGGTTAAATA GTTATTACGA GCCGACGCTC GACGATTACC TGGCCAGGCA GCTGTTGAAC
GGCAAACAGC GCAAGAAGTA A
 
Protein sequence
MNRLMTKLMC YAVLAAPCAF ATGSGQEATP EKDIPLTTSS DVARRAFQGG LENIENQQSK 
RAHVDFRSAV RADSNFALAH LFLAYDNGNP AEEKAELQKA RTLAANASKP EQMLVEWMAG
SREGKMVPAI SALNDVTSAY PEDKFLLFLA GRWMVQQQNY EGAQRFLERA VTIDPNYPAA
LNELAYAYAG NRIFDKAFEA LDKYAKLLPG EPNTQDSYGE ISLKAGRFEQ AVEHYKKALE
YDSTFVWSQV GLGDSYMLMG KEQQARAEYA KAAAMAPSDG DRLTWQLQSA LTYMFAHQHE
SADDAFTKVA EEASSLHIGK QEAMAHRLMS AYDPDLPGFL RHTADAESAL KTRSDISKSD
RDQEMALLLK SRAVRAAEFG RTDMANESLK KLSQMAESIP DNVIQQANEG AQGGVLWVQK
KYAEAIPHLE EDQGNPLSAA RLLQAYRESG DSYRADSRAI RLNSYYEPTL DDYLARQLLN
GKQRKK
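
The sequences above are displayed in blocks of ten characters, so whitespace has to be stripped before they can be used. The following is a minimal sketch of how one might clean a displayed sequence, compute its GC content, and translate it. The helper names are hypothetical. The codon table is the standard set of amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits), and the translation stops at the first stop codon.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(displayed: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(displayed.split()).upper()


def gc_percent(displayed: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(displayed)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(displayed: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(displayed)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three displayed blocks of the gene sequence above.
print(translate("ATGAACCGTT TGATGACCAA GCTTATGTGC"))  # MNRLMTKLMC
```

Applied to the full gene sequence, the same functions can be used to compare the result with the listed protein sequence and GC content.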