Gene Acid345_0915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0915 
Symbol 
ID4069126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1153410 
End bp1154384 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content53% 
IMG OID637982922 
ProductTPR repeat-containing protein 
Protein accessionYP_589992 
Protein GI94967944 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0051049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGCTG CGGCGCAGAC CACGGCCACT GCCACGGACC CGCAGAAGGT CATCGCCGAT 
GCTCATAGAC AACTAGATCA TGGACAGTAT GACGAAGCGA TTGTGCAACT GCAGAAACTT
CAACAGGAAC AGTCGGCAAT TGAAGGCCTC GTGAGGGAAA TTGGAATTGC CTATTACAAG
AAGGGCGATT ACCTAAGCGC GATTCGTTAT CTCAAACAAG CGACGAAACA AAATGAAAAA
GACAGCGAAG CGGTGCAACT GCTGGGATTG TCGTACTACT TGTCTGGGAA GCCGCAGGAC
GCGATCCCTC TGTTGGAGAA GGTGCAGACG TGGTTTCCGA GCGCGAATGT TGATGCGTCG
TACATCCTGG GAATCTGCTA CATCCATTTG AAGAATTACG ATCAGGCGCG CGGCGCGTTC
GCACAGATGT TCGATGTGAA GCGAGACTCG GCTGCCAGTT ATCTATTCAC AGCACGGATG
TTGCTTCGAC AGGAATATGA TCCGGTAGCA GAGGAATATG CGCAGAAGGC GATTGCGCTC
GAGCCTACGC TTCCACTGGC GCACTATTTA TTAGGCGAGC TTCACCTCTA TAAGTCACGC
GTGCCAGAGG CAATCGAGGA TTTCCGGAAG GAGTTGAACC TCAATCCGTC CTACGCGCCG
GCGTATTACA AGCTGGCGGA CGCCTACTCG CGCGTTCAAA AGTACGATGA TGCGGAGCGA
CTGCTGCAAC GCTCGATCTG GCTGGATGCA ACGACGACAG GGCCGTACAT ACTTCTCGGC
AAAGTGCTGG AAAAGAAGGG AGAGCCGCTA CTAGCGTTGC GCGCGTTGCA ACATGCAGCG
AGCATGGATC CGAAAAACCC GATCACCCAC CACTTACTCG GGCAGTCGTA TCGCGATCTA
GGTAAGAGTG ATGACGCAGA GCGGGAATTA AAACTCGCCG AGGAATTGCA GAACAAGCAG
AACGAGAATC GCTAA
 
Protein sequence
MSAAAQTTAT ATDPQKVIAD AHRQLDHGQY DEAIVQLQKL QQEQSAIEGL VREIGIAYYK 
KGDYLSAIRY LKQATKQNEK DSEAVQLLGL SYYLSGKPQD AIPLLEKVQT WFPSANVDAS
YILGICYIHL KNYDQARGAF AQMFDVKRDS AASYLFTARM LLRQEYDPVA EEYAQKAIAL
EPTLPLAHYL LGELHLYKSR VPEAIEDFRK ELNLNPSYAP AYYKLADAYS RVQKYDDAER
LLQRSIWLDA TTTGPYILLG KVLEKKGEPL LALRALQHAA SMDPKNPITH HLLGQSYRDL
GKSDDAEREL KLAEELQNKQ NENR