Gene Acid345_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0446 
Symbol 
ID4071693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp526694 
End bp527827 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content56% 
IMG OID637982450 
ProductTPR repeat-containing protein 
Protein accessionYP_589525 
Protein GI94967477 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.021695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGGG GCTGTTTTGT CGTCGTATCG CTTTTTTTTC TTCCGTATTT CGTATTCGCC 
CAGGCCCCCG ATCCTTCCGC AGCCATCGCC TTTGAACAAC AAGGGAAACT CCAGGAAGCG
GAGAGGACCT GGCGCGAGGT CATCAAGCAA AATCCCAAAG ATGCGGGTGC CTTTGCCAGT
CTGGGTGTAG TCCTGTCGAA GGAACAGAAA TATCAGGAGG CCGCTGCAGC TGATCGCAAA
GCGCTCACGC TTGATCCTTC TCTGCCGGGG GTGCAACTCA ATCTTGGACT GGCGGAGTTC
AAGCGAAATA ACTTCGTAGC AGCCATACCG CCACTCAAAG CCGCCTACGC CGCGGATCCA
GCCAGTAAGC AGGCGGTGAC TTTACTTGGT TTGAGTTACT ACGGGTCGAA GCAATTTGCC
GAGGCAAGCA AGTATCTCGG AATCGCCGCC AAGGCAGATC CAGCCAACAT TGAATTGCAT
CAGGTGCTGG CGCAAAGCTG TCTCTCCGCC CGGAATTACG ACTGTGCGCT CGAGGAATTC
CGCCAAATCT CCGCCCAGAA TCCGCAATCG GCCGCGGTTC ATATGCTCAC CGGCGAGGCG
CTCGATGGAA CAGGACACAC CGCTGCCGCC ATTGAGGAAT TCAAAGCCGC GGTCAACATC
TCACCGCGCG AGCCTAATCT GCACTTTGGG CTCGGTTATC TTTTCTGGAA GTCGCACCAA
TATGACGACG CCAAAGCCGA ATTTGAGAAA GAACTCGCGA TCGATTCGGA TCACGCTCTC
GCCTTGGGCT ATCTTGGCGA TATCGCCATG AAACAGAATC GGCTGGAAGA AGCCTCAAAG
TTCCTCCGCA AGGCGATCAG TGCCAAGCCT GATCTTCGAA TGGCATATGT AGATCTCGGT
TCCGTGCTGA CCGAACAAAA GCAGTATGAG GAAGCCATGG AAGCTCTCAA GCACGCCATC
AAACTCGATC CGAGCCAACC GGATGCACAC TTTAAACTCG GCCGAGTTCT GCAGCGACTC
GGGCGGTCCG AGGAATCGCG CAAAGAGCTG GCCAAGGTGC GCGAACTTCA TGAGCAGGCC
GACGCCCCGC TTGCCACACA ATTGCCGGAG TCTGCCGCCC CGCTGCCGAA ATGA
 
Protein sequence
MARGCFVVVS LFFLPYFVFA QAPDPSAAIA FEQQGKLQEA ERTWREVIKQ NPKDAGAFAS 
LGVVLSKEQK YQEAAAADRK ALTLDPSLPG VQLNLGLAEF KRNNFVAAIP PLKAAYAADP
ASKQAVTLLG LSYYGSKQFA EASKYLGIAA KADPANIELH QVLAQSCLSA RNYDCALEEF
RQISAQNPQS AAVHMLTGEA LDGTGHTAAA IEEFKAAVNI SPREPNLHFG LGYLFWKSHQ
YDDAKAEFEK ELAIDSDHAL ALGYLGDIAM KQNRLEEASK FLRKAISAKP DLRMAYVDLG
SVLTEQKQYE EAMEALKHAI KLDPSQPDAH FKLGRVLQRL GRSEESRKEL AKVRELHEQA
DAPLATQLPE SAAPLPK