Gene Acid345_0424 details

Gene Information

Locus tag: Acid345_0424
Symbol:
ID: 4069650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 494523
End bp: 495464
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 57%
IMG OID: 637982428
Product: TPR repeat-containing protein
Protein accession: YP_589503
Protein GI: 94967455
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
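
The coordinates and lengths reported above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(494523, 495464)
print(length, protein_length(length))  # 942 313
```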


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.446862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0100173
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAACGC TCATCGGCTT CTGCTGTCTT CTGATTTCTG GTGTCAATGT GCACGCGCAA 
GCCGCATCGA ATAGCCCCGC CAATTCGCCT GCATCGCCCA AGGCGGCTCT TCGTCTGGCG
CAGGAAGGGC ATTGTAAGGA GGCGTTACCG GCGCTGAAAA AGGGCCTCGC TAGCGCGGCA
AAAGACGATC ATCGGGATCT CGCGATGGCT GGCGTGCGCT GCGCGATGTT CATGAACCAG
CCAGAGAGTG CGCTGGAATT TCTACGTGTG CTTGAACGCG AGTTCCCGAG CGATCCCGAC
ACGCTTTACC TTCTTGTTCA TACCTACTCC GATCTCTCAA CCCATGCCGC AGCAGAGTTG
GCGACAAAGC ATGGCAACAC ATACCAGGCT CGCGAATTGA ACGCAGAAGC GCTGGAGTCA
CAAGGCAAGT GGCAGGAGGC GGAGAAAGAG TACAAGACGA TTCTCGAGGC GAACCCTAAA
GCCGTTGGAA TCCACTTTCG CTTAGGCCGA TTGCTTCTCT CCGCGCCGAA TCCGCCAGCC
GATATGGCGG AGCAAGCAAA GAGAGAATTC GAGGCAGAAC TCGCGGTCGA TCCCACAAAT
GCAGGCGCGG AATACGTGCT TGGCGAGTTG GCGAAGACAG CCAATTCCTA TGACGACGCA
ATCCAACACT TCTCCAAAGC CACCAAACTC GACCCCTCTT TCGCAGCCGC GTATCTCGGT
ATTGGAACGA GCTTAGTCGC GCAGAAAAAA TTTGCTGAAG CTGTGACGCC GCTCGAAACA
GCGGTGAAGC TACAGCCCGC TAATCCTGCT GGCCACTACA ATCTCGCAAC CGCCTATAGT
CGCACCGGAC GCAAAGCAGA TGCCGACCGC GAGTTCGCGA TCCATAACGA GATGATGCAA
CGCAGCGGCG GCGCTTCGGC GCCCGTCCAG CAGCCGCAGT AA
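
The GC content field can be recomputed from the gene sequence and compared with the reported 57%. This is a minimal sketch, assuming the sequence is pasted as plain text in 10-base blocks as shown above; `gc_content` is a hypothetical helper name, and characters other than A, C, G, T are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_content("GTGAGAACGC"))  # 60.0
```

Passing the full gene sequence to `gc_content` gives the figure to compare against the record.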
 
Protein sequence
MRTLIGFCCL LISGVNVHAQ AASNSPANSP ASPKAALRLA QEGHCKEALP ALKKGLASAA 
KDDHRDLAMA GVRCAMFMNQ PESALEFLRV LEREFPSDPD TLYLLVHTYS DLSTHAAAEL
ATKHGNTYQA RELNAEALES QGKWQEAEKE YKTILEANPK AVGIHFRLGR LLLSAPNPPA
DMAEQAKREF EAELAVDPTN AGAEYVLGEL AKTANSYDDA IQHFSKATKL DPSFAAAYLG
IGTSLVAQKK FAEAVTPLET AVKLQPANPA GHYNLATAYS RTGRKADADR EFAIHNEMMQ
RSGGASAPVQ QPQ
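
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a hedged illustration rather than the database's own method: table 11 uses the same codon-to-amino-acid assignments as the standard code, and the first codon (here GTG, an alternative start) is assumed to be a valid start codon and is written as M. `translate_cds` is a hypothetical name.

```python
from itertools import product

# Codon assignments in TCAG order, as used by NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is emitted as 'M'.
    """
    dna = "".join(seq.split()).upper()  # remove block spacing and newlines
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAGAACGC TCATCGGCTT CTGCTGTCTT"))  # MRTLIGFCCL
```

The output matches the first ten residues of the protein sequence listed above.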