Gene Acid345_3615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3615 
Symbol 
ID4070135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4277040 
End bp4278101 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID637985638 
ProductTPR repeat-containing protein 
Protein accessionYP_592690 
Protein GI94970642 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.242924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.313882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACC GGCCCCTTTC CCTTCTCCTC GGTATCGTGG TCGCAACCAC GGCCATCGGT 
ACCGCCCAGA TTGAAGAGTA CCGTACCGGC AGCGATTCTC CCGCTGCAGA CACCTTAACG
CACTCGACGT TGACCGGTAC GGTGACTTCC GCCGACGGGT CGCCTCTCAA TAATATCCGC
ATTGAGGTTC GCAGGATCGG GATCGGCTCA CCGGCGGACG CAACTTACAG CCATGTGAAC
GGCTCGTTCG ACTTCGCGAA CCTGCGGCCG GGTTCCTACG AAGTGGTGGC GATTGACGGT
GTGATGGAGG CGCGGGAACA GTTCATCGTA CAGAGCCAGT TGGTGTCTCT CAGCCTGCGG
ATGCCGGTAA CCCGGTCAGC AGCACCGACC CGCGGCACGA TTTCTGTAGC GGAATTAAAG
GTCCCCGATA AAGCCAAGCA CCTGCTCGAC AAGGCGCAAG GGGCGCTGTC GAAGGGTCAC
AGCGACGAAG CCGAGAAGCA GGTAGAAGAG GCGCTGCAGG CAGCGCCAGA TTATGCGGCC
GCACTGTCAT TCCGCGCCGC GCTGAAACTT ACCCGCAACG ATACGCAATC GGCGCTCGAT
GACCTCGACC ACGCGGTAAA GGCCGATCCG AATTTTGCGC AGGCTTACAT GTTGCTGGGA
GCGGCGTTTA ACCAGCTAGG CCGCTACGAC GAGGCGCTCC GCAGCTTGGA TCGTGGCTCG
ATGTATGACC CTAAGTCATG GCAGGTTTCC TACGAGATGT CGAAGGCGTG GATGGGCAAG
CATGATTACG TTCATGCCAT CCAGCAGCTG AACCGGACGG AGTCGTTGGG CGCAGTGAGA
ATCGCGGGGC AGGTGCATCT GCTCAAGGGC TACGCGTTCA TGGGCCAGAA ACAATTTGAG
CAGGCACAGA CGGAACTGCA GGCGTACTTA ACGTCCGAAC CTCAGAGCAA GATGGCGGGA
TCGGTTCGCG CTGCCCTTGC ACAGATCCAG ACCCAGATGG CGCAGAGTCC TGCGGCGTTG
ACGTTGCCGA CGATGACGGG GATCTTCGCG CAGGCGCACT GA
 
Protein sequence
MYYRPLSLLL GIVVATTAIG TAQIEEYRTG SDSPAADTLT HSTLTGTVTS ADGSPLNNIR 
IEVRRIGIGS PADATYSHVN GSFDFANLRP GSYEVVAIDG VMEAREQFIV QSQLVSLSLR
MPVTRSAAPT RGTISVAELK VPDKAKHLLD KAQGALSKGH SDEAEKQVEE ALQAAPDYAA
ALSFRAALKL TRNDTQSALD DLDHAVKADP NFAQAYMLLG AAFNQLGRYD EALRSLDRGS
MYDPKSWQVS YEMSKAWMGK HDYVHAIQQL NRTESLGAVR IAGQVHLLKG YAFMGQKQFE
QAQTELQAYL TSEPQSKMAG SVRAALAQIQ TQMAQSPAAL TLPTMTGIFA QAH