Gene Acid345_0912 details


Gene Information

Locus tag: Acid345_0912
Symbol:
ID: 4069123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1146681
End bp: 1148927
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 56%
IMG OID: 637982919
Product: TPR repeat-containing protein
Protein accession: YP_589989
Protein GI: 94967941
COG category: [R] General function prediction only
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
        [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
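
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon, neither of which the record states explicitly. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1146681
end_bp = 1148927
gene_length_bp = 2247
protein_length_aa = 748

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so an unspliced
# CDS encodes one amino acid fewer than its number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and remainder == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 2247 bp, which is 749 codons, and removing the stop codon gives the 748 aa listed as the protein length.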


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.795031
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00626987
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACTTA CATTCGAAAT TCGCGAATGG CTGTTCACGG GAGTGCTCGC CGGCATAGCC 
TTTGTTTCCG TTTCCTCGAT GTTTGCGCAG ACACCGCAGG CTGCCACGGT GCAAAAGGAC
GCTAGTTCCC CCGAGGCTCA CTTCGATTCC GCGCAGACTT TCCAGATTGC TAGTGACCTC
ACCCGCGCTG CGGCAGAGTA TCGCAAGGGG ATCTCGGTCG CACTCGAGCG CGTCGGGAAT
TTGAAAGTTG CAAAAGGTGA GTTCACCTCG GCTCTCGATT TGCTTCGGAA GGCAGTGGCC
ACCGACCCAG AAAACACGGA CGGCCAAATT GATCTCAGCA TCGCCTACTT CCGTGCCGGC
AATTACGAGG GCGCGAGGAC CGTGCTGCTA CCTCTTGTGA AGAGCGACCC GGGCAGTGCA
CGCGGTCGCA ACCTGATGGG CAAGATCCTT TTCATGCAAG GAGATTTTGA GGGTGCCTCA
ACCGAGTTGC AGGCAGCTCT TTCCATCACA CCTGACTTCG ATGTCGCCTA CAGTCTCGCA
CTCGCCTATC TTCAATTGAA GAAGTTGCCT CAAGTCACTC CGCTCGTCGA CGAGATGAAA
GCCTCTATGG CGAAGTCCCC GGAGCTTTAT ATGCTGCTCG GACAAGCCTA CCGGCAGACT
GGCTATTACG ACCAGGCGGT GAGCGAGTTC AAGACTGCGC TTGCGCTCGA TGCCGCGCGT
CCGCGCCTGC ATAACCTACT TGGAACGACT TATGTGGCGT TGGGGGGCAA GCAAAATTAC
GAACTCGCGC GCGCCGAATT CCAGCAGGAG CTTGCAAAGA ATCCGAAGGA CTATTCGAGC
CACTATTATC TTGGCCTGAT CGAGTTGGAA GACGGGCAGT ATGCGAAGGC CGAAGCCGCG
CTCAAAACCG CGCATGCCCT CGCGCCCGAC GACCCCGCGG CGATGCTCTT GCTCGGACGC
CTCTACGACC AGCAGAAGAA CTGGAATGCC GCGATCGAAG TGTTGCGTCA GGTGCTCGCG
CGTTCCTCAG CCCAAGGTGC ATCACCCGTG CAACTCGCGA CTACGCACGA AATGCTGTCG
AAGGCGTACG CAGGCGCGGG GCAGATTCCC GAAAGCGAAA AGGAACTCGC CGCCGCGAAT
GCGCTCAAAA GTCAAGACGC AAGCAAGAGC GCAACTAGTG ATCCGGCCGT GGCAATCCAA
CCGGAAAATA GTGGCAAAGA GCTTCGCGCA ATGCTGATGC AGGGATCGCC CAAGGCTCAG
CCATCTGACG CGTCTGAACA GAAGTACGTC GCCGATATCT CGAAACTCCT CGGAAATGCC
TATAACAATC TCGGGGTCAT CGACGCGCGT GCCGGCTCCT ACAAGCAGGC CGCCGACGAA
TTCAAAGAAG CTGCCAAATG GGACGATTCC ATTCCTCAAC TGGACCGTAA TAGGGGCCTT
GCTGCGTTCC GGGCCCAAGC ATACGCCGAT GCGATTCCGC CTCTTGAACG GCTGTTAAAA
AAATCTCCAT CTGACTCCAA TCTGCGCGAG TCTCTCGGTC TCAGCTACTA CATGACCGAC
AAATTCAAGG AGAGCGCGGC GACGCTTCGA CCGATTGTGG ACACAATGTC GAACAATCCC
GGCCTCCTGC TCTCCGCTGG CGTCGCGTTC GTAAAATCCG GAGATATCCC GACCGGTCAA
CGCTTGTTCA CTCGCGCCTT TGAGGTCGGC AAGGCGACGC CAGAAATTCA CTTGATTATC
GGTCAGGCAT ACGCGGAACA GTCCGATAAC GACGAAGCCC TCGCCGAATT CAAACAGGCT
CTTGAACTCA ACCCGAAGTT GCCGGACGCT CACTTCTACA TCGGAATGGT GCGATTTAAG
CGTGGTGAAT TCGACGACGC TGCCAAAGAA TTTCAGCAGG AGCTCGAGGT CAATCCTCAG
AGCGTTCAAG CGATGTACCA GTTGGCATAC ATCCGAATGC AGCAACACCA GGCGCCCGAA
GCCTCAAGTC TGCTTTCGGA AGTGATCAAG CAACAGCCGA ACAATTCAGA TGCCCACTAT
CAGCTCGGGA AAGCATTGTT GGAACAAGGT GATGCAGGCG GTGCAACGCG GGAACTTGAA
ACCTCGGTGA AGCTACATCC GACTGACTAT GCGTATTTCC AATTGAGTCA CGCGTACGCG
CGAACAGGTC GCGAGGCGGA TTCCAAGCAA GCGCTCGAGG AATTCGAAAA GCTGAAGCCT
AAACCGAAAA CACCGATGGG TCCCTGA
 
Protein sequence
MKLTFEIREW LFTGVLAGIA FVSVSSMFAQ TPQAATVQKD ASSPEAHFDS AQTFQIASDL 
TRAAAEYRKG ISVALERVGN LKVAKGEFTS ALDLLRKAVA TDPENTDGQI DLSIAYFRAG
NYEGARTVLL PLVKSDPGSA RGRNLMGKIL FMQGDFEGAS TELQAALSIT PDFDVAYSLA
LAYLQLKKLP QVTPLVDEMK ASMAKSPELY MLLGQAYRQT GYYDQAVSEF KTALALDAAR
PRLHNLLGTT YVALGGKQNY ELARAEFQQE LAKNPKDYSS HYYLGLIELE DGQYAKAEAA
LKTAHALAPD DPAAMLLLGR LYDQQKNWNA AIEVLRQVLA RSSAQGASPV QLATTHEMLS
KAYAGAGQIP ESEKELAAAN ALKSQDASKS ATSDPAVAIQ PENSGKELRA MLMQGSPKAQ
PSDASEQKYV ADISKLLGNA YNNLGVIDAR AGSYKQAADE FKEAAKWDDS IPQLDRNRGL
AAFRAQAYAD AIPPLERLLK KSPSDSNLRE SLGLSYYMTD KFKESAATLR PIVDTMSNNP
GLLLSAGVAF VKSGDIPTGQ RLFTRAFEVG KATPEIHLII GQAYAEQSDN DEALAEFKQA
LELNPKLPDA HFYIGMVRFK RGEFDDAAKE FQQELEVNPQ SVQAMYQLAY IRMQQHQAPE
ASSLLSEVIK QQPNNSDAHY QLGKALLEQG DAGGATRELE TSVKLHPTDY AYFQLSHAYA
RTGREADSKQ ALEEFEKLKP KPKTPMGP
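
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the method used to produce this record. It builds the codon table in the usual NCBI TCAG order; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard assignments are used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first, second and third base each cycle
# through T, C, A, G). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence above.
print(translate("ATGAAACTTA CATTCGAAAT TCGCGAATGG"))  # MKLTFEIREW
print(translate("AAACCGAAAA CACCGATGGG TCCCTGA"))     # KPKTPMGP
```

The two printed fragments match the start and end of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.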