Gene Acid345_2664 details


Gene Information

Locus tag: Acid345_2664
Symbol:
ID: 4071918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3138536
End bp: 3140149
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 63%
IMG OID: 637984681
Product: TPR repeat-containing protein
Protein accession: YP_591739
Protein GI: 94969691
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
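
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following Python sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3138536
end_bp = 3140149

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.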


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGTGG GCCAAATATT TCCGCGTTCT CAGGCGGCGC TACTCTTCCT CGTCGTCCTT 
CTCGTTGCGT CTTCCGTCGC CCAGTCCCCC TGCGCCAAGT GCCACGCCGA CATCGTCCGC
TCCTACAGCA CCACTGCCAT GGCCAATGCC AGCGGCCCGG CCTCGCTGAA TCCGCTCACC
GGCGCATTCC ATCACGAACC GTCCGACGTA AATTACAAAA TCGAATTGCG CGATGGCCAT
CTCTTCCTCA CCTACGCACG CACCAACGAT GTCCACGGCC AGCGCGAGTT GCTCTACTAC
ATCGGCCAGG GCCGCCGCGG ACGCACCTAT CTCTTCGCCG ATGACGGCTT CCTCTTCGAA
TCTCCCGTCA ACTGGTACGC CGACGAAAAG AAGTGGGACG TCGCCCCCGG CTACACCGCG
TCGCGCGAAA TCCCCATGAA CCTCCCCGCG CTGCCGAGTT GTCTCGAGTG CCACACCAGC
AACTTCCGCC CGCCCATCGC CGGCACCGAG AACCGCTACA CGCTCCCGGT CTTCACCGCG
ACCGGGATCA CCTGCGAGCG CTGCCACGGC CCGTCTGACG CACACGCGAA CGGCAAGGGT
GCGATCCTAA ATCCCGCCAA ACTTCCCGCC GATCGACGCG ACCAGATCTG CATGCAATGC
CATCTCGAAG GCGACGCCGC CATCGAGCGA CCCGGCAAAC ATCTCTACGA CTTCCGGCCC
GGCGACGACC TTTCCCAATT CGTCCGCTAC TACGTTATGG CTGATGAGCA CGGCTCTTCC
CTGCGCGCCG CCAGCCAATT CGAAGCCCTC GCGCAAAGCA CATGCAAAAA GAAATCGGGC
GACAAGCTCT GGTGCGGAAC CTGCCACGAT CCGCACCGAA CCATCGCCCC CGAAGAACGC
GTCTCGTTCT ACCGCTCAAA ATGTCTGAGC TGTCATAGTG CAGAATTTAG TGCCAAGCAC
CACACTGAGA ATCCCGACTG CACCGCCTGC CACATGCCCG CGTCGCAGAG CAAAGACGTC
GCCCACACCG AGGTCACAGA CCACCGCATC CTGCGCCGTC CCGCAGCGCC GCCGGCGCCG
CCAACTTCGC TTCCGAAACT CGTTCCGTTC CCCTACTCCG CGGAAGCCGA CAACGATGCT
CGCGACAAAG GCCTCGCGTG GCAGGCGATC GTCAACTCCG GCATGACCGA CGCGCAGCCC
GAAGCCGAAC GCTGGCTCCG CAAAGCATCG GAATACGACA AGAACGATCC CGCCGTCCTC
TCCGCGCTCG CCTTCCTCTC CCAGAAGCGT GGCGACACCG AGACTGCCCG CGCCCTCTAC
ACCAGCGCGC TCTCGTTGAA CCCGAGTGAA CTCGACGCCG AAAACAATCT CGCCATCCTC
GAAGCCCGCG CTGGCCACAC CCGCCGCGCC GTCGAACTCT GGGAAGACGC CTTCCGCCGC
GCCCCCGGCC ACAGCGGCAT CGGCATGAAC CTGGCGCTAG CCTTCTGCAG CACAGGCCAA
TACGACGCAG CGCGTCAGTT CACCACACGT GTTTTGGAAT TCAATCCAGA CCTGCCGGCA
GCCAGGCACC TGTTGAGTGG CCTCAACGCC TCGCCGCCAA AATGTGCTCC GTGA
 
Protein sequence
MKVGQIFPRS QAALLFLVVL LVASSVAQSP CAKCHADIVR SYSTTAMANA SGPASLNPLT 
GAFHHEPSDV NYKIELRDGH LFLTYARTND VHGQRELLYY IGQGRRGRTY LFADDGFLFE
SPVNWYADEK KWDVAPGYTA SREIPMNLPA LPSCLECHTS NFRPPIAGTE NRYTLPVFTA
TGITCERCHG PSDAHANGKG AILNPAKLPA DRRDQICMQC HLEGDAAIER PGKHLYDFRP
GDDLSQFVRY YVMADEHGSS LRAASQFEAL AQSTCKKKSG DKLWCGTCHD PHRTIAPEER
VSFYRSKCLS CHSAEFSAKH HTENPDCTAC HMPASQSKDV AHTEVTDHRI LRRPAAPPAP
PTSLPKLVPF PYSAEADNDA RDKGLAWQAI VNSGMTDAQP EAERWLRKAS EYDKNDPAVL
SALAFLSQKR GDTETARALY TSALSLNPSE LDAENNLAIL EARAGHTRRA VELWEDAFRR
APGHSGIGMN LALAFCSTGQ YDAARQFTTR VLEFNPDLPA ARHLLSGLNA SPPKCAP
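
The sequences above are printed in blocks of 10 residues, so the whitespace has to be removed before any analysis. A minimal Python sketch using only the standard library follows; the helper names `clean` and `gc_percent` are hypothetical, and the sample is only the first line of the gene sequence, not the full gene. Under translation table 11, GTG can act as a start codon and is then read as methionine, which is consistent with the leading M of the protein sequence.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Sample: first line of the gene sequence only (not the full gene).
first_line = "GTGAAGGTGG GCCAAATATT TCCGCGTTCT CAGGCGGCGC TACTCTTCCT CGTCGTCCTT"
dna = clean(first_line)

print(len(dna))                   # number of bases in the sample
print(dna[:3])                    # first codon of the gene
print(round(gc_percent(dna), 1))  # GC percentage of the sample only
```

Passing the full gene sequence through `clean` and `gc_percent` should give a figure comparable to the GC content field in the record, although that has not been verified here.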