Gene Acid345_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4247 
Symbol 
ID4073174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5036458 
End bp5039625 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content62% 
IMG OID637986279 
ProductTPR repeat-containing protein 
Protein accessionYP_593321 
Protein GI94971273 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGGC GGATCGGGCG AGGGGCATGG CTTGCGATGA TCGCAGCGTG TTTGCTGGCG 
ATCGCAGGGC GAAGTTTCGA GTCGCGCCTT CGGGGCAGTG CCCGAGCCGA CAACCTACTG
GAAAAAGCAT ACAGCGACCC AAGGACTCTC GAACTGCGGC TGCGAGGGGC GGCGTGGTCT
CTGCTTCGCG AGCGCGCAAG CGGGCGGATC CGTGCAGAGG CGCGTTCCGC GGACCTGCTT
CGAGCGGAAG CGGAGGTGGC GCGGCTCTAC CAAGCGAATC CGGTTGGGAC GCTGGAGTTG
CGTGATCGCG GCCGCGCCAA CCTATTGGAG TGGAGCTTTG ACGAAGCTTT GGGCGACTTC
CACCTGGCTC TCGCGCATGA ACCGGAATCG CCGGAGATCC TGAACGATCT GGCCACTGCA
TACTATGAAC GCGGAGAAGC GCAAGGGGAC TGGGAGTCGC TAGTAGACGC TTATGAGCTA
GAGAGCCGGG CGGTGCAGGC GCGTTCGGGA GATACATTGC TCCTGTTCAA CCGCGCGGTG
ATCGCGAAGC GGCTTGGCAT GTACGGCCAA AGCATGGAGG ACTGGAGACG CGTATCCGAA
TTGGAAAAGG AAAAGGGATG GGCCGAGGAA GCGCGTTCTA ACTTTCAACA GCTCGCGGCG
TTGCAGAAGA CGCGGCTGGA AAAGAATGGA GGACCGCTTC TGAGCGCGAG AGAGTTTGCC
GACCGGGTGC AGCCGGAGGA CCCCACGACT TGGAAAGAGG TGGAGCCGCG CGTGGAAGAG
TACGTTTCGG AGGCGACGCG GAGTTGGCTT CCGGCAGCGT TTCCGCGAAA AGGCGACGCG
GACGAATCAG CGAGGCGCGC GCTCAAGGCG CTGGCGATCG TGCTCGAACG CGGCCACGGC
GACCGTTGGT TGCGTGACCT TCTTTGGCAG CCCGGATCCG ATTCGCTGGC AGATGCGGTG
GCGTCGCTCT CGACCGCTGC CCAGGCAGAC TGGACGCAAC AGAACTACAG CCTAGGCCGC
GCGGCGGCGC AAGACGCCAG GCGAAGTTTC GCGAAGATGG GAAATCGCGC GGGTGAATTG
CGCGCTGCGT TTGAAGAGCT GTATGCGAGC GAGTTTGCCG ACATGGGAAC CGTCTGCTCG
CGGCAAGCGA GCCAGTTGCA ATCCGCCCTG CGCGAAGTTT CCTACCCCTG GTTGAGCGCG
CAGACGGCGC TTGAGCGCTA CAACTGCGAA CTGGAAACGG GAAACTTCGG AGCCTCCGAA
TTTCTCAGGC GCGCCCGCCA GATCTCGCAA GCCGCATCCT ATGCGGGCAT CTCTCTGCGC
GCGCTCAACT TTCTTGCCGC CGATCGCTTC GCTCGCGGAG ACCTGGCCGG AGGCTGGCAC
GCTTCTTCCG ACGGGATTCG CGAGTTCTGG GCGGGCTCGC AGGACCTCAC TTATGGGTAC
AACCTCTATA CCACCGTGGA ATTCGGAGTG GAGGTTCGAA ACTCCTGGTT CTCCGATGTC
GCGTATGGCG AGCAGGCGCT CTCGCTGGTG GAAGGAAACC AGAAGCCCTT CGCACGGGCC
GAAGAACATC TCGCGCTAGC AAAGGCCAGT TTGCTCGCGA AGAGCCCGGC AGTGTCGCTC
GAGCATCTTC GTGCGGCAGA GTCTCTTATT GCGGATGTGC CGCCATCCTC CGAAACCAGC
AACTTCCGCA TGGACATCGC GACGCAGAGT GCTTACCTGC AAGCGCTTAC CGGCTCCGCC
ACGGCGCAGA TCTTTCCGGC GCCGGCGGAG ATTTCGCAGG TCGAGAACGT GTACACCCTC
GGCAGCTACT ACACCACGCG CGGCAAGACC CTGGCCTTTG AAGGCAAAGT AGAGGAAGCC
AAGAGCGCTT ATCGAAGCGC GGTCGCCCTG GCAGAACACG CGCGGAGAAG CCTGTCTTCC
GACGCGGACC GGCTCGCCTG GCGCCATTCC TGGACCGAAC CGTATCTGTT GTGGATTGAT
CTCGAACTGA AGACCGGGAA CACGCAGAAG GCTCTCGCGA TTTGGGAGCT CTGCCGGAAC
TCCGATCCAG CGGTGCTCCC GCTCGCGAAT GGCAGCCGCG GCACGAAGCT GGATGCATCG
GTCATGGAGA ACTCGCTCGC CTCGGCACTT GCGGCGGAAG AAGCTCGGGA CGCCCAGGTG
CGACCGAACT TGAGGGATGA TGCGTTGCTG TTGTTCACCC GTCTGCCGGA CCGCATCGTG
GCCTGGGCGA TCACGGAACA AGGCATCGAG ACCTCGGTGA TACCCGCGGA CGCCTCGGAC
GTGGTGATGC AGGGGCGCTT GTTCCGGGAG TTGTGCGCCC GTCCGTCTTC TTCGATGGAG
CAAGTGAGCC TCCAAGGAAG ATCGCTGTAC TCGCAACTGA TTGCGCCGGT CGAAAGCCAG
CTTCGCAAGG CGAAAAACGT TTTGATCGAG AACGACGATT CGCTGGCGGG AATTCCATTC
CAGGCGCTGA TCGCTCCCGC AGGCAAGTAT TTCTCAGATG AACACGCAAT TCGCTATGTG
TCGGGCGCTC GCGACGTCGA ACGAGAATCA GCGTCGGGCG CAGTCGTCAC GCGCGAGACG
AAGATGCTGT TGGTCGCCAA CTCCGGGTCA AGCATGGACG GCGTCCAGCC GCTCGACGAT
GTGGTGGCGG AGGCGCGATC GGTGTCTTAC CTCTTCCCGC GCGCCGAAGT GCTGGTCGAG
CGACAGGCGA CTCTTTCCGC GGTCATGAAG AAGATGCCGC AGGCGGAATC AGTTTACTTC
GTAGGCCATG CAGTTTCGGA CGGAGAGCGC ACAGCATTGC TGCTCAGCTC GGAAAGCGGC
TCAAGCCAAC CGTCGCTGCT GACGAGCCAG TCCCTCGGCA ACAGCAAGCT GGGCAGCGTC
CGGCTTGCGG TGCTTGCCGC GTGCTCGACG CAAGGCGGCA CCGAGCGAAG CTCCGACGAG
GCCGACAGCC TGGTGCGCGC ACTTCTAGGG CGCGGCGTCC GGCATGTGGT GGCAAGCGGA
TGGGACGTGG ACTCGCAGGT CACTTCAAGA ATGATGGACG CTTTCTACAA GAACCTGCTC
CGCGGCGCCA CGGTCTCAGA GGCGCTGGCG GGGGCCGAAG CGGAGACCAG AAGGGCCACG
CAACACCCGT ATTACTGGGC CTCTTTCGAT GCGTTTGGAA ATAACTGA
 
Protein sequence
MKRRIGRGAW LAMIAACLLA IAGRSFESRL RGSARADNLL EKAYSDPRTL ELRLRGAAWS 
LLRERASGRI RAEARSADLL RAEAEVARLY QANPVGTLEL RDRGRANLLE WSFDEALGDF
HLALAHEPES PEILNDLATA YYERGEAQGD WESLVDAYEL ESRAVQARSG DTLLLFNRAV
IAKRLGMYGQ SMEDWRRVSE LEKEKGWAEE ARSNFQQLAA LQKTRLEKNG GPLLSAREFA
DRVQPEDPTT WKEVEPRVEE YVSEATRSWL PAAFPRKGDA DESARRALKA LAIVLERGHG
DRWLRDLLWQ PGSDSLADAV ASLSTAAQAD WTQQNYSLGR AAAQDARRSF AKMGNRAGEL
RAAFEELYAS EFADMGTVCS RQASQLQSAL REVSYPWLSA QTALERYNCE LETGNFGASE
FLRRARQISQ AASYAGISLR ALNFLAADRF ARGDLAGGWH ASSDGIREFW AGSQDLTYGY
NLYTTVEFGV EVRNSWFSDV AYGEQALSLV EGNQKPFARA EEHLALAKAS LLAKSPAVSL
EHLRAAESLI ADVPPSSETS NFRMDIATQS AYLQALTGSA TAQIFPAPAE ISQVENVYTL
GSYYTTRGKT LAFEGKVEEA KSAYRSAVAL AEHARRSLSS DADRLAWRHS WTEPYLLWID
LELKTGNTQK ALAIWELCRN SDPAVLPLAN GSRGTKLDAS VMENSLASAL AAEEARDAQV
RPNLRDDALL LFTRLPDRIV AWAITEQGIE TSVIPADASD VVMQGRLFRE LCARPSSSME
QVSLQGRSLY SQLIAPVESQ LRKAKNVLIE NDDSLAGIPF QALIAPAGKY FSDEHAIRYV
SGARDVERES ASGAVVTRET KMLLVANSGS SMDGVQPLDD VVAEARSVSY LFPRAEVLVE
RQATLSAVMK KMPQAESVYF VGHAVSDGER TALLLSSESG SSQPSLLTSQ SLGNSKLGSV
RLAVLAACST QGGTERSSDE ADSLVRALLG RGVRHVVASG WDVDSQVTSR MMDAFYKNLL
RGATVSEALA GAEAETRRAT QHPYYWASFD AFGNN