Gene Acid345_4136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4136 
Symbol 
ID4072327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4895801 
End bp4897618 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content55% 
IMG OID637986167 
Productkelch repeat-containing protein 
Protein accessionYP_593210 
Protein GI94971162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.378672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.701449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCGA AATTCCTTCG GACACTATCC GTTGCATCTG TTATCACGGG TTGGATGTTT 
TGGACGGTGT CGCTGTATGG TGCGGCCAAC TTCACAACCA CCGGCAGCAT GTTGACGGCT
CGGAATCAGT TCACCGCGAC GCTGTTGAAC AATGGTAAGG TCCTCATTGT CGGCGGTGCC
GGATCCACCC ATATTCTTGC TAGCGCAGAA CTGTATGATC CGGCAAGTGG GACATTCAGT
ACAACGGGTA GTCTAGCTAC CAGTAGATTC CTCCACACTG CCACGTTGCT CGCAAACGGA
AAAGTCCTCG TAACCGGCGG CTACACCGAC AATCCGCCGC CAGGATACGG AACAGCGTGG
TTGGGGACAG CCGAACTTTA CGATCCGGCT TCCGGAACGT GGACATCCAC TGGCAGTATG
GCTGCGAAGC GTTATGATCA CACGGCTACA CGCCTCGCTG ACGGTCGTGT GCTTGTAGCT
GGAGGATGGG GAGCGGGAAT CGTCGGTTCG GCGGAAATTT ATAATCCCAA CACGGGAACT
TTCTCGGCGA CCGGAGGTTT GATCACAGCG CGTTACATGC ATACTGCCAC GCGATTGAAT
AACGGGACAG TCCTTGTCGC CGGCGGGGGG TGCTCAGGGA CTTGTCCGGG TGGCGGCTCC
ACTCTTTCCA ACGCTGAAAT TTACAGCCCA ACCACGGGAA GCTTTAGTGC AACTGGTTCC
ATGACTCAGG GTGTCGCAGG ACATGCCGCG ACACTGATGT TCGACGGACG GGTCGTCCTC
GCGGGTGGTA CTTGGGCTAA TGTTTACAAC CCGCAGGCCG GAACATTCAC CGCAACGGCG
GGAACGATGA CGTACTCTCA CGGCTCGCCC TCGCTCGTTG TTCTGGGAAA CGGAACCGTT
CTCGTCGCGG GTGGAAATAC CTCGATCGCG GTCGCAGAGA TATACAATCC TTCCACTTCG
ACCTTCTCAA CCACGGCGTC GATGGCTACG GCCCGTAGCC TTCAATCTTC GGTTCTGTTA
AACAACGGGT CGGTTTTGGT CGCAGGAGGA AACAGCAGCT CAAGTCTCTT GTCGAGCGCT
GAATTGTACC AACCGGCGAC CTACCTGCCC TCACCACCTT CGGGCGCTAC GGCATTTAAC
AATTTGCAAA GTGCCGGAGG AAATCCCGGG GTTTGGACAA AATGCGAGGG GGCGTGCGCA
GGTTCGGGTG GAAGCAGCGG TTCAGGGTCA CTAACCCTTG GGGTTGCTTC ACCGTCACTT
TCAGGTGCTT CCATGAAGCA AGTCGACAAC GGAGCGAGTT GGAACGTTTT GTACTACAGG
CACTTGGGTT GTGCCACATT GCCCGGCGGC GTTTGTACCG GCGTGTCCAA TTTTCTGGAC
GACCTCTGGT TTTACATCCC GTCCACCACG ACTCAATTGC AAGCTCTTGA GTTCGATCCC
GATCTTTACC TCGGAGGTTA CGATTACTTT GCGTCGATGC AATGTGATAA CGCTTCTCAC
ACGTGGCGCA GATGGCACGA AGATCATGAT GCGAACCACG GGTGGGTCCC GAGTTCAATT
CCTTGCACAA TTCTGTCGTC GGTGAACACC TGGCACCACC TGCAATTGTT CGTCACGATG
GATACAACAA ACCGCATTTA CGCCTACCAG ACATTTCTCG TGGATGGTGT ACCAATCTAT
TCAGCGGCGC AAGATACTTA TAACCCCTAT TTCGACGGCT CCGGCAACAA TCTGAACATC
CAACAGCAGA TCGACAACAA CTCAAGCGCC ACAAGCAATA CCGTCTACTA CGATAAGTAC
AACCTCACGG TTTGGTAG
 
Protein sequence
MFAKFLRTLS VASVITGWMF WTVSLYGAAN FTTTGSMLTA RNQFTATLLN NGKVLIVGGA 
GSTHILASAE LYDPASGTFS TTGSLATSRF LHTATLLANG KVLVTGGYTD NPPPGYGTAW
LGTAELYDPA SGTWTSTGSM AAKRYDHTAT RLADGRVLVA GGWGAGIVGS AEIYNPNTGT
FSATGGLITA RYMHTATRLN NGTVLVAGGG CSGTCPGGGS TLSNAEIYSP TTGSFSATGS
MTQGVAGHAA TLMFDGRVVL AGGTWANVYN PQAGTFTATA GTMTYSHGSP SLVVLGNGTV
LVAGGNTSIA VAEIYNPSTS TFSTTASMAT ARSLQSSVLL NNGSVLVAGG NSSSSLLSSA
ELYQPATYLP SPPSGATAFN NLQSAGGNPG VWTKCEGACA GSGGSSGSGS LTLGVASPSL
SGASMKQVDN GASWNVLYYR HLGCATLPGG VCTGVSNFLD DLWFYIPSTT TQLQALEFDP
DLYLGGYDYF ASMQCDNASH TWRRWHEDHD ANHGWVPSSI PCTILSSVNT WHHLQLFVTM
DTTNRIYAYQ TFLVDGVPIY SAAQDTYNPY FDGSGNNLNI QQQIDNNSSA TSNTVYYDKY
NLTVW