Gene Acid345_1683 details


Gene Information

Locus tag: Acid345_1683
Symbol:
ID: 4069351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2037997
End bp: 2040825
Gene Length: 2829 bp
Protein Length: 942 aa
Translation table: 11
GC content: 58%
IMG OID: 637983691
Product: TPR repeat-containing protein
Protein accession: YP_590758
Protein GI: 94968710
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID:
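
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2037997, 2040825)
print(length)                  # 2829
print(protein_length(length))  # 942
```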


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.354598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAAAC TCGGATTTGC CCTGCTTGCG TTCCTTGTTG TTTCGGCCAT TTACCTGTAC 
GCGTGGCCTG CGCCGAACCT CGTCTATCCC GCGGTGGTGC TTGCACATGC GGGATTTGGA
GTGCTGGTCA CACTGCTCGG ACTCCGATAT ATCCCCAAAC TGCGGACGTC GGGACTCTTG
GCTGGAAGCG CCGGCGCCTT GATGGGCGTG GGAGCCGTGA TCGGGATCGT GCTGATTTTC
ATTGGCACCT CGCGACCACA ACGCTATGCG CTTTGGGCGC ACATCGCATT CTGCGTAGTG
GCAGTAGTGC TGCTGGCGAC GTGGCTGGTG CGGCAGAGGC GCGCGAATTC GGGTGTCTCG
AGTGGAACGC CGTGGGGCGC ATTTGCGGGA ATCGTCGCAC TGGTTGCGGT TGTGAGCGTT
GCAGCTTGGT ATGCGCGTGG TTTCTGGAGC AAGCAATACG CCATCAAGAA CCCCAATGCC
GCGCCACTGA CCATGGATGA TGAGGGCGAT GGCTCGACCG GCATGTTCTT TCCGAGTTCG
GCGCAGGTTG CAGGGAAGCG CAAGATTCCG GCAAAGTTCT TCATGGAATC CGACGCGTGC
GCGCGCTGCC ACCAGGACAT TTACAACCAG TGGCAGAAGT CGGCGCACCA TTTTTCGTCG
TTCAATAACC AGTGGTATCG CAAGAGCATT GAGTACATGC AGGACGTGCG GGGGACGAAG
CCGTCGAAAT GGTGCGGTGG CTGCCACGAT CCGGCGGTGC TCTACAGCGG CCTGATGGAT
ACGCCGATTA AACAGATTAT TCATCGGCCG GAGTCGCAGG CAGGGCTGGG TTGCATGATG
TGCCACTCGA TCGTGAAGGT GAAGAGCACG ATGGGGCAGG CCGACTTCAT GCTCGAGTAT
CCGGAATTGC ATGAGCTGGC AGCAACGAAG AATCCGGTGA TGCGCGCGCT GCATGACTTC
ACGATCAAGC TGAATCCCGA GCCGCATCGT CGAGTGTTTC TCAAGCCGTT TATGAAGGAA
CAGACGGCGG AATTCTGCTC GTCGTGCCAC AAAGTGCACC TCGACGCCCC GGTGAACAAT
TACCGCTGGA TCCGCGGTTT CAATGAGTAC GACAACTGGC AGGCGAGCGG CGTTTCGGGC
TTCGGCGCGC GGTCGTTTTA CTATCCGCCG AAGTCGCAAC AGTGCGCGGA TTGCCATATG
CCAGCGGCGA AGTCGAATGA CTTCGGAAAT ATTGATGGCT TTGTGCACTC GCACCAGTTC
CCGGGCGCAA ATACCGCGCT GCCAACGGCG AACGAGGATC CCGCGCAACT CAAGTTGATC
GAGAATTTCC TGAAGACGGC AGTCACCGTG GACATCTTCG CAATCTCGCA CGCGGCAACG
CCAGTAGGCG GAACCGCGCA GGCGGAAAAT TCAACTTCGT TTGCCGTCGG CGAAGAAGCA
GAAGTTAAGA CCACCGGTGA GAACACAACG GAAAGTGTAC CGGTCACAGC TCCGCTGGAT
GAAACGAACC CGGTACTGCG CCGCGGCGAA AGCGTACGCA TGGACGTTGT GGTTCGCACC
CGCAAGGTCG GGCACTTCTT CCCGGGCGGC ACGGTGGATG CCTTTGATAC CTGGCTTGAA
TTGAAAGCCG TGGATGACAA GGGACAGCCA GTCTTCTGGA GCGGCAAGGT CGAAGACGAC
GGCAAGGGGC CGGTGGAGAA GGGAGCGCAC TTCTATCGCT CGCTGCAGAT CGATGAACAC
GGCAACGAGA TCAACAAGCG CAACGCGTGG TCTACGCGAT CGACAGTTTA TGTGCGGTTG
ATCCCTCCAG GCGCGGCGGA TACGGTGCAC TTCCGTGTGA ATGTGCCCGA GAATGTCGGC
GACAAGATCA AGTTCACGGC GCGGCTCTGC TACCGAAAGT TCTCGTGGTA CAACACGCAT
TTTGCTTACG CGGGCCAATC GAAAGACCGA GTGGCTGAGC CTCTGACGAA GGCATCGTAC
GATGATCGCG GGTTTACTTT CGATGGCGAT CTGGCGGACG TCTCTGGCAA GATGAAGAGC
GTTCCTGATC TTCCGATTGA GGTGCTGGCG GAGAAGACGG TCGAGCTTCG AGTGGTTCCG
AAGAACACTC CACTGGAAAC GCCGAAGACA GTCACGAAGA AGGACGAGTG GCAGCGCTGG
AACGATTATG GCATCGGATT GTTACTACAA GGCGACTTGA AGGGCGCAGC GGCTGCGTTC
GTGAAGGTGA CCGAAGCAGA TCCGAATAAT CCAGATGGTT GGGTGAATCT TGGTCGCGTC
GCGGTGCAGG AAGGCGACAT GGAGCGGGCT CGCGAGGTAC TGACGAAGGC CCTGAAGATC
AATGCGAATC TTGCGCGTGC ACGGTTCTTC TACGCACGCG TCCTGCGTTC GGATGGCAAT
TACGATGGCG CGGCGCAGGA ATTGCAGGCT GTGCTGGCGC AGTATCCCAA GGACCGCGTG
GTGCGCAATG ATCTCGGGCG CATCTACTTC CTGCAACGCA AGTACGATCA GGCGATCGCT
GAATTGCAGC AGGTTATGGA AGTTGATCCA GAAGACCTGC AGGCAAATTA CAACCTGATG
CTCTGCTATC GTGGGTTGGG GAAGACGGAA GTCGCTGCGG ACTACGAGAA GCGTTATCTA
CGCTTCAAGG CGGACGAAGC ATCGCAGGCG ATCAGTGGCG AGTATCGGCG CAAGCATCCG
GAAGATAACA ACGAGCGGCA GCAGATTCAT GAGCACGTGT CGGTGCCGCT TGGCCCAACC
TCTAAGAGCG TGACGCCGGT GAAGACTGCG AAGACGACGC AAACACATGG CGCAAGCGCC
GGGAAATAG
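
The GC content reported for this gene can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing used here; `gc_content` is a hypothetical helper, and only the first row of the sequence is shown as sample input, so the printed value is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste the full block for the gene-wide value.
first_row = "ATGCGGAAAC TCGGATTTGC CCTGCTTGCG TTCCTTGTTG TTTCGGCCAT TTACCTGTAC"
print(f"{gc_content(first_row):.1f}%")
```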
 
Protein sequence
MRKLGFALLA FLVVSAIYLY AWPAPNLVYP AVVLAHAGFG VLVTLLGLRY IPKLRTSGLL 
AGSAGALMGV GAVIGIVLIF IGTSRPQRYA LWAHIAFCVV AVVLLATWLV RQRRANSGVS
SGTPWGAFAG IVALVAVVSV AAWYARGFWS KQYAIKNPNA APLTMDDEGD GSTGMFFPSS
AQVAGKRKIP AKFFMESDAC ARCHQDIYNQ WQKSAHHFSS FNNQWYRKSI EYMQDVRGTK
PSKWCGGCHD PAVLYSGLMD TPIKQIIHRP ESQAGLGCMM CHSIVKVKST MGQADFMLEY
PELHELAATK NPVMRALHDF TIKLNPEPHR RVFLKPFMKE QTAEFCSSCH KVHLDAPVNN
YRWIRGFNEY DNWQASGVSG FGARSFYYPP KSQQCADCHM PAAKSNDFGN IDGFVHSHQF
PGANTALPTA NEDPAQLKLI ENFLKTAVTV DIFAISHAAT PVGGTAQAEN STSFAVGEEA
EVKTTGENTT ESVPVTAPLD ETNPVLRRGE SVRMDVVVRT RKVGHFFPGG TVDAFDTWLE
LKAVDDKGQP VFWSGKVEDD GKGPVEKGAH FYRSLQIDEH GNEINKRNAW STRSTVYVRL
IPPGAADTVH FRVNVPENVG DKIKFTARLC YRKFSWYNTH FAYAGQSKDR VAEPLTKASY
DDRGFTFDGD LADVSGKMKS VPDLPIEVLA EKTVELRVVP KNTPLETPKT VTKKDEWQRW
NDYGIGLLLQ GDLKGAAAAF VKVTEADPNN PDGWVNLGRV AVQEGDMERA REVLTKALKI
NANLARARFF YARVLRSDGN YDGAAQELQA VLAQYPKDRV VRNDLGRIYF LQRKYDQAIA
ELQQVMEVDP EDLQANYNLM LCYRGLGKTE VAADYEKRYL RFKADEASQA ISGEYRRKHP
EDNNERQQIH EHVSVPLGPT SKSVTPVKTA KTTQTHGASA GK
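
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that is not handled here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard amino acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCGGAAAC TCGGATTTGC CCTGCTTGCG"))  # MRKLGFALLA
print(translate("GGGAAATAG"))                         # GK
```

The two outputs match the first ten and last two residues of the protein sequence listed above.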