Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1683 |
Symbol | |
ID | 4069351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2037997 |
End bp | 2040825 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983691 |
Product | TPR repeat-containing protein |
Protein accession | YP_590758 |
Protein GI | 94968710 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.354598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAAC TCGGATTTGC CCTGCTTGCG TTCCTTGTTG TTTCGGCCAT TTACCTGTAC GCGTGGCCTG CGCCGAACCT CGTCTATCCC GCGGTGGTGC TTGCACATGC GGGATTTGGA GTGCTGGTCA CACTGCTCGG ACTCCGATAT ATCCCCAAAC TGCGGACGTC GGGACTCTTG GCTGGAAGCG CCGGCGCCTT GATGGGCGTG GGAGCCGTGA TCGGGATCGT GCTGATTTTC ATTGGCACCT CGCGACCACA ACGCTATGCG CTTTGGGCGC ACATCGCATT CTGCGTAGTG GCAGTAGTGC TGCTGGCGAC GTGGCTGGTG CGGCAGAGGC GCGCGAATTC GGGTGTCTCG AGTGGAACGC CGTGGGGCGC ATTTGCGGGA ATCGTCGCAC TGGTTGCGGT TGTGAGCGTT GCAGCTTGGT ATGCGCGTGG TTTCTGGAGC AAGCAATACG CCATCAAGAA CCCCAATGCC GCGCCACTGA CCATGGATGA TGAGGGCGAT GGCTCGACCG GCATGTTCTT TCCGAGTTCG GCGCAGGTTG CAGGGAAGCG CAAGATTCCG GCAAAGTTCT TCATGGAATC CGACGCGTGC GCGCGCTGCC ACCAGGACAT TTACAACCAG TGGCAGAAGT CGGCGCACCA TTTTTCGTCG TTCAATAACC AGTGGTATCG CAAGAGCATT GAGTACATGC AGGACGTGCG GGGGACGAAG CCGTCGAAAT GGTGCGGTGG CTGCCACGAT CCGGCGGTGC TCTACAGCGG CCTGATGGAT ACGCCGATTA AACAGATTAT TCATCGGCCG GAGTCGCAGG CAGGGCTGGG TTGCATGATG TGCCACTCGA TCGTGAAGGT GAAGAGCACG ATGGGGCAGG CCGACTTCAT GCTCGAGTAT CCGGAATTGC ATGAGCTGGC AGCAACGAAG AATCCGGTGA TGCGCGCGCT GCATGACTTC ACGATCAAGC TGAATCCCGA GCCGCATCGT CGAGTGTTTC TCAAGCCGTT TATGAAGGAA CAGACGGCGG AATTCTGCTC GTCGTGCCAC AAAGTGCACC TCGACGCCCC GGTGAACAAT TACCGCTGGA TCCGCGGTTT CAATGAGTAC GACAACTGGC AGGCGAGCGG CGTTTCGGGC TTCGGCGCGC GGTCGTTTTA CTATCCGCCG AAGTCGCAAC AGTGCGCGGA TTGCCATATG CCAGCGGCGA AGTCGAATGA CTTCGGAAAT ATTGATGGCT TTGTGCACTC GCACCAGTTC CCGGGCGCAA ATACCGCGCT GCCAACGGCG AACGAGGATC CCGCGCAACT CAAGTTGATC GAGAATTTCC TGAAGACGGC AGTCACCGTG GACATCTTCG CAATCTCGCA CGCGGCAACG CCAGTAGGCG GAACCGCGCA GGCGGAAAAT TCAACTTCGT TTGCCGTCGG CGAAGAAGCA GAAGTTAAGA CCACCGGTGA GAACACAACG GAAAGTGTAC CGGTCACAGC TCCGCTGGAT GAAACGAACC CGGTACTGCG CCGCGGCGAA AGCGTACGCA TGGACGTTGT GGTTCGCACC CGCAAGGTCG GGCACTTCTT CCCGGGCGGC ACGGTGGATG CCTTTGATAC CTGGCTTGAA TTGAAAGCCG TGGATGACAA GGGACAGCCA GTCTTCTGGA GCGGCAAGGT CGAAGACGAC GGCAAGGGGC CGGTGGAGAA GGGAGCGCAC TTCTATCGCT CGCTGCAGAT CGATGAACAC GGCAACGAGA TCAACAAGCG CAACGCGTGG TCTACGCGAT CGACAGTTTA TGTGCGGTTG ATCCCTCCAG GCGCGGCGGA TACGGTGCAC TTCCGTGTGA ATGTGCCCGA GAATGTCGGC GACAAGATCA AGTTCACGGC GCGGCTCTGC TACCGAAAGT TCTCGTGGTA CAACACGCAT TTTGCTTACG CGGGCCAATC GAAAGACCGA GTGGCTGAGC CTCTGACGAA GGCATCGTAC GATGATCGCG GGTTTACTTT CGATGGCGAT CTGGCGGACG TCTCTGGCAA GATGAAGAGC GTTCCTGATC TTCCGATTGA GGTGCTGGCG GAGAAGACGG TCGAGCTTCG AGTGGTTCCG AAGAACACTC CACTGGAAAC GCCGAAGACA GTCACGAAGA AGGACGAGTG GCAGCGCTGG AACGATTATG GCATCGGATT GTTACTACAA GGCGACTTGA AGGGCGCAGC GGCTGCGTTC GTGAAGGTGA CCGAAGCAGA TCCGAATAAT CCAGATGGTT GGGTGAATCT TGGTCGCGTC GCGGTGCAGG AAGGCGACAT GGAGCGGGCT CGCGAGGTAC TGACGAAGGC CCTGAAGATC AATGCGAATC TTGCGCGTGC ACGGTTCTTC TACGCACGCG TCCTGCGTTC GGATGGCAAT TACGATGGCG CGGCGCAGGA ATTGCAGGCT GTGCTGGCGC AGTATCCCAA GGACCGCGTG GTGCGCAATG ATCTCGGGCG CATCTACTTC CTGCAACGCA AGTACGATCA GGCGATCGCT GAATTGCAGC AGGTTATGGA AGTTGATCCA GAAGACCTGC AGGCAAATTA CAACCTGATG CTCTGCTATC GTGGGTTGGG GAAGACGGAA GTCGCTGCGG ACTACGAGAA GCGTTATCTA CGCTTCAAGG CGGACGAAGC ATCGCAGGCG ATCAGTGGCG AGTATCGGCG CAAGCATCCG GAAGATAACA ACGAGCGGCA GCAGATTCAT GAGCACGTGT CGGTGCCGCT TGGCCCAACC TCTAAGAGCG TGACGCCGGT GAAGACTGCG AAGACGACGC AAACACATGG CGCAAGCGCC GGGAAATAG
|
Protein sequence | MRKLGFALLA FLVVSAIYLY AWPAPNLVYP AVVLAHAGFG VLVTLLGLRY IPKLRTSGLL AGSAGALMGV GAVIGIVLIF IGTSRPQRYA LWAHIAFCVV AVVLLATWLV RQRRANSGVS SGTPWGAFAG IVALVAVVSV AAWYARGFWS KQYAIKNPNA APLTMDDEGD GSTGMFFPSS AQVAGKRKIP AKFFMESDAC ARCHQDIYNQ WQKSAHHFSS FNNQWYRKSI EYMQDVRGTK PSKWCGGCHD PAVLYSGLMD TPIKQIIHRP ESQAGLGCMM CHSIVKVKST MGQADFMLEY PELHELAATK NPVMRALHDF TIKLNPEPHR RVFLKPFMKE QTAEFCSSCH KVHLDAPVNN YRWIRGFNEY DNWQASGVSG FGARSFYYPP KSQQCADCHM PAAKSNDFGN IDGFVHSHQF PGANTALPTA NEDPAQLKLI ENFLKTAVTV DIFAISHAAT PVGGTAQAEN STSFAVGEEA EVKTTGENTT ESVPVTAPLD ETNPVLRRGE SVRMDVVVRT RKVGHFFPGG TVDAFDTWLE LKAVDDKGQP VFWSGKVEDD GKGPVEKGAH FYRSLQIDEH GNEINKRNAW STRSTVYVRL IPPGAADTVH FRVNVPENVG DKIKFTARLC YRKFSWYNTH FAYAGQSKDR VAEPLTKASY DDRGFTFDGD LADVSGKMKS VPDLPIEVLA EKTVELRVVP KNTPLETPKT VTKKDEWQRW NDYGIGLLLQ GDLKGAAAAF VKVTEADPNN PDGWVNLGRV AVQEGDMERA REVLTKALKI NANLARARFF YARVLRSDGN YDGAAQELQA VLAQYPKDRV VRNDLGRIYF LQRKYDQAIA ELQQVMEVDP EDLQANYNLM LCYRGLGKTE VAADYEKRYL RFKADEASQA ISGEYRRKHP EDNNERQQIH EHVSVPLGPT SKSVTPVKTA KTTQTHGASA GK
|
| |