Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1566 |
Symbol | |
ID | 4068675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1913513 |
End bp | 1915729 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983575 |
Product | TPR repeat-containing protein |
Protein accession | YP_590642 |
Protein GI | 94968594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.411848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00925631 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCACTC GTTCGGCCCT GATCGTAGCT CTGCTCGCCC TCAGCCTTTC TCTCGGCTGC ACCGCTAACA AGCAATTCCA GCGCGCCTCT GCACTTCAAA AAGCGGGCAA GACGCAGGAA GCCCTCGATA TCTACGAGAA CCTCGTAGTC CGGACGCGCA GCCATAAGGC GCAATCGCAA TTGTTCGTCC GAATCGGCGA GTGCGAATGG ACGCTTGAGC AGCAGGGTCC ATCCCTCAAT GCTTTCTTAA AGGCCGCGGA ACTCGATCCC GCAAACAGTT CTGCGCATCT CCATCTCGCA CAACTCTTCC TCGCCGCTGG GGCTCCTGAC AAAGCGCTCA TCTTCGCCCA AATCGTCCTG TCGCATAATC CCAACGATCT TGATGCCATG GCTGCTGAAG CCAGTGCCTA CGCCTTCCAG GGCAACATCC CCGCCGCGAC GAAACGCTTT CAGGATGTTC TCGACCGTGA TCCCGCCCGA GAAGATGCTG CGGTAACGCT CTCGCAGATT TATTCCGCCA GTGGCCGCAT CGATCAAGCA CGCCACGTGC TCGAAACTGC CGCTGCGAAG GCCCCCAAAA GTTCCGTCAT CCAACTCGCA CTCGCGCACT TCGAAGAAGA GCAGGGTCGA CTCCCTGCCG CCGAAGCCGC CTATCGCAAA GCCGTGACGC TTCAGGATGA CGGCCCCACG AATCTTAAGC TTGCCCAATT CCTCGAACGC AGCGCCCGTG TCCCGGAAGC CGAAACCGTG CTTCGTCGAG TGGATGGGCT CACTCCCGCG AAGCCCTATG CCCTCGCCGA TTTCCAGTTG ATTTCAGGAC GCGACGGTGC CGCTTCGCAG CAGTACTTAA AGCTTCTCCT GAATCGGGAT AACAAGCGCG ACGGGAACAC CGCCACTCCG ATCGCCGCCC GCGCGATCGA AGCCAAACTC GCCGTTGCCA ACGGTCAGAG TGGCAGCAAG CGCACCCAGT CGCTTCTGGA AGCCAAGAGT GCCCTCGGCA TCCATCGCGC AGAATTTGGC GAAGAAACCA CGGCAGTCCT GGCTGCTGAA ATCGCTCTGG CTGAGGGTGA TTCAGCCACC GCAGCCGCCC TTGCGCGATC GGTGGTTGAC GAGCACGCCG ACAACAACTC CGCCCACTAC GTTCTCGGTC TCGCACTCTC CCGTATGGGA AAGAACGCAG AAGCCCGCGC CGAGTGGCAA ACCATCCTCG ACAATGACAC GACCTCAGTG CCGGCCCGGC TCTCTCTCGC ACAGCTTTCG CTCTCGGAAG GCAATATCGC CGATGCCGAG CAGATGGTTG TTCCCGTCGT TCGTCAGGAA CCGGCAAACC TCGGTGCTCT GGAATTGTTC GGCCGAGTGT TGATCGCCGA GAAAGATTTC GGTGCCGCGA ACAGCATTGC CGTGCGTTAT CAGCAAATCG ACAAGACAAG TCCTGTAGCG CACCTTCTTA AAGGCGACGC CGCTCTCGCC CAGCACCACC TCGCCTATGC GCTGATTGAA TATGAGCAAG CTGTTCTGCT CGATCCGAAC TCAACCGCCG CTCAGGAAGG GCTCGTTCGC GTCTATCGCT CCGGAACCAT CACCAAGCCA ATGCTGCAGC GCATGGAGAT GAGCGCAGCC GCCCCTCCGA AATCCGCATC ACTCATGGAA CTCGCCGGTC GGCTCTACTC AGAACATCAT TGGAACGATG ACGCTGCCCG CTGTTTCCGC GCCGCTTTAG CTATGGAGCC CCAACGCAGC AGCTCCGCCG TGGAACTCGC CAAGCTCCAG GCGCAGGATG GTAGTTCGAC GGATGCAGCG AGCGCCGCCG CGGCCATCAG CGCATCGAAT TCACTGCTCA TTCGCGGACT CGGTGCCCAG GATCGCAGCG ACCTGAATGC AGCTATTCGC AACTACGAAG CCGCCCTGAG CAAAGGCGAG AATACCGGTG TCGCCGCCAA CAACCTCGCG TGGCTTTATG CCGAGCAAGG CAGCAATCTC GACCGCGCCC TGGAACTCGC CCAGCGTGCG CGCGAGGCAA ATCCCGTGGA TCCGGCAGTC ACCGACACTC TCGGTTTCGT GCTGCTGAAA CGCCGCGAGT ACTCCATGGC CCTGATGGCC TTGAAAGAAG CAGATCAGTT GATGCGCGTC CAAAAGAACT CCGACGTCCA ACTCGCGCAA GCAATCCGGC AGCACATCCT GGAAGCATCG CGCCAGTCGG GAGCAACCAC ACCCTGA
|
Protein sequence | MPTRSALIVA LLALSLSLGC TANKQFQRAS ALQKAGKTQE ALDIYENLVV RTRSHKAQSQ LFVRIGECEW TLEQQGPSLN AFLKAAELDP ANSSAHLHLA QLFLAAGAPD KALIFAQIVL SHNPNDLDAM AAEASAYAFQ GNIPAATKRF QDVLDRDPAR EDAAVTLSQI YSASGRIDQA RHVLETAAAK APKSSVIQLA LAHFEEEQGR LPAAEAAYRK AVTLQDDGPT NLKLAQFLER SARVPEAETV LRRVDGLTPA KPYALADFQL ISGRDGAASQ QYLKLLLNRD NKRDGNTATP IAARAIEAKL AVANGQSGSK RTQSLLEAKS ALGIHRAEFG EETTAVLAAE IALAEGDSAT AAALARSVVD EHADNNSAHY VLGLALSRMG KNAEARAEWQ TILDNDTTSV PARLSLAQLS LSEGNIADAE QMVVPVVRQE PANLGALELF GRVLIAEKDF GAANSIAVRY QQIDKTSPVA HLLKGDAALA QHHLAYALIE YEQAVLLDPN STAAQEGLVR VYRSGTITKP MLQRMEMSAA APPKSASLME LAGRLYSEHH WNDDAARCFR AALAMEPQRS SSAVELAKLQ AQDGSSTDAA SAAAAISASN SLLIRGLGAQ DRSDLNAAIR NYEAALSKGE NTGVAANNLA WLYAEQGSNL DRALELAQRA REANPVDPAV TDTLGFVLLK RREYSMALMA LKEADQLMRV QKNSDVQLAQ AIRQHILEAS RQSGATTP
|
| |