Gene Information |
Locus tag | Acid345_0736 |
Symbol | |
ID | 4069078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 904350 |
End bp | 906008 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982742 |
Product | TPR repeat-containing protein |
Protein accession | YP_589815 |
Protein GI | 94967767 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.325309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.93082 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCA CAGCGCAAGC CGTTGTAACC GCGCCCATCA CCGCCAAAAC TTGGGTGATG CGGATCTTTT GCGCGGTGTT CGGAGCTGCG CTGATTCTCG GGTGCCTGGA AGGCGCTCTG CGGGTTTTCG ACGTTGGATT TCCGACTTCG TTGACCGTTC CCTGCACCAT GGAACAGCAA CCGGCGGCTT GCTACAACCT GTTCTTCACG GCGCCGTATT TTCCTCCAGG CATCATTCAC ACGCCGACTC TATTCGCGGT TCCCGCAGTG AAAGCGCAGA ACACGTACCG AATTTTCGTG CTCGGCGAAT CGGCGGCAAT GGGAGACCCT GATCCGGCGT ATGGGTTCAG CCGCTATCTC GAAGTCATGT TGCGCAATCG GTATCCGCAG ATGAGGTTCG AGGTAATGAA CACGGGCACC GTTGCCATCA ATTCGCATGT TGGATTGCCG ATCGCGCGCG AGATTGCGAA GCTCAAACCC GACGTGGTGA TCATCTACTC GGGGAACAAT GAGGTCGTGG GGCCGTATGG CGCGGGGACG GCGTTTGCGG CGTCTGCCAT GGAATTACCG GCGATTCGAA GCAGCATCTG GTACCACACC ACGCGCACGG GACAACTGCT GACCAAGCTC GGGATGCAGA AGTTGGAATG GCGCGGCATG GAGATGTTCC TCGACAAGCA GGTGCCGCAG TCGTCGCCTC TAATGCCTTA TGTTTACGCG AACTTCGAAG CCAACCTACG CGACACGATC GGGGTCTTGC GCGGAGCGGG AGCGACAGCG ATTGTCTCAA CCGTAGCGAC GAACTTGCGC GACTGCGCAC CTTTCTCTTC GCTGCATCGG GCAGGGTTGA GCAAAGAAGC GTTGCAGCGG TGGGACACTC TGGTGAATGA GGGCGCGAAG TTGGAAGAGG CCGGTGCTCA TTCTGAAGCA CTGAAGCTCT ACGCGCAAAC TCTTGCGATC GACGACGAGT ATGCGGAATT AGAGTTTCGA ATTGCGAGAG TGCAACTGGC GCTCGGCAAG CGCGAAGAGG CGCTCAAGCA CTTCGAACGC GCCCGCGACC TTGATACCCT GCGCTTCCGC GCCGATAGCC GAATTAACGC GATCAATCGC AGCACGGCGG AATCCGGCGG CGCGGAGTTG GTGGACGCAG AACAACTTCT GTATGCGAAC GCCGTTGATG GCATCACCGG AGGCGATCTC ATCTATGAAC ACGTTCATTT GACGCCGACA GGAAATTACC TGCTCGCGCG CGCGATGTTT CTGAAGATTG CCGGCAAGCT ATCGCCGACG GCGGGCGAAG CCGACGTGCC GTCAGAGTCT GAATGCGAGG AATGGCTCGC GCTTACCGGA CACGATCGGA TCCGAATCGC GCACGAGATG GCGGAACGGT TGCAGAAGCC GCCGTTCACG AACCAATCGA ACCACTCCGA GCAGCTGCTT CGAATTTCAA TGCAGGCGCA GCAGGCTGAC GAGAGCCCGC AGGACACGGC AGCGCAATAT CAACGGGCGC TGCAACAGGC GCCGAATGAC CATCTTCTTC ACTATGGCTT TGGGCGCTTT CTCTTCCGCT ACAATCCCGA CGCTGGCGCG AACGAACTGC GGCAATCGCG GCCGTGGGAC GGCTTTCCGG TCTTCGCGCC TAACGGTCAG ATATTTTAG
|
Protein sequence | MSATAQAVVT APITAKTWVM RIFCAVFGAA LILGCLEGAL RVFDVGFPTS LTVPCTMEQQ PAACYNLFFT APYFPPGIIH TPTLFAVPAV KAQNTYRIFV LGESAAMGDP DPAYGFSRYL EVMLRNRYPQ MRFEVMNTGT VAINSHVGLP IAREIAKLKP DVVIIYSGNN EVVGPYGAGT AFAASAMELP AIRSSIWYHT TRTGQLLTKL GMQKLEWRGM EMFLDKQVPQ SSPLMPYVYA NFEANLRDTI GVLRGAGATA IVSTVATNLR DCAPFSSLHR AGLSKEALQR WDTLVNEGAK LEEAGAHSEA LKLYAQTLAI DDEYAELEFR IARVQLALGK REEALKHFER ARDLDTLRFR ADSRINAINR STAESGGAEL VDAEQLLYAN AVDGITGGDL IYEHVHLTPT GNYLLARAMF LKIAGKLSPT AGEADVPSES ECEEWLALTG HDRIRIAHEM AERLQKPPFT NQSNHSEQLL RISMQAQQAD ESPQDTAAQY QRALQQAPND HLLHYGFGRF LFRYNPDAGA NELRQSRPWD GFPVFAPNGQ IF
|
| |
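The coordinate, length, and composition fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database: the function names `gene_length_bp`, `protein_length_aa`, and `gc_percent` are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the gene sequence above ends in TAG).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide sequence; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(904350, 906008)
print(length)                     # 1659, matching "Gene Length"
print(protein_length_aa(length))  # 552, matching "Protein Length"
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare against the reported GC content; the space-separated blocks of ten bases can be pasted in unchanged.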