Gene Information |
Locus tag | Acid345_0768 |
Symbol | |
ID | 4069513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 947425 |
End bp | 949083 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982774 |
Product | TPR repeat-containing protein |
Protein accession | YP_589847 |
Protein GI | 94967799 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
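The coordinate, length, and protein-length fields above agree with one another, and the check can be sketched in a few lines. This is a minimal sketch: it assumes Start bp and End bp are 1-based and inclusive (the record does not say so, but the arithmetic fits), and the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues: codons in the CDS minus the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 947425, 949083
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1659 552"
```

Both numbers match the Gene Length and Protein Length fields, which is what one would expect for an unspliced CDS that is not a pseudogene.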
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.400083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATAGTCC CGAGATTCCT GCGAATTTGT TGCGCGGCCC TGCTGCTAGT TGTGTGCGCA AGTGTTGCGA GCGCGCAGCA CGACGAGCAT GCCGCGAGCG CCAAGCCAAC CACCCTGATG GAAGGCTTCG GCAACATCCA TCACAAAATT GCGACCGGCA ATCCTGAAGC GCAGAAGTAT TTCGACCAGG GACTGGCGCT TGCGTTCGGC TTCAACCATG AAGAGGCAGA GCGCGCATTC CGCAGAGCGG CTGAGTTAGA TCCGAAGGCC GCGATGCCCT GGTGGGGCGT GGCCTACGTC GTCGGTCCGA ACTACAACAT GGACGTGGAT CCCGAGCACG AGCGCGTGGC GTTCGAGGCG ATCCAGAAGG CCGTAGCGCT TTCCGCGAAT TCGCCGCAGG TCGAGAAGGA CTACGTCAAT GCGTTGGCGA AGCGCTATTC CGACGCGAAG AATCCCGACT ATCAGAAACT CGCGAGCGAC TATCACGACG CCATGGGAGC GCTCGCCAAG AAGTATCCCG ACGACCTCGA CGCGGCGACG ATCTACGCCG AGAGCGGGAT GAACCTGCAC CCCTGGAAGC TCTGGAAGAA GGATGGCACG CCGCAGCCGG GGACGGAGGA GATCGTCGCG ACGCTGGAAT CGGTGATCGC GCGCGATCCG AATAACATTG GGGCCTGCCA TTTCTATATT CATGCGGTGG AGGCATCGGC GCATCCCGAG CGCGGCGAGA CCTGCGCGAA GAAGCTGGCG GGGTTGGCGC CGGCGTCGGG TCACTTGGTG CACATGCCCG CGCACATCTA CATTCGCGTC GGCGAGCATG AGGCCTCCGA AGAAACCAAC GTAGCCGCAG CGAGGGCCGA CGAGGCCTAC ATCCAGCGAA CCGGCGCGCA GGGCGTCTAT CCGGCGATGT ACTACACCCA CAACCTGCAC TTCATCGCGA TTGAGAACGC GACCCTGGGC CGCTACTCTG CGGCGATGGA GGCGGCGCGC AAAGTCAGCG CGAACGTCGA GCCGATAGTG AAAGACATGC CGATGGCGGA CTTCTTTGCG CCCTTGCCGA CGATGGTGAT GGTGCGCTTC CGCCGCTGGG ACGACGTCCT ACAGGTGAAA CAGCCAAATT CGTATCAGCC CGACAGCACG GGCGATTATC ACTTCGCGCG CGGTCTGGCG CTGGCGCACA AGGGCAAGAT CGCCGAATCG AAAGTTGAGT TAGCGGCCCT GAACAAAGTC GCTGCAGAGA TGGCGAAAAT CCCGACAACT CCTGCCGGCC CCGAAAACGC CGCCAAGATC CCGCAGATCA TGGCGCACGT GGTGGAGGCG GAGATTGCGC TCGCACAACA AAAGAGTGAT GCGGCGATCG AGCATCTGAA GGCCGCCGTA CAGCTCGAGG ATTCGATGGA TTACAACGAG CCGCCGGACT GGTTTCTGCC CGTCCGCGAA ACGCTCGGCG GGACGCTCTT ACGCAGCGGC CAGCCGGTGG CCGCTGAAAT GGTGTTTCGG AAGGACCTGG AAATAAACCG GCGTAATCCG CGCTCCATGT ACGGCCTGAC CGAGGCACTG AAGGCGCAGA ACCGGATGCA GGACGCGCTG GCGCTGCAGG CGCAATTTGA CGAGGCCTGG AAGGGCGCCG ATACCAAGTT GACGATTGAG GAGCTATGA
|
Protein sequence | MIVPRFLRIC CAALLLVVCA SVASAQHDEH AASAKPTTLM EGFGNIHHKI ATGNPEAQKY FDQGLALAFG FNHEEAERAF RRAAELDPKA AMPWWGVAYV VGPNYNMDVD PEHERVAFEA IQKAVALSAN SPQVEKDYVN ALAKRYSDAK NPDYQKLASD YHDAMGALAK KYPDDLDAAT IYAESGMNLH PWKLWKKDGT PQPGTEEIVA TLESVIARDP NNIGACHFYI HAVEASAHPE RGETCAKKLA GLAPASGHLV HMPAHIYIRV GEHEASEETN VAAARADEAY IQRTGAQGVY PAMYYTHNLH FIAIENATLG RYSAAMEAAR KVSANVEPIV KDMPMADFFA PLPTMVMVRF RRWDDVLQVK QPNSYQPDST GDYHFARGLA LAHKGKIAES KVELAALNKV AAEMAKIPTT PAGPENAAKI PQIMAHVVEA EIALAQQKSD AAIEHLKAAV QLEDSMDYNE PPDWFLPVRE TLGGTLLRSG QPVAAEMVFR KDLEINRRNP RSMYGLTEAL KAQNRMQDAL ALQAQFDEAW KGADTKLTIE EL
|
| |
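The sequences above are printed in space-separated blocks of ten, and the gene starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can act as an initiator codon. The following is a minimal sketch of how one might strip the block spacing, compute GC content, and translate a fragment. The helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, the codon table is built by hand rather than taken from a library, and the start-codon set is assumed to be the table 11 initiator set.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initiator codon is read as M, a final stop is dropped."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
fragment = "GTGATAGTCC CGAGATTCCT GCGAATTTGT"
print(translate_cds(fragment))  # prints "MIVPRFLRIC"
```

The translated fragment matches the first block of the protein sequence in the record. Running `gc_percent` on the full gene sequence would be the natural way to compare against the reported GC content field.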