Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2242 |
Symbol | |
ID | 4072987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2661324 |
End bp | 2662949 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984258 |
Product | TPR repeat-containing protein |
Protein accession | YP_591317 |
Protein GI | 94969269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.433133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCA TTGCTACGCT TTCGATTTGT GCTGCGCTGG CGCTGGGCTC GGCTGCTGTC GCGCAGGAGG ATGTCCATAA GCACCATCAT GACGATGGCG TGGATCACAC CACGAACTTC GGGCATGTGA ACTTTCAGAC ATCGTGCTCG CCCGCGGCGC AAACGCAATT CGAGACCGGA GTAGCGGCGT TGCATTCGTT TGAGTACACC TCGGCAAAGA AGTTGTTTGG AGCGGCGGAA CAGGCGGATT CAGAGTGCGC CATCGCCTAT TGGGGCGAGG CCATGACGCT CTGGCACCAG CTTTGGGACA CTCCCAAACA GGATGTGCTC AACGAAGGTT GGGCCATGAT CCAGAAGGGC GAAAAAGCAA AGCATACCAG CGCACGCGAG AGCGGCTATC TGAAAGCCGT TGAGGCCTAC TACAAGCCGT CAAAACAGAC CCCGGACGAG CGCGCGACAG CGTACTCCGA CTCGATGGGC AAACTGCACG ACAAGTATCC GGACGACGAG GAGGCTGCCG TCTTCTACGC CCTCTCGCTG CTCGCTTCTG AGCCGCCGAC CGACACAACT CTCGCGAATC CGAAGAAAGC TGTCGCAATC CTGAACCAGG TTCTGGCGAA GGACCCTGAC CATCCGGGCG TGACGCATTA CATCATTCAC GCCAGCGACA ATCCGCACAT GGCGCAGGAC GCCGTCGCCG CAGCGAAGAA GTATGCGAGC ATCGCGCCGG GCTCGCCGCA CGCGGTGCAT ATGCCATCGC ACATCTTTGC CCGCGTCGGC TACTGGCAGG ACTCCATTAA CTCCAACCTC GCCGCCATCG CGATCGCGAA GAAAGGCAAC GAGGTCGACT ACCAACTCCA CCCCATGGAC TTCCTGATGT ACGCCTACCT GCAAACCGGG CAGGACGACA AGGCCCGCAC AACCGAGCAG GAAGCCGTCG GCATGGAGAA CAAGGGCTAT GGCCGCGGCC GCGAGCCGTT CTATTACTAC GTGCAGGCGC ACTTCCCTTC CATGCTTGCG CTCGAGCTGC GCGACTGGAA GGCCGCTGAG GCATTACAGC CCGTCGAAGG CGGGGAGCCC GGATTCAAAG CCATCACCTA TTGGGCGCAG GCCGTCGGCG CAGGGCATTT GAAAGATGTT GCGAAGGCTC AGGAAGCCGT AAAGAACGTG GATGCCGCCA TCGAGGCAGA AAACAAAGCG CATCCCGAGT ATTCCCACGC CCCCGTGAAC ACTGACAAAA ACGAAGCCCA CGCCTGGCTC GCCTACGCGC AAGGCAACAA CGACGAAGCA TTCCGTCTGC TGAAGGAAGT GATCGACTAC CAGGACAAAG TCGGCAAGGG CGAAGTCGAA CTGCCTGCCC GCGAAATGTA TGCCGACATG CTGCTCGAAC TCAATCGTCC GGCAGATGCG CTGGAACAAT ACAAAATTTC CCTGAAGACC GATCCGAACC GCTTCAACGG CGTCTATGGC GCCGGCAAAG CGGCGGAGAT GGCCGGACAG CATGAAGTCG CTGTCGGCTA CTACAAGCAG TTGGCCGAAA ACTGCAAAGA GGCCGCGCCA GTACGTTCCG AGTTGGCGCA CGCAAGAGAA GTAGCCGGCG GAGCGACGGT GGCCGCGGGA CAATAG
|
Protein sequence | MKRIATLSIC AALALGSAAV AQEDVHKHHH DDGVDHTTNF GHVNFQTSCS PAAQTQFETG VAALHSFEYT SAKKLFGAAE QADSECAIAY WGEAMTLWHQ LWDTPKQDVL NEGWAMIQKG EKAKHTSARE SGYLKAVEAY YKPSKQTPDE RATAYSDSMG KLHDKYPDDE EAAVFYALSL LASEPPTDTT LANPKKAVAI LNQVLAKDPD HPGVTHYIIH ASDNPHMAQD AVAAAKKYAS IAPGSPHAVH MPSHIFARVG YWQDSINSNL AAIAIAKKGN EVDYQLHPMD FLMYAYLQTG QDDKARTTEQ EAVGMENKGY GRGREPFYYY VQAHFPSMLA LELRDWKAAE ALQPVEGGEP GFKAITYWAQ AVGAGHLKDV AKAQEAVKNV DAAIEAENKA HPEYSHAPVN TDKNEAHAWL AYAQGNNDEA FRLLKEVIDY QDKVGKGEVE LPAREMYADM LLELNRPADA LEQYKISLKT DPNRFNGVYG AGKAAEMAGQ HEVAVGYYKQ LAENCKEAAP VRSELAHARE VAGGATVAAG Q
|
| |