Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2642 |
Symbol | |
ID | 4072051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3116231 |
End bp | 3117331 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984659 |
Product | kelch repeat-containing protein |
Protein accession | YP_591717 |
Protein GI | 94969669 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.105005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGCA CTCACCACTG GGCCCCCGCT CTCGTATTCG TCTTAGTGTC CACCCTTTCA GCCGATCGTC CACAGCCGCA TTTCGGAAAA GTCGTTGAGG CGGGACATAT GCTCGCGCCA CGATCTGGAC ACACAGCCAC TCTGCTCAAC GATGGGCGTG TGCTCATTGT CGGCGGCATG GTGAGGAATG GTGAGTTCCT CGACAGCGCG GAATTCTACG ATCCCGCAAA GCGTTCGTTT ACGGCCAGCG GCGCACACAT GAAAATAAAG CGCGTTGGGC AGACGGCTGC ACTGCTCAAA GATGGGCGCC TGTTCATAGT CGGTGGCTGG ACCGGTGGAA TCGCCACGAC GGCAGAGATC TATGACCCGA AGACAGATCG CTTCACGAAC GAAATTCCGA TGACAGTCCC GCGAGCTCGA GCAACGGCGA CGACACTGCA AGATGGGCGG GTGCTGGTGA CGGGCGGAGC GCGCGCCGAT GATCGCAGCG GGCAGAAAGC AGCAGAGATA TTCGATCCGG CGACGATGAA GTTCACGGCA GTGGGCGATA TGAAAGATGG TCGTACCGCG CATACCGCAA CGCTGCTGCA CGATAGGACA GTGCTTATTG CTGGTGGGAT GTCCGACCAC CACTCAGTGG CGAGCGCGGA GGTTTTCGAT CCGAAGACAA ATAAGTTTTC CGTTGTCGGG CCGATGCGGC AGGAGCGGTA TAAGCACACG GCGCAAATGC TGGCAGACGG GCGTGTCTTG ATCGCGGGTG GCTCGGATGA TCGCGACTGG AAGGGAATGC TGGCCGAAGC CGAGGTCTAT GATCCCGCGA AGCGGAGCTT CACTCCGACG CAGGAAATGG CGGAGAAGCG TTTCAAACTC TCCGACGAAG CGGCGTTGTT ACCAGATGGC AGCGTGCTGA TTGCTGGCGG CGCTGCGAAA GCGGAGATCT TCGATCCCAA GCGCGGAAGC TTCAATTCGT TGAACAGCGG CACGGAAACG CCGCAGTGGT ACTTGAGCGA GACCACGCTC AAGAACGGCG AGGTGCTGCT GCTCGGCGGG TACTCGACGA GCATGACCGC TACCGACAAG GCATGGATCT ACCAGCCGTG A
|
Protein sequence | MLRTHHWAPA LVFVLVSTLS ADRPQPHFGK VVEAGHMLAP RSGHTATLLN DGRVLIVGGM VRNGEFLDSA EFYDPAKRSF TASGAHMKIK RVGQTAALLK DGRLFIVGGW TGGIATTAEI YDPKTDRFTN EIPMTVPRAR ATATTLQDGR VLVTGGARAD DRSGQKAAEI FDPATMKFTA VGDMKDGRTA HTATLLHDRT VLIAGGMSDH HSVASAEVFD PKTNKFSVVG PMRQERYKHT AQMLADGRVL IAGGSDDRDW KGMLAEAEVY DPAKRSFTPT QEMAEKRFKL SDEAALLPDG SVLIAGGAAK AEIFDPKRGS FNSLNSGTET PQWYLSETTL KNGEVLLLGG YSTSMTATDK AWIYQP
|
| |