Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1881 |
Symbol | |
ID | 4073040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2259250 |
End bp | 2260347 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983890 |
Product | rhomboid-like protein |
Protein accession | YP_590956 |
Protein GI | 94968908 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.210554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAT GCGAACAATG CGGCGCGGAG TTCTCTAACT GGTTTGGGGA TGTAAAACAT CGCTGCCCCA AATGCGAGCA GCAAGCTGCC GCCCCGCAGC AGCCGCCTCC ACAAGTGCTC CTGCCAGACG GCACGCTATC GCCGGCTCCG GTCCAACGCA CGCTGCCCGC GCCGATCGTC ACTCGCATAT TGATCGGAAT CAATGTGGCG GTGTTTTTAG CGATGGTGCT GCTCACACGC CAATTCGTGG AGTTCGATAC TCCTACCGCG CTGCGATGGG GGGCGGATTA CGGTCCGGCA ACCGCCAGCG GCGAGTGGTG GCGCATGCTG ACCTCGATGT TCCTGCACGG CGGTATTTTG CACATCCTGG TGAACATGTT TGCGCTGCGA AACCTTGGGT ACACCGCGGA ACTGTTCTAC GGGCGCAAGA ATTTCCTCAT TATTTACATG CTGAGCGGCT TTGGCGGAAG TGCGGCAACG CTGTTGTGGC GTCCCGACAG CGTGAGCGTC GGTGCGAGTG GTGCGATTTT CGGCGTGGCC GGAGCGCTCG CAGCGATGGT GTATTTCAAG AAGCTTCCGG TAGACCGTGC ATTGCTGAAA CGCGACATCG GCAGTATCGG CGCTGTAATC TTCTACAACC TGCTGATCGG CGCCGCATTG CCGATCATCA ATAACGCGGC CCACGTCGGC GGACTAGTTG CCGGCGCAAT CCTTGGCTTC ACCCTTCCTG CAATGATTTT CCGGACGGAA CGGGAAAAAT CGAATAGCTC GGGAACTATT GCCATCGGGA TAGTAATTGC CTTGATTGTC GCAATCGGTA TCACTGGACA CCAGAAGCTG GCGGCGGAAG AGGAACTCTA TCGCGCCGAT AAGGCTTACG AGGCTGGCAA AAAGAGCGAG GCGCTGCAAC ACCTGGAACG CGCGGCTGCC ATGCATCCCG AGACTTTCAC CGCTAATTTT ATGATCGGTG CGATTTATCT CGATGAAGGA CAGCCCGAAA AGGCCGTTCC CTTCCTCGAG AAAGCTGTGC AATTGCAGCC TGAGAATCGG GCGGCCAAGC AGGCGCTCGA CCGAGCCCGG AATTCCCTCA ATAAATAG
|
Protein sequence | MAKCEQCGAE FSNWFGDVKH RCPKCEQQAA APQQPPPQVL LPDGTLSPAP VQRTLPAPIV TRILIGINVA VFLAMVLLTR QFVEFDTPTA LRWGADYGPA TASGEWWRML TSMFLHGGIL HILVNMFALR NLGYTAELFY GRKNFLIIYM LSGFGGSAAT LLWRPDSVSV GASGAIFGVA GALAAMVYFK KLPVDRALLK RDIGSIGAVI FYNLLIGAAL PIINNAAHVG GLVAGAILGF TLPAMIFRTE REKSNSSGTI AIGIVIALIV AIGITGHQKL AAEEELYRAD KAYEAGKKSE ALQHLERAAA MHPETFTANF MIGAIYLDEG QPEKAVPFLE KAVQLQPENR AAKQALDRAR NSLNK
|
| |