Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1577 |
Symbol | |
ID | 4069015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1926441 |
End bp | 1928081 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983586 |
Product | hypothetical protein |
Protein accession | YP_590653 |
Protein GI | 94968605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.188499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.121402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAG GGAACAATCA AGGCGGCCGT CCACTTCTGG CGTGCGAGAT CACGCGCACC CAGGTGTTTG CGACGCGCTG GGCCGAAAAA ACTACTGGCG TCGAGGTACT GCAAGTACGC ACGATTCCGG GCGGTGTCTC TCCGAATTTG ACGAGTCAGA ACGTTTCGGA CGCAGGCGCG CTTAAGAGCG TGGTGGCGGA TGCGTTGCAG GCCAGCGGCG CCCGCACCAA GGACGTGACT CTCATCGTTC CTGACGCCGC CGTTCGCGTT GCGCTTCTCG ATTTCGATAC GCTTCCTGAA AAGAAGCAGG AAGCCGATGC GGTGGTGCGC TTTCGACTGA AGAAGTCATT GCCGTTCGAA GTAGACCGCG CAGCCATTTC CTACCACGCT CAGCCGAATG GCACGACATT GCGCGTGCTC GTGTGCGTGA TGTTGAACTC GGTGCTGCAC GAGTACGAGT CTGCAGTGCG CGACGCAGGA TTTTTGCCGG GTGTCGTTCT GCCATCGACC CTGGCAGCGC TGGGCAATGT AAGTGTTGAC GCTCCCACGA TGGTCGTGAA AATCGCAGAC GGGACCACGA CGATCGCGAT TCTCGATCAG GGACGCCTGC AGCTTTATCG AACTCTCGAC CACGGCTCTC CCGACGTGGA GCCAGCCTCG TTGGCGCATG ATATCTACCC GTCGGTCGTG TTCTTTCAAG ACACCTACGG CGTACCGATC GAGAAGATCT ACGTTTCCGG AGCGAATAAC TTCGCTGCTG TGGCGCCGCA TCTTGCGCAG GAGACAGCGG CGGAAATAGA AGAACTCGAC AATCCCGTGT TCGCAGGACT GAATCCCGGC ACATTGCCGA AGAGCATGCT TGCTGGCGTG CTGGGTTCAG GGCTGACGAA GACCAGGATC AACCTGGCGA GCGAGCCCTA CGAGGACGCG AAGCTGTACC TGGCGCGCTT CGGGACAATT GCTGCAGCGC TCTTGCTGGT TGCGGTGGGG CTGTTGTGGT TCACGATTCA CAGTGTTCGG CGCTCGAGCG ACATCAATCG GAAGTTGTCT GCCGTGCGAG GGCAGATCGA CACACTCGAC CGGGAAAGGG TCCTGGCCGA GAAGATGCTC GCCCTGCCGC AGAACCATGG CACTGTGAAT AAATCGGAGT TCTTGAACAG CGTCTTTGCG CGTAAGGCGT TTTCGTGGAC GACGGTTTTC TCCGACATGG AGAAGATCAT GCCGCCGGGG CTGCACGTGG TTTCGATCGC GCCGGAACTC GACGCGCAGA ACCAATTGAA AGTACAGATT GTTGTGGCAG GTGAGAACCG AGACCGGGCG ATCACACTGG TGCGAAATAT GGAGCAGACG CCGCGGTTCC GCGATGTGAT CCTGCGAAGC GACATACAAA ATACGGTGCT AGGTGGCACT AGCGCTGAAG ATCGCGATCC GATCCGTTTC GACATCGTTG CGCAGTATCT GCCTTCGGCG CCGAATCCGG CGCCACCAGC GAGCGGAGTC GCGTCGAAGG CAGAAGATGC GCCGTCCGCA GCGAGCGCAG AGCCTAAGGC TGCCGAGGCA TCGGCAGGTC CGGCGCCAGC ACAACCTGCT GCCCAGCCGA AGCCGCAGAC CGTCGCCAGA AAGGCGGGAG CACAACGATG A
|
Protein sequence | MLKGNNQGGR PLLACEITRT QVFATRWAEK TTGVEVLQVR TIPGGVSPNL TSQNVSDAGA LKSVVADALQ ASGARTKDVT LIVPDAAVRV ALLDFDTLPE KKQEADAVVR FRLKKSLPFE VDRAAISYHA QPNGTTLRVL VCVMLNSVLH EYESAVRDAG FLPGVVLPST LAALGNVSVD APTMVVKIAD GTTTIAILDQ GRLQLYRTLD HGSPDVEPAS LAHDIYPSVV FFQDTYGVPI EKIYVSGANN FAAVAPHLAQ ETAAEIEELD NPVFAGLNPG TLPKSMLAGV LGSGLTKTRI NLASEPYEDA KLYLARFGTI AAALLLVAVG LLWFTIHSVR RSSDINRKLS AVRGQIDTLD RERVLAEKML ALPQNHGTVN KSEFLNSVFA RKAFSWTTVF SDMEKIMPPG LHVVSIAPEL DAQNQLKVQI VVAGENRDRA ITLVRNMEQT PRFRDVILRS DIQNTVLGGT SAEDRDPIRF DIVAQYLPSA PNPAPPASGV ASKAEDAPSA ASAEPKAAEA SAGPAPAQPA AQPKPQTVAR KAGAQR
|
| |