Gene Information |
Locus tag | Acid345_0426 |
Symbol | |
ID | 4069652 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 497199 |
End bp | 499040 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637982430 |
Product | glycoside hydrolase family protein |
Protein accession | YP_589505 |
Protein GI | 94967457 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
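The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions, that the gene is unspliced (as the record states), and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 497199, End bp 499040.
length = gene_length(497199, 499040)
print(length, protein_length(length))  # prints: 1842 613
```

The result matches the Gene Length (1842 bp) and Protein Length (613 aa) fields.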
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0129037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCACG CTGCGAAGAT GTCATCTGTC CTCCTCCTTC TCTGTGCGTC TGCGTTTTGC GCGAATGATC TCAAGGTTCT AACCGATCAC GTTGGTTATG AGACCACAGG TGCGAAACAT GCTGTCGTTC TCGGTAAAGC CGGTGACCGT GTTTCCGAAT GCTCGATCAA GAATTCAACC GACGACAGAG TTGTTGCGCC GATAAAGGCG GTGGCGGTCG GCCCAGTGAA GAAGTGGCGC GATTGGTATT TTTGGACATT GGACTTCGAC AGCCTCACGC AGGAAGGCCA CTACTACATT GAGTGCGCCT CATCGCGAGG CGCAGTGCGA TCGTTTCCCT TTGCCGTTCA AGCCAATCTC CTGGAACAAG GCACTCTCTC TGATGTCCTG TACTACTTCA AAGACGAACG CAGTTCGGGA CCAATGGACA AGGCGGACAG TCACCTGCCC TTCGATCCTC CGAAGCAAGG CACTCTCGAT GCGCATGGTG GCTGGTGGGA CGCGACAGGC GATTACGGGA AGCATCTGTC GCACCTCTCA TTCTCAACCT ACTTCAATCC GCAGCAGATC CCGCTCGTCG TGTACTCGCT GCTAAAGAGC TACGGGCAAC TCACTCGGCG GGGACTCCCG GAGGTTACAC GCTACAAAGA CCGCATTCTC GACGAAGCGA TGTTTGGTGC TGATTATCTC GTGCGCGTAA AAGATCCGAG TGGTTCTTTC TATCGCTCAA TTTCGACGGG CGGCGTAAAG CAGGTGCCCG AAGAGCGCAA GGTCGCCGGC GAGATGAAGA AGTTCGCGAT CTACCAGTCG AACGACAAGC GTCCTGACAT GATTGAGAAG GCGAACAACG ATCTTGAGTA CGAAGTTAGC TATCGTTCTG GCGGCGGCAT TGCGATCGCT GCTTTGGCGA TGGCTAGCAC TGCTCCTATC TCAGGTGAAT ACAAGAATGC GGATTACCTG AAGGCTGCCG AAGACGCTTT CGCTTACCTT GAGAAGAACA ACCTGAAAAT GGTCAACGAT GGCAAAGAGA ACATCGTTGA CGATTACTGT GCACTCACCG CAGCGACTGA GTTGTTCCGC GCAACGAAGA AGCCAATATA CAAGGAAGCA GCTGATCGCC GCGCGTCAAG CCTAGTGTCG CGCCTGGCGA GCGATGGTCA GCACCAGAAT TACTGGCGCG CCGACGACCA TGATCGTCCC TTTTTTCATG CGTCTGATGC CGGTCTTCCT GTGGTTAGCT TGCTGTACTA CGCGGAGGTT ACCGACGCGC AAACTCGCAC AAAGGTTCTC GAAACGGTTA AGAAGTCGCT CGCTTTCGAG CTTGCGACTA CGCGTGAGGT CCCAAATCCT TTTGGCTATG CCCGAGAGTT TGTTCAGGAC AAAACCGGCG CCCGTCGCAC CAGCTTCTTC TTCCCACATA ACAGCGATGC GGCACCGTGG TGGCAGGGCG AAAATGCGCG GCTCGCATCT TTGTCTTCAG CAGCCAGACT TGCTGCGCTT CAATTCACCG ACGATCCGGA GTTCGCGAAG CAACTCAATT CGTATGCTCT GAACCAGCTC AATTGGATCG TTGGACTGAA TCCGTTCGAT TCCTCGATGT TGAACGGCGT CGGCCATAAC AATCCGCAGT ACCTGTTTTT CGATTCCTGG GAATTCACCA ATGCCCCGGG CGGCATATCG AACGGCATTA CCAGCGGCTT CCGCGACGAA GACGATATCG ACTTTAACCT TACGTACAAA CAGACCGGGG CCGACAACGA TTGGCGTTGG CAGGAACAGT GGCTGCCACA TGCGTCGTGG 
TATTTGCTTG CGGTTTCAAC GGGCAACACC TCGCCTCGCT GA
|
Protein sequence | MLHAAKMSSV LLLLCASAFC ANDLKVLTDH VGYETTGAKH AVVLGKAGDR VSECSIKNST DDRVVAPIKA VAVGPVKKWR DWYFWTLDFD SLTQEGHYYI ECASSRGAVR SFPFAVQANL LEQGTLSDVL YYFKDERSSG PMDKADSHLP FDPPKQGTLD AHGGWWDATG DYGKHLSHLS FSTYFNPQQI PLVVYSLLKS YGQLTRRGLP EVTRYKDRIL DEAMFGADYL VRVKDPSGSF YRSISTGGVK QVPEERKVAG EMKKFAIYQS NDKRPDMIEK ANNDLEYEVS YRSGGGIAIA ALAMASTAPI SGEYKNADYL KAAEDAFAYL EKNNLKMVND GKENIVDDYC ALTAATELFR ATKKPIYKEA ADRRASSLVS RLASDGQHQN YWRADDHDRP FFHASDAGLP VVSLLYYAEV TDAQTRTKVL ETVKKSLAFE LATTREVPNP FGYAREFVQD KTGARRTSFF FPHNSDAAPW WQGENARLAS LSSAARLAAL QFTDDPEFAK QLNSYALNQL NWIVGLNPFD SSMLNGVGHN NPQYLFFDSW EFTNAPGGIS NGITSGFRDE DDIDFNLTYK QTGADNDWRW QEQWLPHASW YLLAVSTGNT SPR
|
| |
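The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the listed gene sequence is already given in the coding (sense) orientation, which is consistent with it starting with ATG even though the gene lies on the minus strand. It uses the standard codon assignments, which translation table 11 shares for elongation codons. The `translate` helper is hypothetical, and only the first 30 and last 12 bases of the record are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-sense DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases, copied from the Gene sequence field.
print(translate("ATGCTTCACG CTGCGAAGAT GTCATCTGTC"))  # prints: MLHAAKMSSV
print(translate("TCGCCTCGCT GA"))                     # prints: SPR
```

Both fragments agree with the start (MLHAAKMSSV) and end (SPR) of the Protein sequence field. Passing the full gene sequence to the same function would allow the whole 613-residue protein to be compared.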