Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1898 |
Symbol | |
ID | 4073360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2276280 |
End bp | 2279531 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983908 |
Product | glycosyl hydrolase |
Protein accession | YP_590973 |
Protein GI | 94968925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000419875 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000213557 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAGC TCATTTGCGC AGTTTTCGCG TCTCTCATGT TCGTTGCCGC TGGCTATCCA CAGGAACCGA TTGCGGACGT GGAGCACGAA GATACCGAAG CCACCCCGCA GCAGCGTCAG CAGTGGTTCT ACGGTCAGCG CGCCTATCCG TTCAAGGTCA CGCCCGCCGG CGCCCATCGC CGCGCCTTTG GCGAAGCCAC GCAAATGCGC ATCGATGAAG AAGCGGTTCG CGCCAACGCT CCTGCTGGTG GCAAAACCAA CACGCTGAAT CCCTGGACCA TGATTGGTCC CAAGCCGTCG AACCAGGGCG GCTACGGCGT GACCTCGGGT CGCATCACCG CAGTTGCGGT GGACCAGACG ACTTCCGGCG CGAGCACCGT GCTCTACATC GGCGGTGCGG AAGGCGGTAT CTGGAAGAGC AGCGACAACG GCTCAACCTG GACCGCGCAG AGCGACAGCC AGCCTACGCT TGCTATCGGT TCGATCGCGA TTGATCCAAA CAATCACAGC ATCATCTATG CCGGCACCGG CGAAGAGAAC TTCAGTGGCG ACTCCTACTA CGGCGGCGGA GTGCTCAAGT CCACGAACGG TGGCTCGACC TGGACAATGC TGGGCGCCTC CTATTTCGGA GGCCCGATCG GATCCGGTTC CTATTACGGC GGCAGCTTCA TCGGCGCGAT CGCCGTCCAG CCAGGAGTTT CTTCGGGAAC GCCTATCGTT CTCGCTGGCT CAGAATTCTC CAGCAACGCT AGTTCTGGTG TCTGGCGCTC CACGGATGGC GGCACAACCT GGGCACGTGT ATTCCCGACG ACTGCACAGC TCTATTCGCA CGTGACGTCG GTTGTTTGGG TCAGCAAGAC TAAGGCATAC GCCGCCGTCA GCAACGTCTT CGGTGCTTCT AGTGTGCCAG TCGGCGTGTA CGTTTCAAGC GACAGTGGTG CTACATGGGG CCCCGCGAAT GGCGTCTCTG GACAGGCATT GCCGGATGGC ACCACAACAG CAGGTCGCTT CACCCTCGCC GTATCTCCTT CAACGCCGGC GACAATGTAT GTTTCCGTCA GCGATTACAA CACCAGCGGC CTCTACGGAA TGTACTTCAC CACTGACAGC GGCGGACATT GGAATCCGCT TAAGTCACCT CTGAATGCGG TTGGCACCAC CAACGATTTC TGCGGTCCGC AGTGTTGGTA CGACATGCCA CTTGCCGTCC ACCCGACTCA TCCCGGCACT CTCTACGCTG GCGGAAATTT CAACTATGGC GCGGGCAACG GCGGAGTTTA CGTCAGTCTC AACGCCACCA ATGGCGCTAC CGCGACCTGG TCTACCCCGA ACCCTGGCAC CAACAGCGTG ACCATGCACC CGGATTTCCA CGCCTTCGCC TTTTCTGCAG ACGGCAACAC ACTGTACATC GGTGAAGATG GCGGCCTCTG GCGCGGTACG CCAACCAACA GTGCCACCAT GGCGTGGACC GATCTCAACA CTAATCTTGC GATCACCGAG TTCTACCCCG GCCTCGCGAT CTACAAAGGC AGCAAAAACA CCGCGCTCAA CGGGACGCAG GACAATGGCG CCCAACTCTA CACCGGCTCG CTGCAGTGGA CGGTCGTCAC CTGCGGTGAC GGCGCTTGGG CGGCGATCGA TCCAACGACC GCCAATAATC TCTACGCCGG TTGCACCTCA GCGAATTTTG AGGGTGTCAT TCGCTCGCTC GACGGGGGCG GCAGTTGGGC CAGCCTCGGC ACGGGCATCA ACAATTTCGA GAATGTCGCG TTCATCCCAC CGATGATCAT GGATCCCAAA AGCTCAACCA CCCTGTATTA CGGGACCGAT CACCTCTACA AGATGGTCAA CTCTTCGCAG CCAAGCCCGT TCCCAACTTG GTCGTACGTA AATGCTTCCG CTCTCACAAG CGGCTATCTC TCAACGATTG CCGTCAGCGC GGTGAACGGT GCGTACTTCT TCGTCGGTGA CAGTACTGGC GCGGCGCAAT TCTCAATCAA TTCCGGCGCT AGTTGGACAG CTTTCACCGG ACTCCCAGGG CGCTTTGTCA GCATGGTTCA AGCCGATCCG CACACTGCCA CCATCGCTTA CGTCACGGTC TCCGGTTTCA GCGGCTTTAA CGGCGACACC AAGGGCCACG TCTTCAAATG CCTCACGACG ACTAGCGCTT GCACCGACAT GAGCGGCAAT CTGCCGAACA CGCCCGCCAA CGACATCGTC ATCGATCCCG ATCTTGCCAA CACCTTCTAT GTCGCGACCG ACGTCGGCGT CTTCACCAGC ACCAACAACG GCACGACTTG GTCCACCTCA GGAACTGGCT TGCCGAACGT CGCGGTCGTC GGACTGAAAC TGCACGAAAG CTCCCGCACA TTGCGCGCTG CTACCCATGG CCGCAGCACA TGGGACCTCT CAGTGCCCAC CTCCACCGTA ACGCCTGCCG CCATGACCAG CCCGGCGAAC GGCGCGACGA TGACCGGCGC GAGCGCTACC TTCAACTGGA GCGCAGGCAC GGGAGCGACG CAGTACTCGC TCTATATCGG AAACACCTCG GGCGCGCATG ACATCGCTTT TGTCAGCACG ACTTCGCTCT CGGCGACCGT CAACACGCTC CCAACCAACG GCGAGAAGTT CTTCGTCAGC TTGTACTCGT ACATCGGAGG GAAGTGGTAC TACAACGCGT ACTCCTACTA CGCCTCCGGC ACCGGCGCAG CCGCGACGAT GAGTACACCT ACGCCCGGCA CGAAGTTGAG CAGCGCCAGC CAGACCTTCA CCTGGACCAA GGGCACCGGC ATCAACTCGT ACTCGCTCTA CATCGGTACG AAAGCCGGCC TGCACGACAT CGATTTCCTG AACACGAGCA ACACGTCAGC CAGCTTCAGC AATCTGCCGA CGAACGGCGG GACGTTCTAC GTCACGCTCT ACTCGCTGAA CGGGAAGACC TGGCTCTCGC ACCCATACAC CTACGTCGCT TCCGGTTCGG GCACGGCCGC AACCATGTCC ACCCCGACGC CCGGCAGCAC CTTGCCCGGC GCCAGTGTCA CCTTCAATTG GACAACTGGT TCGGGCGTGA CATCGTACTC GCTGTACATT GGTACGACGG CGGGCGCGCA CAACCTCGAC TTCATCAACA CCACCTCAAC TTCTGCCAGT GTCACGAATC TTCCAACCAA CGGATCCACC GTGTACGTCA CCCTGTACTC GTTGATCGGC GGGGTGTGGC ACTCCAACGC CTACACCTAC AAAGCGCAGT AG
|
Protein sequence | MKKLICAVFA SLMFVAAGYP QEPIADVEHE DTEATPQQRQ QWFYGQRAYP FKVTPAGAHR RAFGEATQMR IDEEAVRANA PAGGKTNTLN PWTMIGPKPS NQGGYGVTSG RITAVAVDQT TSGASTVLYI GGAEGGIWKS SDNGSTWTAQ SDSQPTLAIG SIAIDPNNHS IIYAGTGEEN FSGDSYYGGG VLKSTNGGST WTMLGASYFG GPIGSGSYYG GSFIGAIAVQ PGVSSGTPIV LAGSEFSSNA SSGVWRSTDG GTTWARVFPT TAQLYSHVTS VVWVSKTKAY AAVSNVFGAS SVPVGVYVSS DSGATWGPAN GVSGQALPDG TTTAGRFTLA VSPSTPATMY VSVSDYNTSG LYGMYFTTDS GGHWNPLKSP LNAVGTTNDF CGPQCWYDMP LAVHPTHPGT LYAGGNFNYG AGNGGVYVSL NATNGATATW STPNPGTNSV TMHPDFHAFA FSADGNTLYI GEDGGLWRGT PTNSATMAWT DLNTNLAITE FYPGLAIYKG SKNTALNGTQ DNGAQLYTGS LQWTVVTCGD GAWAAIDPTT ANNLYAGCTS ANFEGVIRSL DGGGSWASLG TGINNFENVA FIPPMIMDPK SSTTLYYGTD HLYKMVNSSQ PSPFPTWSYV NASALTSGYL STIAVSAVNG AYFFVGDSTG AAQFSINSGA SWTAFTGLPG RFVSMVQADP HTATIAYVTV SGFSGFNGDT KGHVFKCLTT TSACTDMSGN LPNTPANDIV IDPDLANTFY VATDVGVFTS TNNGTTWSTS GTGLPNVAVV GLKLHESSRT LRAATHGRST WDLSVPTSTV TPAAMTSPAN GATMTGASAT FNWSAGTGAT QYSLYIGNTS GAHDIAFVST TSLSATVNTL PTNGEKFFVS LYSYIGGKWY YNAYSYYASG TGAAATMSTP TPGTKLSSAS QTFTWTKGTG INSYSLYIGT KAGLHDIDFL NTSNTSASFS NLPTNGGTFY VTLYSLNGKT WLSHPYTYVA SGSGTAATMS TPTPGSTLPG ASVTFNWTTG SGVTSYSLYI GTTAGAHNLD FINTTSTSAS VTNLPTNGST VYVTLYSLIG GVWHSNAYTY KAQ
|
| |