Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0922 |
Symbol | |
ID | 4070574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1166740 |
End bp | 1168374 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982929 |
Product | Alpha,alpha-trehalase |
Protein accession | YP_589999 |
Protein GI | 94967951 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.643949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00590138 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCAAAAGA AACTCTGCAT CACCATCATC GTTCTTTTCC TTTCCGCATC AGCATTCGCT CAAAAACCTG AAAACATGCC GCAGGTTCTG GACTATATCC ATAAGGGATG GGACGTCCTG ACACGCACGA TGAATTCGTG CGAGACGATC ATTGATAAGC ACGCGGGTGT GCATTCGATG TTGTATCTGC CGGTCGGGTA TCCGGAAAAC GAGGCAATGA AGGAAGTGAC GGAGCGATGC AAGGTCGAAC TCGCGCACCT GCCGCAGAAG GTCACGGCGA TTGGAAGCCT CGATACCAGC CATGGCTTTC GACAGGGCAC GCTCTATCTG CCGAATCCCT ATGTGGTGCC GGGCGGATTC CTGAACGAGC AGTACGGATG GGACAGCTAC TTCATCATCG TTGGGCTGCT TCGTGATGGC CGGTACGACA TGGCGAAGGG GATGGTGGAG AATTTCTTCT TCGAGATTGA TAACTACGGC GACATCCTGA ACGCGAATCG CTCTTACTTC ATGACGCGAT CGCAGCCGCC GTTTCTAAGT TCGATGGTGT TGCTGGTTTA CGAGGCGACT CCGGATGCGA CGGCCAAAGA GACTTGGTTG AAGAAGGCGT ATCCCTACAT CGAACGCGAT TACAACATGT GGACGAGCGG CGACAAACTC GCTGGCGATA CTGGTCTCTC GCGGTATTTC GACTATGGGC ACGGGCCTGT ACCGGAAGTG GCCGACGGGC ATGATCCGTA CTACCGGGAT GTTTTCCGAT ACATTCAGGC GTCGAACGAG AAAGACGATT ACCTCGCAGA GTCGGGGAAG TCGGCCGCGA AGCTGATTGG GCCTGAGTTC ACGATGGAAG TCTGCGAGAA GCGGGACGGG GAAAAGCCGA CGTGCGGAAC TTCATCGCCG ATGCAGTACA CCGCCGACTA CTACAAGGGC GATCGTGCGA TGCGGGAATC TGGGTTTGAT ATTTCGTTTC GTTTCGGCGA CTTCAGCGGG AAAACGCACC ACTTTGCTGC GGTGTGCCTG AACAGCCTGC TCTACAAGGC AGAGACCGAT ATGCAGAAGG TCAGCGAAAT CCTTAAGAAC GGGCAGGCAC AGACTTGGGC GGGGCGGGCG GCGAAGAGGA AGCAGCTCGT GGATAAGTAT CTGTGGGACG CACAGCGGGG ACGCTATTTC GATTGGGACT TCACGCTGGG CAAGCGCTCG ACCTACGACT ACATCACGAC GTTTTATCCA TTGTGGGTTG GGCTGGCGTC GCAGCAGCAG GCGAAGCAAG TGATGTCCCA CTTAGATGTG TTCGAGCGGG CGGGCGGAAT GTCGATGAGC CCGTACACGA CGGGCGTGCA GTGGGACCAG CCGTATGGAT GGGCTCCCAC GATGATGATT GGCGTCGGAG GCATGAGGCG CTATGGATAC AAGAACGAAG CGGATCGCGT GTCGGCAAAG TGGGTGAATA CGATTGCGCG GAATTTTGCG AAAGATGGGA CAATCCGCGA GAAGTACGAT GTCGTGCAGA GTTCGAGCGA ATTCCAGGCA AAGGCGGGTT ATAGCGAGAA CGTCGTAGGA TTCGGATGGA CGAATGCCTC GGCGCTGCAG TTCGCTTACG ATCTGGGATG GGTGACGAAG GCCGCAGCGA ACTAG
|
Protein sequence | MQKKLCITII VLFLSASAFA QKPENMPQVL DYIHKGWDVL TRTMNSCETI IDKHAGVHSM LYLPVGYPEN EAMKEVTERC KVELAHLPQK VTAIGSLDTS HGFRQGTLYL PNPYVVPGGF LNEQYGWDSY FIIVGLLRDG RYDMAKGMVE NFFFEIDNYG DILNANRSYF MTRSQPPFLS SMVLLVYEAT PDATAKETWL KKAYPYIERD YNMWTSGDKL AGDTGLSRYF DYGHGPVPEV ADGHDPYYRD VFRYIQASNE KDDYLAESGK SAAKLIGPEF TMEVCEKRDG EKPTCGTSSP MQYTADYYKG DRAMRESGFD ISFRFGDFSG KTHHFAAVCL NSLLYKAETD MQKVSEILKN GQAQTWAGRA AKRKQLVDKY LWDAQRGRYF DWDFTLGKRS TYDYITTFYP LWVGLASQQQ AKQVMSHLDV FERAGGMSMS PYTTGVQWDQ PYGWAPTMMI GVGGMRRYGY KNEADRVSAK WVNTIARNFA KDGTIREKYD VVQSSSEFQA KAGYSENVVG FGWTNASALQ FAYDLGWVTK AAAN
|
| |