Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2146 |
Symbol | |
ID | 4068782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2564039 |
End bp | 2566273 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984161 |
Product | transglutaminase-like |
Protein accession | YP_591221 |
Protein GI | 94969173 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.360106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCCA CAGCGACCCA ATCCGGACTG AACCGCGCCC TGCACGAAAC CGCAGCGCGC GTTCCCGAAC CGATTGAGCA CTATTTCCAG GTGTCGTTGT TCCTGCTCTT GATCACCGGT TTCGTAACCT TGGCAGGCAC CGGAAAACTG GACTTGCTGT CGGTTGTCTT CGTGCTCGCC GCGTTAGGTC TCCGCGCAAT CCATCTCGCG CAGAACAAGC AAATCATTAT TCCGGAACGC TGGACCACTG CGCTCACGAT TGCCTACGTC GCCGTCTACG CCGCCGATTA TTTCTTTCTC TCACGCGACT TCGTCACCGC CACGGTTCAC CTCGTTCTAT TCGGAATGGT TGTTAAGGTC TTCTCGGTTC ACCGCGAGCG CGACCTGCTG TACCTCGGTG TGCTGTCGTT CCTCATGATT CTCGCCGCCT CGGTCCTCAC GGTGAACACT GCGTTCCTCG GTGGCTTCGC ACTCTTCCTG CTCGTCGCGA TCGCAACCTT CGTCAGCTTC GAAATGCGCC GCTCCGCGCT GGCAGCCGAC TCCGTGCAAT CGTTGAACGC GATCCCATCG CGTTCGCGAC CGCACGCGAC GAAGGTCAAC ACTTCCCTGT CGCGCACCGC GCTGATGTTG GCGACCACAA TCCTCCTCGG CGCGACGGTC CTGTTCTTCA CCCTTCCGCG CATCTCGGGC GGCTATCTCG GCTCTTACAC GCGTGGCTCC GATCCGGTCA GCGGCTTCCG CGACAACATC CTGCTCGGAC AAATCGGCCG CATCCAGCAA TCGTCGCAAG TCGTCATGCA CCTGCAGATC ACCGGGGACC ATCCGGCCTT CGACGGCAAA GTTCGCGGCT CGGTCCTCTC CCGCTTCGAT GGCCGCTCCT GGGCCGACAC ACCGCGCTAC ATGAACGTCA TCAATTCGCG CTTCGGCCGC TATGACATCT CCAACGAGAC GCTGGCTGCC GATCCGTATC TCGAGCGCGT CTCGGCCACG CACAAGAACC AGAACATGCC TTATCGCGTC TCGATGGAGC CGACGATGAG TTCGGTCCTG TTCCTGGTGA AGGGCACGAT TGAACTCCAG GGCAGCTTCC GTCAGATCGC GTTCGATAGC GCACAGTCCT ACATCAACCT CGACGGCGGC CATCCCTCGA ACGACTACTG GGGAGTCGCG AACGTCGCGC CTCCCGATCC CGCCCTGCTG CGGCAAGCGG GCAGCGACTA TCCCGCCCGC GTCGCGCAGC GCTATTTGCA ACTGCCGCCA CTTGATCCGC GGATTCCCGA TCTCGCGCGC AAAGTAACGG CGAAGGCATC GAACCCCTAC GACAAAGCGC ATGCTATGGA GACCTATCTC CAGAGCAGCT ACGGCTACAC CCTCGAGTTT CCTCTCGTAC CTCCAGCCGA TCCACTTGCG AATTTCCTCT TCGAGCGCAA GCAGGGCCAC TGCGAATACT TCGCCAGCGC CATGGCGGTC ATGCTGCGTT CCGTGGGCAT TCCTACGCGC GTCGCTACCG GCTTCCGTGG CGGCGAGTAC AACGACATCA CGGGCAGCTA CATCATTCGT GCCCGCGACG CTCACGCCTG GGTCGAAGTC TATTTCCCAA ACCAAGGGTG GGTGACATTC GATCCAACCG CTGCGGCGCC AATGGAGCCG GCCGGTCTAT TCGGGCGCTT ACGCCTCTAC GCCGATGCCA TGAACGAGTT CTGGCGTGAG TGGATCATCA ACTACGACTT CCAGCACCAG CGCACCCTCA CGGCCGCGGT GACCACCGAG TCCCTGCAAA AAGGCATGAG CCTGCGCGAT TGGATCTCCG CGAAATACGA CCGCATGCTC GGCCGCGCGC GCCAGGTACA GAAATCATTC TCAGAATCGC CGCAGCGCCA ATCGCGACTA GCCGTGACCG TAATTTGCCT CATGCTGCTC ATCGTCATCG CGCCGCGCGC CTGGCATCTT CTCAAGATGC GCCGGATCGC CGCGCATCCC GGCGACGCGC CCGAAGCCGC TGCTTCCATC TGGTATGGCC GCGCTACGCA TCACCTCGCA CGCTACGGTT GGGCCAAGCA ACCCTCGCAA ACGCCCGCGG AATACGCGCA GAGCATCGAC CACGAAACCA TGCGCCGGAC GATGGAAGAG TTCACACTGC TTTATGAACA GGCACGATTC GGAGGTTCTG CGACCAGCGC GAGCCGTTTG CCGGAGCTGT TTCAACGCCT GAAGGTTCGT CAGCGGGAAC GCTAA
|
Protein sequence | MASTATQSGL NRALHETAAR VPEPIEHYFQ VSLFLLLITG FVTLAGTGKL DLLSVVFVLA ALGLRAIHLA QNKQIIIPER WTTALTIAYV AVYAADYFFL SRDFVTATVH LVLFGMVVKV FSVHRERDLL YLGVLSFLMI LAASVLTVNT AFLGGFALFL LVAIATFVSF EMRRSALAAD SVQSLNAIPS RSRPHATKVN TSLSRTALML ATTILLGATV LFFTLPRISG GYLGSYTRGS DPVSGFRDNI LLGQIGRIQQ SSQVVMHLQI TGDHPAFDGK VRGSVLSRFD GRSWADTPRY MNVINSRFGR YDISNETLAA DPYLERVSAT HKNQNMPYRV SMEPTMSSVL FLVKGTIELQ GSFRQIAFDS AQSYINLDGG HPSNDYWGVA NVAPPDPALL RQAGSDYPAR VAQRYLQLPP LDPRIPDLAR KVTAKASNPY DKAHAMETYL QSSYGYTLEF PLVPPADPLA NFLFERKQGH CEYFASAMAV MLRSVGIPTR VATGFRGGEY NDITGSYIIR ARDAHAWVEV YFPNQGWVTF DPTAAAPMEP AGLFGRLRLY ADAMNEFWRE WIINYDFQHQ RTLTAAVTTE SLQKGMSLRD WISAKYDRML GRARQVQKSF SESPQRQSRL AVTVICLMLL IVIAPRAWHL LKMRRIAAHP GDAPEAAASI WYGRATHHLA RYGWAKQPSQ TPAEYAQSID HETMRRTMEE FTLLYEQARF GGSATSASRL PELFQRLKVR QRER
|
| |