Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0753 |
Symbol | |
ID | 4068629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 929268 |
End bp | 930848 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982759 |
Product | Alpha-L-arabinofuranosidase |
Protein accession | YP_589832 |
Protein GI | 94967784 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.406878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCCTGC GCACAATTCT CGTTGTAGTG ACTCTTTTCG GTCTTCTTTC TAACGCAACG ATTGCGGCTG CCCAAAAGGT GCAGGTGTCG ATTGATCCAT CCCGGCCTGG GGTGAAGATC GATCGTAATC TGTTCGGGCA ATTTGCTGAG CATCTAGGAC ATGGCATTTA CGAAGGAATC TGGGTCGGCA GCGACTCCAC GATTCCGAAC ACCCGGGGCA TTCGCAACGA TGTGGTTGCC GCACTGAAAG CGATCCATGT GCCGAATGTG CGCTGGCCCG GCGGGTGCTT CGCCGACGAG TATCACTGGC GCGACGGGAT CGGGCCGCAA AGAGTGGTGA GGCTGAACCC GAACTGGGGC GGCGTGATCG AACCCAACAC CTTCGGCACC CACGAGTTCA TGGACTTCAT TGGGCAGATC GGGAGCGAGG CGTACGTGTC GGTCAACGTG GGCTCTGGTA CTCCGCAGGA AGCGTCAGAT TGGCTGGAGT ACATGACGGC AGCTCAGGCA ACGACGCTCC AGAAGGAGCG CGCTGCGAAC GGGCATCCGG CACCGTACAA GATCGCATTG CTGGGCCTCG GCAATGAAAG CTGGGATTGC GGCGGCAACA TGACGCCCGA TTACTACCTG GACCGGATGA AGGTCTTCAG CCGATTCGTT CGCAACTACA ATCCGGCGCA GACGGACAAG AACCAGATGT TGAAGATCGC AGTCGGTCCG GGCGGAGGCG AAGAGCGCTG GACGGAGTGG ACCGATACGG TGATGAAGGC TTACCAGAAG CACACGTGGA GCTGGGACAT CAACGGCCTC TCGATGCACA GTTACACGAC GGTGAAATGG CCGCCGGCGT ACAAGTCCGT GGGGTTCGGA GAGGACGAGT ACGCGCAGAT TCTGAAATCG ACGCTGGAGA TGGAAGACCT GGTCAAGAAG CATTCCGCGA TCATGGACAA GTACGATCCG GAAAAGAAGG TCGCCCTCAT CGTGGACGAA TGGGGCAGTT GGTATGCGCC CTTGCCGGGG AGCAATCCGG GCTTTCTCGT ACAGCAAAAC AGCATTCGCG ATGCGATCCT GGCCGCGCTG AACATCAACA TCTTTGCTCG CCACAGTGAT CGGGTGCGCG GCGCGAACAT TGCCCAGATG ATCAACGTGC TGCAGGCGAT GATCATCACC GATAAAGAGA AGATGGTGCT GACACCGACC TACTATGTTT ACAAGATGTA CCTGCCCTTC CAGGATGCGA CTTTCGTTCC GGTGACATTT GACGCGGGCA CCTACAAGCA CGGCGACAGC ACGCTGCCGC GCATCGATGC GCTCGCTGCG AGAGGAAAAG ACGGCAAACT GTGGCTGGAG ATCACGAATG TGGACCCGAA CCAGACGGCG GATGTGGAGT TGAATGTGAC TGGGTTTGCT ACGAAGTCTG CGTCGGGAGA AACGCTCGCC GGACCGAAGG TCGACAGCGT GAATACGTTC GAGGCACCGA ACACGGTTGT GCCGAAACCC ACATCGGCCC GCGTAGAGGG TGGAAAGGTG ATGCTTAAGT TGGAGCCCAA GTCCGTCACG GTGGTGTCAC TGGAGCAATA G
|
Protein sequence | MFLRTILVVV TLFGLLSNAT IAAAQKVQVS IDPSRPGVKI DRNLFGQFAE HLGHGIYEGI WVGSDSTIPN TRGIRNDVVA ALKAIHVPNV RWPGGCFADE YHWRDGIGPQ RVVRLNPNWG GVIEPNTFGT HEFMDFIGQI GSEAYVSVNV GSGTPQEASD WLEYMTAAQA TTLQKERAAN GHPAPYKIAL LGLGNESWDC GGNMTPDYYL DRMKVFSRFV RNYNPAQTDK NQMLKIAVGP GGGEERWTEW TDTVMKAYQK HTWSWDINGL SMHSYTTVKW PPAYKSVGFG EDEYAQILKS TLEMEDLVKK HSAIMDKYDP EKKVALIVDE WGSWYAPLPG SNPGFLVQQN SIRDAILAAL NINIFARHSD RVRGANIAQM INVLQAMIIT DKEKMVLTPT YYVYKMYLPF QDATFVPVTF DAGTYKHGDS TLPRIDALAA RGKDGKLWLE ITNVDPNQTA DVELNVTGFA TKSASGETLA GPKVDSVNTF EAPNTVVPKP TSARVEGGKV MLKLEPKSVT VVSLEQ
|
| |