Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0328 |
Symbol | |
ID | 4070090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 355532 |
End bp | 357187 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982331 |
Product | Alpha-L-arabinofuranosidase |
Protein accession | YP_589407 |
Protein GI | 94967359 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.48322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.88573 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTC GCCGTGATTT CCTTCGTTCC ACTGCCATTG GCGCTGCTGG ATTGGCGCTG ACCCGTTTCT CGCTGCCGTC GCTCGCGCAA AGCCGCAGCG TCGATTCCCG GATCGAGGTG CTGCTCGACG AACCGCTCGG GACCATCTCG CCGAACATTT ACGGTCACTT CACCGAGAAC CTGGCCGGGG TGTTTTACGA CGGAATCTGG GTTGGCGAGA ATTCGAAGGT CCCGAACGTC GGCGGGCTGC GTAAGGCCGT GATCGACCAC ATGCGCAAGA TCAAGGCCCC CGTGATTAGA TTCCCAGGCG GATGCTTTGC CGACACCTAC GACTGGCGCG ACGGCATCGG CCCACGCGAA AAGCGGCCGC GGCGCCCGAA CTTCTGGGGC AATGGCGACC CGAAGCAAAA CGTAAAGCAC AAGTACGACC CCAATGAGGT TGGTACCGAC GAGTTCATGC ATTTCTGCCG CGAGATTGGC GCACAGGGAT ATCTTGCAGC CAACGTGCGC AGCCTTCCCG CGGAGCAATT CCAGCAGTGG GTGGATTATT GCAATTCGCC TGCGGGGAGC ACGACGCTGG CGGAAACGCG CGCGACGAAC GGCTCGCGTG AGCCTTACAA GGTGGAGTTC TGGGGCGTGG GGAACGAGTC TTGGGGATGC GGCGGCGACT TCGAGCCCGG CCAATATGCG ATGGAATTTC GCCGCTACGC CACGTGGGTG CCAGGCTTCG GAATGAACTT GAAATTCATT GCCTCCGGGC CAAACGTGGA GGACTACCAC TGGACCAGCG GCTTCTTCGA GGCGATGCAG AAGCGGATGG GCTCCCTGCA CATGGTCTAT GGCTGGGCGC TCCACCATTA CGCCTGGAAC CTGAGCCGCG GCAAAACCAA CGACTGGGAC AAGGGCAAAG GCGAAGCTGT GAACTTCGAC GCCACCGATT GGTACGAGCT GATGAAGGAA GGTCAGCGCA TGGAAGGTCT GATCGAGGGA CACTGGCAGG TGATGGGTGA AACCGATCAC GAGCATCGCG TAAAGCTAGT AGTGGACGAG TGGGGCCCCT GGTACCGCCC GGGCAGCGAA GCAACTCCCG GGGATGCGCT GGAGCAGATG CCAACCCTCC GCGACGCGGT GTTCAGTGGC ATGACCCTCG ATATCTTCAA CCGCCATCCT GAAAAAGTGG CGATGGCGAA CTGCGCGCAG CTCATCAACT GTCTGAACAG TTTGTACCTG GCGCATGAAG ACAATTTCAC GGTGACTCCA GTTGGGCACG TGTTCGATAT GTATGCGCCT CACCAGGGCG GACAGTCGCT GCGCACGATC TTCTCTTCAC CGCAGGTGAA GTACGAGCGC GACGGCACAC CGGCGACCTT CTGGGGGCTG CGCGGTGCCG CGTCTTTAAC CGGGAAAAAA CTCTTCGTCA CGGCGGTCAA TCCGGATACC AGCTCACCAC GAGAAAGCGA GATTGCGATC CGCGGGGCGA GCGCGGCTTC TGGCACGCTG ACCGTGCTCT CGGCGCCCGA CATCCACGCG CACAACACCT TTGAACATCC GGATGCGGTA GTACCCCGTC GGAATGAATT GAAGGTGAAT GGCGCGACGG TGTCGCTCGT GATTCCGCCA GCCTCCGTGG TCGCAATCGA ATTGGAACTG AGTTGA
|
Protein sequence | MSTRRDFLRS TAIGAAGLAL TRFSLPSLAQ SRSVDSRIEV LLDEPLGTIS PNIYGHFTEN LAGVFYDGIW VGENSKVPNV GGLRKAVIDH MRKIKAPVIR FPGGCFADTY DWRDGIGPRE KRPRRPNFWG NGDPKQNVKH KYDPNEVGTD EFMHFCREIG AQGYLAANVR SLPAEQFQQW VDYCNSPAGS TTLAETRATN GSREPYKVEF WGVGNESWGC GGDFEPGQYA MEFRRYATWV PGFGMNLKFI ASGPNVEDYH WTSGFFEAMQ KRMGSLHMVY GWALHHYAWN LSRGKTNDWD KGKGEAVNFD ATDWYELMKE GQRMEGLIEG HWQVMGETDH EHRVKLVVDE WGPWYRPGSE ATPGDALEQM PTLRDAVFSG MTLDIFNRHP EKVAMANCAQ LINCLNSLYL AHEDNFTVTP VGHVFDMYAP HQGGQSLRTI FSSPQVKYER DGTPATFWGL RGAASLTGKK LFVTAVNPDT SSPRESEIAI RGASAASGTL TVLSAPDIHA HNTFEHPDAV VPRRNELKVN GATVSLVIPP ASVVAIELEL S
|
| |