Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1103 |
Symbol | |
ID | 7409660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1194918 |
End bp | 1196435 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643715469 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_002572977 |
Protein GI | 222529095 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG CAAAAGTCAT CTACGATAAG GAGTTCGTAA TCGGGCAAGT AGACAAGAGA ATCTACGGTT CATTTTTAGA ACACATGGGA AGAGCAATAT ACACAGGAAT CTATGAACCA GACCATCCGC AGGCTGATGA AATGGGGTTT AGAAAGGATG TTTTAGAACT TGTTCGCAAG CTGAATGTTC CTATTGTAAG ATATCCTGGC GGCAATTTTG TGTCGGGGTA TAACTGGGAA GACGGTATTG GTCCAAAAGA AAAAAGACCG AGAAGACTTG AGCTTGCGTG GAGAGCCATC GAGACAAATG AGGTTGGTGT AAACGAATTT GTTGAATGGG CAAAAAGAGC AAACACCTCT GTTATGATGA CAGTAAACCT TGGCACACGA GGAATTGACG CTGCAAGAAA CTTAGTTGAG TATTGCAACT TCCCAGGCGG TACATACTAC AGTGATTTGA GACGTCAGCA TGGTTATCAG CAGCCACACA ACATAAAAGT ATGGTGTCTT GGTAACGAGA TGGACGGGGA CTGGCAGATA GGTCATAAAA CTGCATATGA GTATGGAAGG CTTGCAAGAG AGACAGCAAA GGTTATGAAG TGGATAGATC CGAGTATTGA GCTTGTTGCA GCGGGAAGCT CAGGTCCCAA AATGCCAACA TTTCCTGAGT GGGAAGCAAT TGTTTTGGAC CACACATATG ACCTTGTAGA TTATGTGTCG CTACATGTAT ACTATGGAAA TCCTGAAAAA GACACAAAGA ATTTTGTTGC AAAATCGCTT GAAATGGAAG AGTTTATCAA AACAGTTATA TCAACAATTG ACTATGTAAA GGCTAAAAAG AGAAGCAAAA AGGTTGTCAA TATCTCATTT GACGAATGGA ATGTATGGTA CCATGCTCAT CTTGAGGGGA AAGACCAGAA AGCAGAACCC TGGGCACAAG TTCGTGCTAT TGCTGAAGAA GATTATGTGT TCGAAGATGC AATTTTGGTA GGATGCATGC TGATTGCGCT TTTGAAACAC TGTGATAGAG TCAAGATGGC GTGCATGGCA CAGCTTGTAA ATGTAATTGC TCCAATTACC ACTGTAAAAG GTGGAATTGC TTACAGACAG GTAATCTATT ATCCTTTCAT GCATGCTGCA AACTTTGGAC ATGGAGTTGC ACTGCTTCCC AAGGTAAATT CTCCTAAATA TGATTCAAAA GACTTTACTG ATGTTCCATA TATTGAAACA GTTGCAACAT ACAATGAGGA AAAGGATGAA ATAACAATCT TTGCAGTCAA CAGAGATTTA GAAGAGGAGA TGCAAGTTGA GTTTAAGCTT GATGGTTTTG AAGGCTTTGA GGTTGTGGAG CACATTGTAT ATGAAAGTGA TGATATTTAC AAAGGAAACA CTCAAGATAA GCCTGACAAT GTTGTGCCCC ACAAAGGTGG AAATTCAAAG ATAGAAGGCA ATGTTTTAAC ATCCATATTG CCCAAATTCT CCTGGAATGT TATCAGGTTA AAGAAGAAAG AAAATTAA
|
Protein sequence | MKKAKVIYDK EFVIGQVDKR IYGSFLEHMG RAIYTGIYEP DHPQADEMGF RKDVLELVRK LNVPIVRYPG GNFVSGYNWE DGIGPKEKRP RRLELAWRAI ETNEVGVNEF VEWAKRANTS VMMTVNLGTR GIDAARNLVE YCNFPGGTYY SDLRRQHGYQ QPHNIKVWCL GNEMDGDWQI GHKTAYEYGR LARETAKVMK WIDPSIELVA AGSSGPKMPT FPEWEAIVLD HTYDLVDYVS LHVYYGNPEK DTKNFVAKSL EMEEFIKTVI STIDYVKAKK RSKKVVNISF DEWNVWYHAH LEGKDQKAEP WAQVRAIAEE DYVFEDAILV GCMLIALLKH CDRVKMACMA QLVNVIAPIT TVKGGIAYRQ VIYYPFMHAA NFGHGVALLP KVNSPKYDSK DFTDVPYIET VATYNEEKDE ITIFAVNRDL EEEMQVEFKL DGFEGFEVVE HIVYESDDIY KGNTQDKPDN VVPHKGGNSK IEGNVLTSIL PKFSWNVIRL KKKEN
|
| |