Gene Information |
Locus tag | Athe_2072 |
Symbol | |
ID | 7408781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2188823 |
End bp | 2190283 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716439 |
Product | Alpha-L-fucosidase |
Protein accession | YP_002573922 |
Protein GI | 222530040 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | |
|
|
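The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the final codon is a stop codon that contributes no amino acid. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2188823
end_bp = 2190283

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is the stop codon,
# which is not represented in the protein sequence.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the Gene Length (1461 bp) and Protein Length (486 aa) fields.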
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAGT ATTTTAAAAC TATCCAAAAA GGTCCCTTTG AAGCAACTTG GGAATCTTTA AGACAATTCA AATGTCCTCA GTGGTTTTTA GATGCCAAGT TTGGTATTTG GGCGCACTGG GGACCACAGT CTGTACCAAT GTATGGGGAC TGGTATGCAA GAAATATCTA CAGAGAAGGA GAACCACAGT ATTACTACCA TTGGCGAAAG TATGGGCATC CTTCAAAGGT TGGTTACAAA GACATTGTTC AAATGTGGAA GGCTGAAAGG TTTAACCCAG AGGAATTAAT TGATCTGTAT ATAAAAGCAG GTGCAAAATA TTTTATTGCT CAGGCTGTTC ACCACGATAA CTTTGATAAC TGGAATTCCC GATACCATAG GTGGAATGCA GTAAATATGG GTCCTAAGAA AGATATTGTT GGAATGTGGT GCAGAGCTGC AAGAGAAAGA GGACTTCCAT TTGGTGTATC AGAACACTTG GGTGCAAGCT TTTCGTGGTT TGCACCAAGT AAAGGACATG ACACAAAAGG ACCATATAAG GATGTTCCAT ATGATGGAAA TGACCCTGCC TATGAGGATT TTTATCACCC AAATAGGGAC GAGTATGAGC TAGAAAAACA GTATGGAAAG ATTGTCAATT GGTACTCGCC AAACCCACAG TGGCATATGA AATGGTTTTT AAGAATAAAA GATTTGATCG ACCAGTATGA ACCAGACTTT TTGTATTCAG ATGGTGGTGT TCCATTTGGT GAGACTGGGT TGAGTATTGT TGCGCATCTT TACAATACAA GTGCCAAAAA TCACGGTGGT ATAAATCAAG CAGTGTACAC TCAAAAAGAC ACAAATCCAG AAGTCTATAA AATTGGTGTT TTGGACATTG AACGTGGATC AGCCGAAGAT ATTCTATCTC ATCCATGGCA GACAGATACA TGTGTAGGTG GCTGGTTTTA TGATGTAAGA GCTGTATATA AAACCCCGCA ACAGGTAATT GAAATGCTTA TTGATATTGT CAGCAAAGGT GGAAACCTTC TTTTAAACAT TCCGCAAAAA CCAGATGGCA CGTTGGATGA TGAATGTCTT TATATTCTTG ATGAGATTGC AAAATGGATG AAAGTAAACG GAGAAGGTAT ATATGCAACA CGTCCATGGA TAAGATATGG AGAAGGTCTG ACAAAAGCTC AAGGTGGAGC TTTTCAGGAG AAAAAACTTG AGTGGACGCA AGAGGATTTT AGATTTACAC AAAAAGATGG AAAAATTTTT GCCTTTCAAA TGAAGTATCC AGAAGATAAC AGAGCAATCA TCAAAAGTTT GGGACTATCA AGTGGTATTT TTGTGAAAAA GATAAAACTT TTAGGGTTTG AAGGTGAGCT TGAATTTGAA CAGTTAGAAA ACGCTTTGGT CATCAAATTG CCAGAAAAAT GCTATAGCAC AGGATATCCA CATTGTTTTT GCATAAAGTA A
|
Protein sequence | MDEYFKTIQK GPFEATWESL RQFKCPQWFL DAKFGIWAHW GPQSVPMYGD WYARNIYREG EPQYYYHWRK YGHPSKVGYK DIVQMWKAER FNPEELIDLY IKAGAKYFIA QAVHHDNFDN WNSRYHRWNA VNMGPKKDIV GMWCRAARER GLPFGVSEHL GASFSWFAPS KGHDTKGPYK DVPYDGNDPA YEDFYHPNRD EYELEKQYGK IVNWYSPNPQ WHMKWFLRIK DLIDQYEPDF LYSDGGVPFG ETGLSIVAHL YNTSAKNHGG INQAVYTQKD TNPEVYKIGV LDIERGSAED ILSHPWQTDT CVGGWFYDVR AVYKTPQQVI EMLIDIVSKG GNLLLNIPQK PDGTLDDECL YILDEIAKWM KVNGEGIYAT RPWIRYGEGL TKAQGGAFQE KKLEWTQEDF RFTQKDGKIF AFQMKYPEDN RAIIKSLGLS SGIFVKKIKL LGFEGELEFE QLENALVIKL PEKCYSTGYP HCFCIK
|
| |
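The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it starts with ATG even though the strand is "-") and uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. The function name `translate` and the constants are hypothetical helpers, not part of the record. Only the first and last stretches of the gene sequence are used here to keep the example short.

```python
# Codon lookup in NCBI order: bases T, C, A, G for each codon position.
# Amino acid assignments for translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record.
first_block = translate("ATGGATGAGT ATTTTAAAAC TATCCAAAAA")
last_block = translate("CATTGTTTTT GCATAAAGTA A")
print(first_block)
print(last_block)
```

The two fragments translate to the first ten residues (MDEYFKTIQK) and the last six residues (HCFCIK) of the protein sequence. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene begins with ATG.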