Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2579 |
Symbol | |
ID | 7409533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2715857 |
End bp | 2717587 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716943 |
Product | alpha amylase catalytic region |
Protein accession | YP_002574417 |
Protein GI | 222530535 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000456037 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGGTTA TCCACAATCA AGTTCATGAC ATTTTTGCTA TGAGCAAAGA TAGGTTTTAT GTAAAACTTT GGGTGCAAAA AGGATTTGCA AGAAGTGTCA GTTTGATATT CTCAGACAGG TATGATTTGG ATGTTCAGAA AGTGAAGATG GATTTTTACA TGAACGTTGG AAGTTTTGAA GTGTATACAG CGCATGTGCA AAACAAAACG CCAAGATTTG CGTACAAGTT TTTGATTGAG CTTTTGGATG GAAGTTTTAA AATATTTAAC CAGTTTGGAC TTGTAGATAC TGAAGAAAAC CTGTATTTTG ATTCTTTCCA GTTTCCTTAT GCGAATGAGG CAGATATTTT TGAAAAGCCG TCTTTTGCAG AAGGTTTGGT TGTCTACGAG ATTTTCCCAG ACAGGTTCAA AAGAGGCAAA AAAGAAGTGC ACAGCAAAAA ATTATTTGAT TGGGATTACT GCAGCTGGGA TGTACCAGGC TCTGAAGTTT TTCTTGGTGG TGATTTTGCA GGCATAAAAG AAAAAATAGA GTATTTTAAA ACTCTTGGAA TAAATGCCAT TTATCTTACA CCAATTTTTA AATCAACCTC TAGCCACCGC TACAATGTAG ATGACTACTT TGACGTTGAC CCGATTTTAG GTACAAAAGA GGAGTTCAAA GAACTTGTTG ACAGTCTTCA CGAAAATGGC ATCAGGATAA TACTTGACAT GGTTTTTAAC CACACAGGCG TTGGATTTTT TGCTTTTCAG GATGTTATAA AAAATGGAGA AAATTCAAAG TATTATAGCT GGTATAATAT AAGGTCTTTA CCTGTGGATA TCCAAAAAGG AAACTATGAG ACCTTTGCAA CAAATGTGAG GAGCATGCCA AGAATAAATA CTTCAAACAA AGATGTCCAG GACTTCTTTT TAGAAGTTTT AAAGTATTGG CTTTTGGAGT TTGACGTTGA CGGCTTTAGG TTTGATGTTG CAAACGAGCT TGACAAAAAC TTTATAAGAA GGATAAGAAA TGAGCTAAAA GCCATAAAAA AAGACATTCT TTTAATTGGT GAGGTAATGC ACAGGAGCGA AAATTTCCTT ATGGGAGACA TGTTTGATGG GGTGATGAAC TACTTTTCGT GGGAGGTTTT TGCAAGGTAT TTGATGGGTA AATATAATGC AGAGGATGCA TCAAGGATTT TGGCAGATTA CAGGCTAAAA TTTAATCCTA TACTTTTTTC GTGCCAGCTG AACCTCATTG GCAGCCACGA CACAGAAAGG GTTTTAACAA GACATGGAAA CAAAAAACTT GCAATGCTTG CTGCAGTGTA CAACCTTACC TATCAGGGAA TTCCTATGAT TTACTATGGT GATGAGATTG GAATGGAAGG CGGACATGAC CCCGACTGCA GAAGGGGGAT GATATGGGAA GAAGAAAAGC AGGACAAGGA GATTTTTAAG CTTTACAGAA GATTGATAGA TCTCAAAAAA ACATCTTCGG CTTTAAACAG TGACTATGTT AAAGAGTTTT CGATTGGTGA TGTGCTTTGC TTTGAAAGAA AAAGTGAAAG TGAAGCTGTG TACATTCTTT TTAATCCACG CAAAGCTTTG CAAAAAGTAA AGCTGTGGTC AGAGTTTATT GTTGACAGAG AAATTGAATT TTTCAGCACG CAGCAGAAAA TTAAAAACCA TTCATGCTAT ATTGAACTTG AACTAAATCC CGAGAGCTTT GAAATTGTAA TTGTTAAATA A
|
Protein sequence | MQVIHNQVHD IFAMSKDRFY VKLWVQKGFA RSVSLIFSDR YDLDVQKVKM DFYMNVGSFE VYTAHVQNKT PRFAYKFLIE LLDGSFKIFN QFGLVDTEEN LYFDSFQFPY ANEADIFEKP SFAEGLVVYE IFPDRFKRGK KEVHSKKLFD WDYCSWDVPG SEVFLGGDFA GIKEKIEYFK TLGINAIYLT PIFKSTSSHR YNVDDYFDVD PILGTKEEFK ELVDSLHENG IRIILDMVFN HTGVGFFAFQ DVIKNGENSK YYSWYNIRSL PVDIQKGNYE TFATNVRSMP RINTSNKDVQ DFFLEVLKYW LLEFDVDGFR FDVANELDKN FIRRIRNELK AIKKDILLIG EVMHRSENFL MGDMFDGVMN YFSWEVFARY LMGKYNAEDA SRILADYRLK FNPILFSCQL NLIGSHDTER VLTRHGNKKL AMLAAVYNLT YQGIPMIYYG DEIGMEGGHD PDCRRGMIWE EEKQDKEIFK LYRRLIDLKK TSSALNSDYV KEFSIGDVLC FERKSESEAV YILFNPRKAL QKVKLWSEFI VDREIEFFST QQKIKNHSCY IELELNPESF EIVIVK
|
| |