Gene Athe_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2579 
Symbol 
ID7409533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2715857 
End bp2717587 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content36% 
IMG OID643716943 
Productalpha amylase catalytic region 
Protein accessionYP_002574417 
Protein GI222530535 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000456037 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAGGTTA TCCACAATCA AGTTCATGAC ATTTTTGCTA TGAGCAAAGA TAGGTTTTAT 
GTAAAACTTT GGGTGCAAAA AGGATTTGCA AGAAGTGTCA GTTTGATATT CTCAGACAGG
TATGATTTGG ATGTTCAGAA AGTGAAGATG GATTTTTACA TGAACGTTGG AAGTTTTGAA
GTGTATACAG CGCATGTGCA AAACAAAACG CCAAGATTTG CGTACAAGTT TTTGATTGAG
CTTTTGGATG GAAGTTTTAA AATATTTAAC CAGTTTGGAC TTGTAGATAC TGAAGAAAAC
CTGTATTTTG ATTCTTTCCA GTTTCCTTAT GCGAATGAGG CAGATATTTT TGAAAAGCCG
TCTTTTGCAG AAGGTTTGGT TGTCTACGAG ATTTTCCCAG ACAGGTTCAA AAGAGGCAAA
AAAGAAGTGC ACAGCAAAAA ATTATTTGAT TGGGATTACT GCAGCTGGGA TGTACCAGGC
TCTGAAGTTT TTCTTGGTGG TGATTTTGCA GGCATAAAAG AAAAAATAGA GTATTTTAAA
ACTCTTGGAA TAAATGCCAT TTATCTTACA CCAATTTTTA AATCAACCTC TAGCCACCGC
TACAATGTAG ATGACTACTT TGACGTTGAC CCGATTTTAG GTACAAAAGA GGAGTTCAAA
GAACTTGTTG ACAGTCTTCA CGAAAATGGC ATCAGGATAA TACTTGACAT GGTTTTTAAC
CACACAGGCG TTGGATTTTT TGCTTTTCAG GATGTTATAA AAAATGGAGA AAATTCAAAG
TATTATAGCT GGTATAATAT AAGGTCTTTA CCTGTGGATA TCCAAAAAGG AAACTATGAG
ACCTTTGCAA CAAATGTGAG GAGCATGCCA AGAATAAATA CTTCAAACAA AGATGTCCAG
GACTTCTTTT TAGAAGTTTT AAAGTATTGG CTTTTGGAGT TTGACGTTGA CGGCTTTAGG
TTTGATGTTG CAAACGAGCT TGACAAAAAC TTTATAAGAA GGATAAGAAA TGAGCTAAAA
GCCATAAAAA AAGACATTCT TTTAATTGGT GAGGTAATGC ACAGGAGCGA AAATTTCCTT
ATGGGAGACA TGTTTGATGG GGTGATGAAC TACTTTTCGT GGGAGGTTTT TGCAAGGTAT
TTGATGGGTA AATATAATGC AGAGGATGCA TCAAGGATTT TGGCAGATTA CAGGCTAAAA
TTTAATCCTA TACTTTTTTC GTGCCAGCTG AACCTCATTG GCAGCCACGA CACAGAAAGG
GTTTTAACAA GACATGGAAA CAAAAAACTT GCAATGCTTG CTGCAGTGTA CAACCTTACC
TATCAGGGAA TTCCTATGAT TTACTATGGT GATGAGATTG GAATGGAAGG CGGACATGAC
CCCGACTGCA GAAGGGGGAT GATATGGGAA GAAGAAAAGC AGGACAAGGA GATTTTTAAG
CTTTACAGAA GATTGATAGA TCTCAAAAAA ACATCTTCGG CTTTAAACAG TGACTATGTT
AAAGAGTTTT CGATTGGTGA TGTGCTTTGC TTTGAAAGAA AAAGTGAAAG TGAAGCTGTG
TACATTCTTT TTAATCCACG CAAAGCTTTG CAAAAAGTAA AGCTGTGGTC AGAGTTTATT
GTTGACAGAG AAATTGAATT TTTCAGCACG CAGCAGAAAA TTAAAAACCA TTCATGCTAT
ATTGAACTTG AACTAAATCC CGAGAGCTTT GAAATTGTAA TTGTTAAATA A
 
Protein sequence
MQVIHNQVHD IFAMSKDRFY VKLWVQKGFA RSVSLIFSDR YDLDVQKVKM DFYMNVGSFE 
VYTAHVQNKT PRFAYKFLIE LLDGSFKIFN QFGLVDTEEN LYFDSFQFPY ANEADIFEKP
SFAEGLVVYE IFPDRFKRGK KEVHSKKLFD WDYCSWDVPG SEVFLGGDFA GIKEKIEYFK
TLGINAIYLT PIFKSTSSHR YNVDDYFDVD PILGTKEEFK ELVDSLHENG IRIILDMVFN
HTGVGFFAFQ DVIKNGENSK YYSWYNIRSL PVDIQKGNYE TFATNVRSMP RINTSNKDVQ
DFFLEVLKYW LLEFDVDGFR FDVANELDKN FIRRIRNELK AIKKDILLIG EVMHRSENFL
MGDMFDGVMN YFSWEVFARY LMGKYNAEDA SRILADYRLK FNPILFSCQL NLIGSHDTER
VLTRHGNKKL AMLAAVYNLT YQGIPMIYYG DEIGMEGGHD PDCRRGMIWE EEKQDKEIFK
LYRRLIDLKK TSSALNSDYV KEFSIGDVLC FERKSESEAV YILFNPRKAL QKVKLWSEFI
VDREIEFFST QQKIKNHSCY IELELNPESF EIVIVK