Gene Athe_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2072 
Symbol 
ID7408781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2188823 
End bp2190283 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content38% 
IMG OID643716439 
ProductAlpha-L-fucosidase 
Protein accessionYP_002573922 
Protein GI222530040 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAGT ATTTTAAAAC TATCCAAAAA GGTCCCTTTG AAGCAACTTG GGAATCTTTA 
AGACAATTCA AATGTCCTCA GTGGTTTTTA GATGCCAAGT TTGGTATTTG GGCGCACTGG
GGACCACAGT CTGTACCAAT GTATGGGGAC TGGTATGCAA GAAATATCTA CAGAGAAGGA
GAACCACAGT ATTACTACCA TTGGCGAAAG TATGGGCATC CTTCAAAGGT TGGTTACAAA
GACATTGTTC AAATGTGGAA GGCTGAAAGG TTTAACCCAG AGGAATTAAT TGATCTGTAT
ATAAAAGCAG GTGCAAAATA TTTTATTGCT CAGGCTGTTC ACCACGATAA CTTTGATAAC
TGGAATTCCC GATACCATAG GTGGAATGCA GTAAATATGG GTCCTAAGAA AGATATTGTT
GGAATGTGGT GCAGAGCTGC AAGAGAAAGA GGACTTCCAT TTGGTGTATC AGAACACTTG
GGTGCAAGCT TTTCGTGGTT TGCACCAAGT AAAGGACATG ACACAAAAGG ACCATATAAG
GATGTTCCAT ATGATGGAAA TGACCCTGCC TATGAGGATT TTTATCACCC AAATAGGGAC
GAGTATGAGC TAGAAAAACA GTATGGAAAG ATTGTCAATT GGTACTCGCC AAACCCACAG
TGGCATATGA AATGGTTTTT AAGAATAAAA GATTTGATCG ACCAGTATGA ACCAGACTTT
TTGTATTCAG ATGGTGGTGT TCCATTTGGT GAGACTGGGT TGAGTATTGT TGCGCATCTT
TACAATACAA GTGCCAAAAA TCACGGTGGT ATAAATCAAG CAGTGTACAC TCAAAAAGAC
ACAAATCCAG AAGTCTATAA AATTGGTGTT TTGGACATTG AACGTGGATC AGCCGAAGAT
ATTCTATCTC ATCCATGGCA GACAGATACA TGTGTAGGTG GCTGGTTTTA TGATGTAAGA
GCTGTATATA AAACCCCGCA ACAGGTAATT GAAATGCTTA TTGATATTGT CAGCAAAGGT
GGAAACCTTC TTTTAAACAT TCCGCAAAAA CCAGATGGCA CGTTGGATGA TGAATGTCTT
TATATTCTTG ATGAGATTGC AAAATGGATG AAAGTAAACG GAGAAGGTAT ATATGCAACA
CGTCCATGGA TAAGATATGG AGAAGGTCTG ACAAAAGCTC AAGGTGGAGC TTTTCAGGAG
AAAAAACTTG AGTGGACGCA AGAGGATTTT AGATTTACAC AAAAAGATGG AAAAATTTTT
GCCTTTCAAA TGAAGTATCC AGAAGATAAC AGAGCAATCA TCAAAAGTTT GGGACTATCA
AGTGGTATTT TTGTGAAAAA GATAAAACTT TTAGGGTTTG AAGGTGAGCT TGAATTTGAA
CAGTTAGAAA ACGCTTTGGT CATCAAATTG CCAGAAAAAT GCTATAGCAC AGGATATCCA
CATTGTTTTT GCATAAAGTA A
 
Protein sequence
MDEYFKTIQK GPFEATWESL RQFKCPQWFL DAKFGIWAHW GPQSVPMYGD WYARNIYREG 
EPQYYYHWRK YGHPSKVGYK DIVQMWKAER FNPEELIDLY IKAGAKYFIA QAVHHDNFDN
WNSRYHRWNA VNMGPKKDIV GMWCRAARER GLPFGVSEHL GASFSWFAPS KGHDTKGPYK
DVPYDGNDPA YEDFYHPNRD EYELEKQYGK IVNWYSPNPQ WHMKWFLRIK DLIDQYEPDF
LYSDGGVPFG ETGLSIVAHL YNTSAKNHGG INQAVYTQKD TNPEVYKIGV LDIERGSAED
ILSHPWQTDT CVGGWFYDVR AVYKTPQQVI EMLIDIVSKG GNLLLNIPQK PDGTLDDECL
YILDEIAKWM KVNGEGIYAT RPWIRYGEGL TKAQGGAFQE KKLEWTQEDF RFTQKDGKIF
AFQMKYPEDN RAIIKSLGLS SGIFVKKIKL LGFEGELEFE QLENALVIKL PEKCYSTGYP
HCFCIK