Gene Athe_1103 details


Gene Information

Locus tag: Athe_1103
Symbol:
ID: 7409660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1194918
End bp: 1196435
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 40%
IMG OID: 643715469
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_002572977
Protein GI: 222529095
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:
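
The start, end, gene length, and protein length fields above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1194918, 1196435)
print(length, protein_length(length))  # 1518 505
```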


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG CAAAAGTCAT CTACGATAAG GAGTTCGTAA TCGGGCAAGT AGACAAGAGA 
ATCTACGGTT CATTTTTAGA ACACATGGGA AGAGCAATAT ACACAGGAAT CTATGAACCA
GACCATCCGC AGGCTGATGA AATGGGGTTT AGAAAGGATG TTTTAGAACT TGTTCGCAAG
CTGAATGTTC CTATTGTAAG ATATCCTGGC GGCAATTTTG TGTCGGGGTA TAACTGGGAA
GACGGTATTG GTCCAAAAGA AAAAAGACCG AGAAGACTTG AGCTTGCGTG GAGAGCCATC
GAGACAAATG AGGTTGGTGT AAACGAATTT GTTGAATGGG CAAAAAGAGC AAACACCTCT
GTTATGATGA CAGTAAACCT TGGCACACGA GGAATTGACG CTGCAAGAAA CTTAGTTGAG
TATTGCAACT TCCCAGGCGG TACATACTAC AGTGATTTGA GACGTCAGCA TGGTTATCAG
CAGCCACACA ACATAAAAGT ATGGTGTCTT GGTAACGAGA TGGACGGGGA CTGGCAGATA
GGTCATAAAA CTGCATATGA GTATGGAAGG CTTGCAAGAG AGACAGCAAA GGTTATGAAG
TGGATAGATC CGAGTATTGA GCTTGTTGCA GCGGGAAGCT CAGGTCCCAA AATGCCAACA
TTTCCTGAGT GGGAAGCAAT TGTTTTGGAC CACACATATG ACCTTGTAGA TTATGTGTCG
CTACATGTAT ACTATGGAAA TCCTGAAAAA GACACAAAGA ATTTTGTTGC AAAATCGCTT
GAAATGGAAG AGTTTATCAA AACAGTTATA TCAACAATTG ACTATGTAAA GGCTAAAAAG
AGAAGCAAAA AGGTTGTCAA TATCTCATTT GACGAATGGA ATGTATGGTA CCATGCTCAT
CTTGAGGGGA AAGACCAGAA AGCAGAACCC TGGGCACAAG TTCGTGCTAT TGCTGAAGAA
GATTATGTGT TCGAAGATGC AATTTTGGTA GGATGCATGC TGATTGCGCT TTTGAAACAC
TGTGATAGAG TCAAGATGGC GTGCATGGCA CAGCTTGTAA ATGTAATTGC TCCAATTACC
ACTGTAAAAG GTGGAATTGC TTACAGACAG GTAATCTATT ATCCTTTCAT GCATGCTGCA
AACTTTGGAC ATGGAGTTGC ACTGCTTCCC AAGGTAAATT CTCCTAAATA TGATTCAAAA
GACTTTACTG ATGTTCCATA TATTGAAACA GTTGCAACAT ACAATGAGGA AAAGGATGAA
ATAACAATCT TTGCAGTCAA CAGAGATTTA GAAGAGGAGA TGCAAGTTGA GTTTAAGCTT
GATGGTTTTG AAGGCTTTGA GGTTGTGGAG CACATTGTAT ATGAAAGTGA TGATATTTAC
AAAGGAAACA CTCAAGATAA GCCTGACAAT GTTGTGCCCC ACAAAGGTGG AAATTCAAAG
ATAGAAGGCA ATGTTTTAAC ATCCATATTG CCCAAATTCT CCTGGAATGT TATCAGGTTA
AAGAAGAAAG AAAATTAA
 
Protein sequence
MKKAKVIYDK EFVIGQVDKR IYGSFLEHMG RAIYTGIYEP DHPQADEMGF RKDVLELVRK 
LNVPIVRYPG GNFVSGYNWE DGIGPKEKRP RRLELAWRAI ETNEVGVNEF VEWAKRANTS
VMMTVNLGTR GIDAARNLVE YCNFPGGTYY SDLRRQHGYQ QPHNIKVWCL GNEMDGDWQI
GHKTAYEYGR LARETAKVMK WIDPSIELVA AGSSGPKMPT FPEWEAIVLD HTYDLVDYVS
LHVYYGNPEK DTKNFVAKSL EMEEFIKTVI STIDYVKAKK RSKKVVNISF DEWNVWYHAH
LEGKDQKAEP WAQVRAIAEE DYVFEDAILV GCMLIALLKH CDRVKMACMA QLVNVIAPIT
TVKGGIAYRQ VIYYPFMHAA NFGHGVALLP KVNSPKYDSK DFTDVPYIET VATYNEEKDE
ITIFAVNRDL EEEMQVEFKL DGFEGFEVVE HIVYESDDIY KGNTQDKPDN VVPHKGGNSK
IEGNVLTSIL PKFSWNVIRL KKKEN
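
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the method the database uses. It builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons), strips the 10-base spacing used on this page, and stops at the first stop codon. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed above.
first_bases = "ATGAAAAAAG CAAAAGTCAT CTACGATAAG"
print(translate(first_bases))  # MKKAKVIYDK
```

Passing the full gene sequence to `translate` would be expected to reproduce the 505-residue protein sequence, and `gc_percent` a value close to the reported 40%. Note that this sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the first position; the gene here starts with ATG, so that case does not arise.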