Gene Athe_1573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1573 
Symbol 
ID7409082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1663095 
End bp1666262 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content37% 
IMG OID643715944 
Producthypothetical protein 
Protein accessionYP_002573442 
Protein GI222529560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA AAAATAGACA CAGATTTTTA TTTTTGGTTA TCTGTGTTAT TACTCAACTT 
TTAATAATGA AAACTGTGAG TGCACAAACA AAAGAATATT TATCGATTGA TATGTACAAG
CTTGCAGTGG AAAAGTTGTC TAAGATAGGT GTTGTATCAT CAAAAGATTA TCTCAAGCCA
AACAGTTACG TGACTCGTCA GGAGTTTGTT AGGGCTATTG CAAAAATTTC AAATGTTGAT
GAAAAAACTT TAGTTCAACA ACCATTTTCA TACTGTATTG ATGTAAAATC TAATACAGAA
CTTTGTGGTT ATGTGAATTG GGCCGTTAAA AATAAGTATT TGAATATTTC AATAGATGGG
AAATTTAGAC CATTTGAAGA AATTACATTT TCTCATGCAG TAACTGCAAT GGTAAAAATG
CTAAATTATA CTGATTCAGA TATAGAGGGT ATCTGGCCTT ATAACTATAT CAGAAAGGCT
TCTGAACTTG GACTGTTGAA AGGCTTAAAT GTGTCGGCTA AGAAAAAAGT TACAAAATAT
GATTTGGCCA TTATGCTATA TAGGCTGCTT GAAACAAATG TAAAAAATAC AAATACAAAA
TTTTCTGAAT ATGTTGGGCT TTACAAATCA TACGTTGTGC TTGATACTGG CAAGACATCA
TCAAAGCTTC TTCCAAATGA GGTTTTGACA GACAGCAGAG TGCTTGTGAA TGCAACAAAG
ACGCAACTTG AAGTTGGGAA AAAGTACATG CTTCAGGTTG ATACCAATAA AATCACGAAA
GTGTTTGGCA CTGAAGCTGA TTCTTTCCAG ATTGTCAGCA CAAAGGTAAG CAGCAGGACT
GTATATTACA AGGAAAGTGG AAAGACAAAA TCAATAACTT TGCCATCTTC GGCAACATAC
TATTACAACG GCTCAAAGCA GAGCTATGAT GCAGTAGAAA ATGTGCTAAA ACCGAACCAG
AAAATAAGCT TTATATATTC TGAAGATAGG AGCAAAATGG ACTTTGTTGT AATTTCCGAC
ATATATGCAC AAGAGATTTA TGGAAACTAC GATGAGGTTT TGATTTTGGC AACTCCAAAA
ACATCATCAG CACTGGATGC AAACCAGGTT CAGACAGACA AGGGAATATA CTTTGTTGCA
TCTTCAATAA AACCTGAAAA CCTTGAAATT GGAGCAAAGT ATGGAGTGTA TATAAAAGAT
GATACAATCA CTGCAGCTTT GCAAAAAGTG TGGGTATCAG AAAAGTTTAC AATTACAAAT
ATAGATGATT ACACACTTGA TGCTGCTCAA AACGGCAAAA CACAAAGGAT TCAGCTGACA
AGCAAACCTT TATATTACTA TCAGGGAACA AAACAGAGCT ATGAAAATTT ACCGAACATT
TTAAAAGAAG ACCAGATACT CTATGTATCA AAAGACCCTG ACACAGGCAA GGTTATGGCA
TATGTTATAC AAGACCCATA TGGTACACAG TATGGAAACT ATATTGAGGC AATAGTTTTG
CAGGATGCAC TTTTAAACTC AAGCCTTGAG AACAACCAGG TTGTGACTGA CAAAGGTATT
TTTTACCTTC CATCTGCCGA GACAAAACTT GAAATAGGTG CAAAGTACGG ACTTTATGTT
AAAGACGATA AGATAACGCT TGTTGTAAGG AAGTTGAATA ATACTCAGCA ATATGAGGTA
ACAGATGTTG TAGGTGATAC TAACGTCAAG CTAAAAGGTA CTCAGGGGCA GGAGAACATT
ATTCTACCTC AAAAGCCGGT TTATTATTAC AATGGTGCAA AAACAAACTA CCTTGAGCTC
AAGAATGTGC TAAAAGTGGG TCAGAAAATC TATTTTGGTT TTGCAAAGGA CGGCAAGACA
TACGAATATG TTGTAATTCA GGACCCATAC TCTTTTGAGT ATGGCACATA CACAGAGGTC
ATAGTGATGG CAGATAGCAT TTCGTCAAGC AAGCTTGCAA CAAACGAGGT TCTAACAGAC
AAGGGCATAT ATGTTGTGGG AAGATCTGCA GGAAAGCTTT CTGTTGGTGC AAAGTATGGT
GTGTACATCA AAAATGATAC AATCACCAAG GTTGTGAAAA AGCTCAACAA CGTGGAGTCA
GCAGAGATTA CAGCAGTTGT GAGCGCTACA TCCGTGAAGC TCAAAAAAAG CAGCACTGAG
AATGCGCTGC CTCTTCCTCA AAAACCTACG TACTATTACA ATGGAAATAA GCTCAGCTAT
GACCAGCTAA AGAGTATTCT CAAAACAGGT CAGAAAATCT ATTTTGGCTA CAACCAGGCA
GGAAATTCGT ATGAGTATGC AATCATCCAG GACCCATATT ATGATGAGTA TGGGACATAC
ATGGAAGTTA TTGTAATGGG AACAAGCAAG GTAACAAAAG GTCTTGCAGA CAATGAAGTT
CTAACAGACA AAGGCATTTT GACCCTGCCG TCAAACCAGA ATATAAACTT GGAGCTTGGC
GCAAAATATG GGCTTTATGT TGATTCAGAT AATCAAATAA CTCTTGTCTA CAAAAAATTC
AACTCAACAG ATAGCATGAC AGTTGTGTAT GCTCTTGGAA GCAAAGTAAC AGTTGACAAA
GGCGGAACAC AGGTTGAGAT GACCTTGCCA CAAAACATAA TCTACTACTA CAGCGGCACA
AAGATTGACT ACTCACAGGT GCTTCAGAAG ATGCAAAGGG CAACATCGCT TGTGTTTGGA
ATATCAACAC AAAAGAGCGG GTATGACTAC TGTGTAATAT TTGACCCGGT ATACAGCAAA
CCATACCTTG CAAATGAGCA GACATATCTG ACCTTAAAAG CAGGTGATTT GGATATAAGT
GGTAGTAGCA AAGTTATAAA AGATGGGGAT GTTGTGGATT ACAGCTATAT TCAGAAAAAC
GATGTTGTCT ATGCTGTGTC AGACATCTGG GGTGGTAATA AGTTCATACT TGTTGTAGAT
GACAGGGTTG AATGTTACAT CAAAAGCTAC CAGCCAACAA GGTTCACACC AAAGTCAATT
GTTGTAAGTG CAGTAGACCT GACAACAGGA AAGCTTGTGG ACAGAACATA TGAGGTGAGC
GAAGATTTTG ACCCATCTGT GCTTCTTTCT GACACCTTCA AAGTTGGTCA GAGAGTGTAC
CTCATTTTAG GGTACGATGG CAAGGTTGTG AGTATGGTGA AACCATAA
 
Protein sequence
MKYKNRHRFL FLVICVITQL LIMKTVSAQT KEYLSIDMYK LAVEKLSKIG VVSSKDYLKP 
NSYVTRQEFV RAIAKISNVD EKTLVQQPFS YCIDVKSNTE LCGYVNWAVK NKYLNISIDG
KFRPFEEITF SHAVTAMVKM LNYTDSDIEG IWPYNYIRKA SELGLLKGLN VSAKKKVTKY
DLAIMLYRLL ETNVKNTNTK FSEYVGLYKS YVVLDTGKTS SKLLPNEVLT DSRVLVNATK
TQLEVGKKYM LQVDTNKITK VFGTEADSFQ IVSTKVSSRT VYYKESGKTK SITLPSSATY
YYNGSKQSYD AVENVLKPNQ KISFIYSEDR SKMDFVVISD IYAQEIYGNY DEVLILATPK
TSSALDANQV QTDKGIYFVA SSIKPENLEI GAKYGVYIKD DTITAALQKV WVSEKFTITN
IDDYTLDAAQ NGKTQRIQLT SKPLYYYQGT KQSYENLPNI LKEDQILYVS KDPDTGKVMA
YVIQDPYGTQ YGNYIEAIVL QDALLNSSLE NNQVVTDKGI FYLPSAETKL EIGAKYGLYV
KDDKITLVVR KLNNTQQYEV TDVVGDTNVK LKGTQGQENI ILPQKPVYYY NGAKTNYLEL
KNVLKVGQKI YFGFAKDGKT YEYVVIQDPY SFEYGTYTEV IVMADSISSS KLATNEVLTD
KGIYVVGRSA GKLSVGAKYG VYIKNDTITK VVKKLNNVES AEITAVVSAT SVKLKKSSTE
NALPLPQKPT YYYNGNKLSY DQLKSILKTG QKIYFGYNQA GNSYEYAIIQ DPYYDEYGTY
MEVIVMGTSK VTKGLADNEV LTDKGILTLP SNQNINLELG AKYGLYVDSD NQITLVYKKF
NSTDSMTVVY ALGSKVTVDK GGTQVEMTLP QNIIYYYSGT KIDYSQVLQK MQRATSLVFG
ISTQKSGYDY CVIFDPVYSK PYLANEQTYL TLKAGDLDIS GSSKVIKDGD VVDYSYIQKN
DVVYAVSDIW GGNKFILVVD DRVECYIKSY QPTRFTPKSI VVSAVDLTTG KLVDRTYEVS
EDFDPSVLLS DTFKVGQRVY LILGYDGKVV SMVKP