Gene Athe_1855 details

Gene Information

Locus tag: Athe_1855
Symbol:
ID: 7408968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1933527
End bp: 1935488
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 41%
IMG OID: 643716227
Product: Pectate disaccharide-lyase
Protein accession: YP_002573716
Protein GI: 222529834
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
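
The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the database):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1933527
end_bp = 1935488

# An inclusive span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in whole codons; the last codon is the stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1962 653
```

Both values agree with the Gene Length and Protein Length fields listed for Athe_1855.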


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAACA GGAAGATTTT AGCCATTGTA GTCAGTTTGA TAATGGTTGT TTCATTGTTT 
ACAGGGATTG GGTTGCGTAA TGAAGTTGCA AAGGCAGCGA CACTTTTAAC AGATGATTTT
GAAGATGGCA ACAGAGATGG ATGGTCGACA TCGAACGGTA GTTGGAGTGT AGTAGTGGAT
GGGAGCAAGG TTTTAAAGCA GGCTAGCACA GGTTCTGAGG CGAGAGCATA TACTGGTTCA
TCTGATTGGA GTGATTATAC AGTTGAAGCG AAAGTTAAAG TATTAAATGT GAAGGATTCG
AGTTCAGGTG CGGGAGTGAT AGTGAGATAT AAAAACTCAG GTAACTTTTA TGCGTTGGTG
CTAAGGGGTT CAAAGATAGA AATAGGGAAG AAATTAAACA GTAACTGGAG TACATTGGCG
TTCAAGTCAT TTACGTTGGA TCAGGATACC TGGTATAATG TGAAATTAGA AGTAAATGGG
AGCAAGTTAG TTGGATATGT TAATGGGAGT CAAGTATTAA GTGCAAGTGA TTTATCGATT
ACGACAGGAA AAGCAGGTTT AATAGCTGAC AGGTGTGTTG CTGAATTTGA TGATGTTGTT
GTAAATTCAA GTGTGAGCGG TACAGCACCT ACTCCGACAC CAACACCGAC TTCATCAGTG
ACACCAACAC CGACATCGAC TCCAACGCCA ACCAAAACAC CTACTCCAAC TTCCACACCA
GTACCAACAC AGACCCCAGC AGTAACACCG ACGCCGACCC CAACGCCGAC GACAGTTCCA
ACACCTGCCC CGACACCTGT ACCTGGGGTG AATGCTATTT ATGTGGCACC AAGTGGGAGC
TCAGATAATC CTGGTACCAT TGATCGACCT ACTACATTAG AAAAAGCAAT CACGATAGTA
CAACCTGGGC AGATAATCTA CATGAGAGGT GGGACGTATA AGTATTCTGC GCAGATCACA
ATTGAAAGAA ATAATAGTGG TACAAGCAAT GCAAGAAAAT GTATTTATGC ATTTCCAAAT
GAAAGGCCAA TATTGGACTT TTCATCTCAA ACATATGGGA GTGTGGACTC AAATCCAAGA
GGATTACAGA TTAATGGGAA CTATTGGCAC ATAAAAGGAT TAGAAGTCAT GGGAGCTGCG
GACAACGGAA TCTTCGTAGG AGGCAGCTAC AATATAATTG AACAATGTGA AATTCATCAT
AACAGAGATT CAGGTTTGCA GATAAGCAGG TATATAAGTT CTGCAACCAG AGATGAGTGG
CCAAGTTATA ACTTGATATT GAATTGTACA TCACATGACA ATATGGATCC AGATAACGGT
GAAGATGCAG ATGGTTTTGC ATGCAAACTA ACAGCAGGAC CAGGAAATGT ATTCCGAGGT
TGTGTAGCGT ACTACAATGT TGATGATGGT TGGGATTTAT ACACAAAAAG TGAGACAGGA
GCTATTGGTG AAGTATTAAT TGAGGATTGT GTGGCATATG GTCACGGGCA AACATCAACC
GGGAGTGCCA CATCTAGCAG TGATGGAAAT GGCTTTAAGC TAGGAGGCAG TAATATAAAG
GTCAATCATA CAGTGAGAAG ATGTATAGCA TTTAATAACA ACAAACATGG ATTTACTTAT
AATAGTAATC CGGGTAGCAT AACAGTGGAA AATTGTACGG GCTATAATAA CGGTTTAAAG
GTAAGTGGAA GGAACTTTTA TTTTGAAGAA GGTACACACG TGTTGAAGAA TTGTTTATCC
TACAAAGAGA GTGCATCGAG TGATTTGGTA AGTGGAACGA TAATTAATTG TGTTTTGTGG
AGTAATAGGC AAGCAATAAA GCTAAATGGT CAACTGGTAA CCGATAATGA CTTTTACAGC
TTAACACCAA CCATAACAAG GAATAGTGAT GGGGGTTTAA ACTTAGGAGA CTTTTTAAAG
CCAAAGCCTG GTAGTGGTTT AGAAGGAATA GGAGCAAGGT AA
 
Protein sequence
MSNRKILAIV VSLIMVVSLF TGIGLRNEVA KAATLLTDDF EDGNRDGWST SNGSWSVVVD 
GSKVLKQAST GSEARAYTGS SDWSDYTVEA KVKVLNVKDS SSGAGVIVRY KNSGNFYALV
LRGSKIEIGK KLNSNWSTLA FKSFTLDQDT WYNVKLEVNG SKLVGYVNGS QVLSASDLSI
TTGKAGLIAD RCVAEFDDVV VNSSVSGTAP TPTPTPTSSV TPTPTSTPTP TKTPTPTSTP
VPTQTPAVTP TPTPTPTTVP TPAPTPVPGV NAIYVAPSGS SDNPGTIDRP TTLEKAITIV
QPGQIIYMRG GTYKYSAQIT IERNNSGTSN ARKCIYAFPN ERPILDFSSQ TYGSVDSNPR
GLQINGNYWH IKGLEVMGAA DNGIFVGGSY NIIEQCEIHH NRDSGLQISR YISSATRDEW
PSYNLILNCT SHDNMDPDNG EDADGFACKL TAGPGNVFRG CVAYYNVDDG WDLYTKSETG
AIGEVLIEDC VAYGHGQTST GSATSSSDGN GFKLGGSNIK VNHTVRRCIA FNNNKHGFTY
NSNPGSITVE NCTGYNNGLK VSGRNFYFEE GTHVLKNCLS YKESASSDLV SGTIINCVLW
SNRQAIKLNG QLVTDNDFYS LTPTITRNSD GGLNLGDFLK PKPGSGLEGI GAR
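
The protein sequence can be checked against the gene sequence by translating it. The sketch below is one way to do that in plain Python, not the database's own method. It assumes the sequence is pasted in the 10-base blocks shown above. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in which start codons it allows. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises ValueError if a base other than A, C, G, or T is encountered.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        c = dna[i:i + 3]
        idx = 16 * BASES.index(c[0]) + 4 * BASES.index(c[1]) + BASES.index(c[2])
        aa = AMINO[idx]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed in this record.
first_row = "ATGAGTAACA GGAAGATTTT AGCCATTGTA GTCAGTTTGA TAATGGTTGT TTCATTGTTT"
last_row = "CCAAAGCCTG GTAGTGGTTT AGAAGGAATA GGAGCAAGGT AA"

print(translate(first_row))  # MSNRKILAIVVSLIMVVSLF
print(translate(last_row))   # PKPGSGLEGIGAR
```

The two printed fragments match the start and end of the listed protein sequence. Calling `gc_fraction` on the full gene sequence gives a value to compare with the GC content field.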