Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1855 |
Symbol | |
ID | 7408968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1933527 |
End bp | 1935488 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716227 |
Product | Pectate disaccharide-lyase |
Protein accession | YP_002573716 |
Protein GI | 222529834 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAACA GGAAGATTTT AGCCATTGTA GTCAGTTTGA TAATGGTTGT TTCATTGTTT ACAGGGATTG GGTTGCGTAA TGAAGTTGCA AAGGCAGCGA CACTTTTAAC AGATGATTTT GAAGATGGCA ACAGAGATGG ATGGTCGACA TCGAACGGTA GTTGGAGTGT AGTAGTGGAT GGGAGCAAGG TTTTAAAGCA GGCTAGCACA GGTTCTGAGG CGAGAGCATA TACTGGTTCA TCTGATTGGA GTGATTATAC AGTTGAAGCG AAAGTTAAAG TATTAAATGT GAAGGATTCG AGTTCAGGTG CGGGAGTGAT AGTGAGATAT AAAAACTCAG GTAACTTTTA TGCGTTGGTG CTAAGGGGTT CAAAGATAGA AATAGGGAAG AAATTAAACA GTAACTGGAG TACATTGGCG TTCAAGTCAT TTACGTTGGA TCAGGATACC TGGTATAATG TGAAATTAGA AGTAAATGGG AGCAAGTTAG TTGGATATGT TAATGGGAGT CAAGTATTAA GTGCAAGTGA TTTATCGATT ACGACAGGAA AAGCAGGTTT AATAGCTGAC AGGTGTGTTG CTGAATTTGA TGATGTTGTT GTAAATTCAA GTGTGAGCGG TACAGCACCT ACTCCGACAC CAACACCGAC TTCATCAGTG ACACCAACAC CGACATCGAC TCCAACGCCA ACCAAAACAC CTACTCCAAC TTCCACACCA GTACCAACAC AGACCCCAGC AGTAACACCG ACGCCGACCC CAACGCCGAC GACAGTTCCA ACACCTGCCC CGACACCTGT ACCTGGGGTG AATGCTATTT ATGTGGCACC AAGTGGGAGC TCAGATAATC CTGGTACCAT TGATCGACCT ACTACATTAG AAAAAGCAAT CACGATAGTA CAACCTGGGC AGATAATCTA CATGAGAGGT GGGACGTATA AGTATTCTGC GCAGATCACA ATTGAAAGAA ATAATAGTGG TACAAGCAAT GCAAGAAAAT GTATTTATGC ATTTCCAAAT GAAAGGCCAA TATTGGACTT TTCATCTCAA ACATATGGGA GTGTGGACTC AAATCCAAGA GGATTACAGA TTAATGGGAA CTATTGGCAC ATAAAAGGAT TAGAAGTCAT GGGAGCTGCG GACAACGGAA TCTTCGTAGG AGGCAGCTAC AATATAATTG AACAATGTGA AATTCATCAT AACAGAGATT CAGGTTTGCA GATAAGCAGG TATATAAGTT CTGCAACCAG AGATGAGTGG CCAAGTTATA ACTTGATATT GAATTGTACA TCACATGACA ATATGGATCC AGATAACGGT GAAGATGCAG ATGGTTTTGC ATGCAAACTA ACAGCAGGAC CAGGAAATGT ATTCCGAGGT TGTGTAGCGT ACTACAATGT TGATGATGGT TGGGATTTAT ACACAAAAAG TGAGACAGGA GCTATTGGTG AAGTATTAAT TGAGGATTGT GTGGCATATG GTCACGGGCA AACATCAACC GGGAGTGCCA CATCTAGCAG TGATGGAAAT GGCTTTAAGC TAGGAGGCAG TAATATAAAG GTCAATCATA CAGTGAGAAG ATGTATAGCA TTTAATAACA ACAAACATGG ATTTACTTAT AATAGTAATC CGGGTAGCAT AACAGTGGAA AATTGTACGG GCTATAATAA CGGTTTAAAG GTAAGTGGAA GGAACTTTTA TTTTGAAGAA GGTACACACG TGTTGAAGAA TTGTTTATCC TACAAAGAGA GTGCATCGAG TGATTTGGTA AGTGGAACGA TAATTAATTG TGTTTTGTGG AGTAATAGGC AAGCAATAAA GCTAAATGGT CAACTGGTAA CCGATAATGA CTTTTACAGC TTAACACCAA CCATAACAAG GAATAGTGAT GGGGGTTTAA ACTTAGGAGA CTTTTTAAAG CCAAAGCCTG GTAGTGGTTT AGAAGGAATA GGAGCAAGGT AA
|
Protein sequence | MSNRKILAIV VSLIMVVSLF TGIGLRNEVA KAATLLTDDF EDGNRDGWST SNGSWSVVVD GSKVLKQAST GSEARAYTGS SDWSDYTVEA KVKVLNVKDS SSGAGVIVRY KNSGNFYALV LRGSKIEIGK KLNSNWSTLA FKSFTLDQDT WYNVKLEVNG SKLVGYVNGS QVLSASDLSI TTGKAGLIAD RCVAEFDDVV VNSSVSGTAP TPTPTPTSSV TPTPTSTPTP TKTPTPTSTP VPTQTPAVTP TPTPTPTTVP TPAPTPVPGV NAIYVAPSGS SDNPGTIDRP TTLEKAITIV QPGQIIYMRG GTYKYSAQIT IERNNSGTSN ARKCIYAFPN ERPILDFSSQ TYGSVDSNPR GLQINGNYWH IKGLEVMGAA DNGIFVGGSY NIIEQCEIHH NRDSGLQISR YISSATRDEW PSYNLILNCT SHDNMDPDNG EDADGFACKL TAGPGNVFRG CVAYYNVDDG WDLYTKSETG AIGEVLIEDC VAYGHGQTST GSATSSSDGN GFKLGGSNIK VNHTVRRCIA FNNNKHGFTY NSNPGSITVE NCTGYNNGLK VSGRNFYFEE GTHVLKNCLS YKESASSDLV SGTIINCVLW SNRQAIKLNG QLVTDNDFYS LTPTITRNSD GGLNLGDFLK PKPGSGLEGI GAR
|
| |