Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0009 |
Symbol | |
ID | 7407244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 9187 |
End bp | 11097 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643714423 |
Product | secreted protein |
Protein accession | YP_002571948 |
Protein GI | 222528066 |
COG category | [R] General function prediction only |
COG ID | [COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00303672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AACGTTTTTT AGGAATGATG GTAGCTTTAT TTATAATATT ATCTTCTGGT ATCAATTTTG TCATGTCAGA CCAGTCGAAA CCAAAAAGCT TTCTACAAAA AGTAGGAACT TTTAGCAACT TTAAAAAACT TGTTGATGAA GCTCTGAAAA AAAATCCATA CATGGGATCA GGGGAAAATA TAATGTATGA ATCAGCACCA GATTTTGTTA TAAAGGGTGA AAATAAAGAG GGTTCTGAGT CTTTTTATTC ACAGACAAAT GTGCAGGTTA TGGGCGTTGA TGAAGCTGAT ATTGCAAAGA CAGACGGGCA GTATATATAC ATTGCAAAAC CATATCCAAA GATAAAAGAT AATGGAATTG TAATTGTTAA AGCTTATCCA CCTGAGGAAA TGAAAGTTTT ATCTAAAATT AAATTGAGCG ATGAGTTCTA TCCAGAAGAA TTTTATGTTG ATGGCAGGTA TCTTGTTGTT ATTTGTGAAA AAGAAAAGAT TGTAGATAGG GAACCTGTTA TTCAAAAAAA TAATTTTCCT GATATCGATA CAAAGAAAAT AAAAGTTTAT GTGCCATATC AGTTATTGAT AGAGACGTAT TGTATTGTTT ATGACATTTC TGACAAATCA AATCCTAAGG AAATAAGAAG GGTTTCAATT TCGGGAAGAT ATCTGACATC AAGGAAAATA GGAACAAGTC TTTACATTGT GATAGAAAAG CAATTCCCAT ATAGAATTTA CGCTAATAAA TCCTACTCTG AGCAAGATTT CAAACCATAC TTTTCAGATA GTATTACAGG TAGTTCAAAA AAGATATATA TAAACTTTGA CAGAATAAAG TATATTCCAG ATTTTATAAA CTGTAGCATT ACTGTTATTG GAAGTTTCGA TATTGAGTCA AAAGATAAAA TTTGTGTTGA GTGTGTGCTT GGTGGAGGGA ATATAGTTTA CTGTTCGCAA GAAAATCTTT ATTTGTGCTC TGAAGTAATA AAAAAAGTTT ACTGGCCTGA AAAATGGGAA GATAATACAA GACCATGGTA TAGTTATGTA AGAAAAACAA TGATTAGCAG ATTTGAACTT TCAAAAGGGA AAATTAATCT TGAGGCAGCA AGTGTTGTCA GTGGAAAAGT GTTGAATCAA TTTTCAATGG ATGAACAAGA TGGTTATTTC AGAATTGCAA CAACTGGTGA AAGGCTTTAT TTTCCAGAAA AAAATTACGA TTATTTTAAT GCTGTGTATG TTCTTGATAA AAACTTGAGA GTAGTTGGGA AGATAGATAA TATTGCAAAA GGAGAAAGAA TATATTCAGC AAGATTTATT GGAAAAAGAC TATATCTTGT TACCTTCAAA GAGCTTGACC CATTTTTTGT AATTGACCTT GAAGACCCTC ATAATCCAAA AGTGCTTGGG TATCTTAAAA TTCCAGGATA CTCAACATAT CTTCATCCAT ATGATGAAAA TCACATAATA GGATTTGGCA GGGATGCAGA GGATTTAAAT GAAGAATATG CAATTCCTTT AGGACTGAAA ATTGCAATGT TCAATGTTGA GGATGTAAAA AATCCAAAGG AGCTGTTCAA AATCATAATT GGGGGCAAGG GTACTTATTC TGAACTTCTT AATAATCACA AAGCGCTTTT GTTTGATAAA AGTAAAAACA TTTTTGCATT CCCTGTAGAG GTTTACGACA AAAAAGGGCA TAACTTCACA GGTGCTTTTG TATATAGCAT AGATTTGAAA GAAGGTTTTG TTTTGAGGGG CAAGATTTTG CATGAAATTG GTGATGGATA TTGTGAGGAG ATAGACAGGC TTTTGTATAT TGGTGATGTG CTCTACTCAG TTTCAAACTC AATGATAAAA GCAAGCTCTC TTGAAAGCTT CAAGGAGATA GCAAGGTTGA GGTTGGATTG A
|
Protein sequence | MMKKRFLGMM VALFIILSSG INFVMSDQSK PKSFLQKVGT FSNFKKLVDE ALKKNPYMGS GENIMYESAP DFVIKGENKE GSESFYSQTN VQVMGVDEAD IAKTDGQYIY IAKPYPKIKD NGIVIVKAYP PEEMKVLSKI KLSDEFYPEE FYVDGRYLVV ICEKEKIVDR EPVIQKNNFP DIDTKKIKVY VPYQLLIETY CIVYDISDKS NPKEIRRVSI SGRYLTSRKI GTSLYIVIEK QFPYRIYANK SYSEQDFKPY FSDSITGSSK KIYINFDRIK YIPDFINCSI TVIGSFDIES KDKICVECVL GGGNIVYCSQ ENLYLCSEVI KKVYWPEKWE DNTRPWYSYV RKTMISRFEL SKGKINLEAA SVVSGKVLNQ FSMDEQDGYF RIATTGERLY FPEKNYDYFN AVYVLDKNLR VVGKIDNIAK GERIYSARFI GKRLYLVTFK ELDPFFVIDL EDPHNPKVLG YLKIPGYSTY LHPYDENHII GFGRDAEDLN EEYAIPLGLK IAMFNVEDVK NPKELFKIII GGKGTYSELL NNHKALLFDK SKNIFAFPVE VYDKKGHNFT GAFVYSIDLK EGFVLRGKIL HEIGDGYCEE IDRLLYIGDV LYSVSNSMIK ASSLESFKEI ARLRLD
|
| |