Gene Information |
Locus tag | Athe_1839 |
Symbol | |
ID | 7408953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1914071 |
End bp | 1915798 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643716216 |
Product | S-layer domain protein |
Protein accession | YP_002573705 |
Protein GI | 222529823 |
COG category | |
COG ID | |
TIGRFAM ID | |
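
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but this is the one that matches the listed gene length), and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1914071
end_bp = 1915798

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# and therefore is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields.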
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAT TTAAAAAGCG TCTTTTGCTG ATATGTGTAA TGTTGGTTTT TGCTATAGTA CAAATTTTTT CTGCAATTGC TTTTGCGCAA GGCACATCAA ATCCTATTTT TTCTGACCTT CCTCAAAATC ATTGGGCATA CAATGCAGTG AAATTCATGG TAGAAAGAGG AATTATAACA GGTTATCCAG ATAACACATT CAGACCAGAC AATCCAGTTA CAAGAGCTGA ATTTGCAAGG ATTATGGTAA TTAGCTTGAA CCTTCCAATC AAAGTGACAG ATAATCCATC CTTTAAAGAT GTTCCAAAAG ACCACTGGGC ATATCCACAT GTAGAGACTG CAAAATTTTA TTTGACAGGT TTTAGAACTC AGAATGGTGA CTACTTTAAG CCATCTGACT ATGCGGTAAG AGAGGATATG GCAGTTGCCC TTGTAAAAGC AAAGGGATTG CAGAATGAAA ATGTTGACCT GAGTATTTTA AGTAACTACA TTGATAAAGA CCAGATATCA AAGAATCTTG TTAAACATGT CGCAATTGCC ATTGCAAAGG GTATTATGGT GGGAAGCCCA GTTTCAAATT CTAATCAATA TAAGTTTGAC CCGCAAGGAA TTCTAACACG TGCACAGGCA GCAGTGCTGT TGTATAATGT TATTAATGCT CAATCAACTG AAGAAAAAGT TACCTATGAC GATTCTTCAT CAGGTTCTAA TCAGCAATAT ACTTATCCTG TACCCAATGT TACTGCCTAT ACAAAGGGGG ACAGAGTTGT CCTGATATGG AATAGAATAA ATGACAAAAA ACTGAAAGGA TATGCAGTTG TTATCTCAAA AAACAATAGC CAGCCGGGAT ATCCGCAAGA TGGTTATCTT ACAATCTTAT CTGATAGAAA TGCCAATTAT ATAGAAATTG GAGTAAACTC AAAATATAAC AATGGCGATT TTGGAGCTTA TATAAAGAGC GGAGAAGAGT ATTATTTCAG TGTTACAGCA ATCTATGAAG GAAATGTCTA CGTAAAAGGC AATGCTGTGA AAATGAGAAT GCCAGTTATA CCAAATTATT TTGAAAAACC ATCTGTTAAG TATGAATATA AAGACAATAA ATTTGTTTTA AGCTGGCAAA AGATAGACGA TTTCCGACTT ATAGGATATT GGATTGTGAT ATCCAAAAAG ACTAAAGAAC CTAAATATCC GGACAATGGT TATCTTGTTT TTATCAATGA CAAAAATACA ACTCAGATTA TTATTGACAA CACAATTCCT TACAAAAATG GAGATTTCGG TGAGTATTTA AAAGATGGTG AAGAATATTA TTTTAGTGTA ACAGCTCAGT ATCAGGACAG GGTTGTCCCT GGGAATAGCA TCAAGGCCAT TTATTATTCT AATTTGGAGA TTGCAAAATT AAGACCAAAG TTGCAGGCAA AAACAGTAAG ATGGAGGGGA CAGTGGTATA TAAACTTAAG ATGGGACAAA ATTGATAGCG ATAAGCTTCA AGGATATAAA GTTGTAGTAT CTGATAAAAA CTCAACACCC GACTTAAATA AAGATGGGTT ACTGGCAGTA ATATCAGATA AAAACGTTAC TTCTGTAAAT ATCAAAGCAA AAGATAAGTA TTTACTAAAT GGAGAATATA AGGAACTCAA AAGAGGACAT TACTATTACT TTACAGTTTA TGCTATTTAC TCTGATAGAG TAGTAGATGG CAATGTAATT AGAATAAAGA TACCATAA
|
Protein sequence | MKLFKKRLLL ICVMLVFAIV QIFSAIAFAQ GTSNPIFSDL PQNHWAYNAV KFMVERGIIT GYPDNTFRPD NPVTRAEFAR IMVISLNLPI KVTDNPSFKD VPKDHWAYPH VETAKFYLTG FRTQNGDYFK PSDYAVREDM AVALVKAKGL QNENVDLSIL SNYIDKDQIS KNLVKHVAIA IAKGIMVGSP VSNSNQYKFD PQGILTRAQA AVLLYNVINA QSTEEKVTYD DSSSGSNQQY TYPVPNVTAY TKGDRVVLIW NRINDKKLKG YAVVISKNNS QPGYPQDGYL TILSDRNANY IEIGVNSKYN NGDFGAYIKS GEEYYFSVTA IYEGNVYVKG NAVKMRMPVI PNYFEKPSVK YEYKDNKFVL SWQKIDDFRL IGYWIVISKK TKEPKYPDNG YLVFINDKNT TQIIIDNTIP YKNGDFGEYL KDGEEYYFSV TAQYQDRVVP GNSIKAIYYS NLEIAKLRPK LQAKTVRWRG QWYINLRWDK IDSDKLQGYK VVVSDKNSTP DLNKDGLLAV ISDKNVTSVN IKAKDKYLLN GEYKELKRGH YYYFTVYAIY SDRVVDGNVI RIKIP
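
Both sequences are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample string is only the first two blocks of the gene sequence above, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two 10-base blocks of the gene sequence in this record (illustrative sample).
sample = clean_sequence("ATGAAATTAT TTAAAAAGCG")
sample_gc = gc_fraction(sample)

print(len(sample), round(sample_gc, 2))
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the GC content listed in the Gene Information table.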