Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2303 |
Symbol | |
ID | 7407722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2436246 |
End bp | 2439302 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716667 |
Product | S-layer domain protein |
Protein accession | YP_002574146 |
Protein GI | 222530264 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TCAAAAGACT CATCGCTATC GTAACAGTTC TCTTATTTGC TCTTTCAATA ATTGCGCCTG TGTTTGCACA GGACGAAGCT ACTACAGAAG AGACAGCTGG TTCTGTGTAT GACCAGGCAG CAAAGATTCT CCAAGACAAA GGTATCTTGA AAGGTAATGA GCAAGGCGAC CTGATGCTCG ACAAGCAACT TACAAGAGCA GAAATCTTGG CAATGATTAT CAGAGCAACA GGTCAAGAAG ATGTGGTAAA AGACTATGTT TATGCTGAGC AGTCATTCAC AGATGTTCCA CAGGATCATT GGGCATTTGC ATATGTTGAG GCAGGTAAAG ACCTCGGCAT TGTAAATGGT TACCCAGATG GAACATTCAA GCCAGACAAA CCTGTCAAGT TTGAAGAGCT CTGCAAGATG CTTGTTGCTG CAAAAGGTGA AAGCCCAGCA GCAGGAAAAT GGCCTCTCAA CTATGTAAGA AAAGCTCTTG AACTTGGGTT CTTCAATGGT ATTGAAGATG AAGTTGGAAT TGGTGATGTT GTAATCAGAG GTCAAGCAGC TGTAGCATTC GCAAATGCAT TCTTCCCACC AGAAAAGACT ATAGTTGTTA AAGATGTTAA AGCAGTTGCA AATGACACAA TTGAAGTTTA TGTTGATGCA TATCTTGGAA ATGAGCCTGC AACACTTGAA GATGGCGATG TAATTCCTCT TGACTTTGAG ATCAAGGATG CATCTGATGC ATCAAAGACA ATTGCGGTAA CATTAATTGA TTCTCAGGCA TCTGACTTTG GAGCAGGAAA GCTTGTGCTC AAGACAGCAG CGCAAACAGA AGGTGCTACT TATAAACTGT ATTACAAGGG TAATGATACA GGCAAAACTT TTGTAGCAGT ACCAGTACAG CTTCAAGTTG CAAAAGTTGA AGTACCAAAC TTGAAGCAAG TTGTTGTAAC GTTCAATAGA GATGTAAAGG ACGTTTATGC TGTTGATAAA AATAACTATG AAATTAAAGT AGGAGATGCA ACAAAGTCAA TTGGTGCAGT GCGACTGAGC GAAGATAAGA AGAAGGTTAC ACTAATTTTG AAAGACGATT TAGCAAACCA GGACAAAGTT AAAGTTACAC TCAAAACAGG TTTGGGTCTT AAGGAGGCAT ATACAACTGA AATTGGCCCT GTACAAGATG GAACAGCTCC GGCTATAGTA AAGGTTACAG CTGAGAACCC GCAGAAGCTT AGAGTTGAGT TCAGTGAACC AGTTAAAAAC TATGCAAATC CTGCAAACTA TACATTGAAC GGAATGTACC TCGTAAAAGA GGTTAAGGGA TTCAATGAAG ACAGTTCTAT AGACCCAAAT GTAATTGTTC TGAATCTTTA CATTCCTCTG AATGTTGGTA ATAACACATT GAGTGTTACA GGTGTTACAG ATATAGCTGG ATTGCTTGTA GTAAATCCAT CAATGAGCTT TAGTGTTGCA GAAGACAAGT CACCTATTGA GTTGAAGAAT GTCACAGCAA CACTTAGCCA GGTTAAGCTT GAATTTAGCA AAGCAATACA GAGCATTGTA AGTGTATCGT TGTCAAATGG TATTATAGCA GGTACATCAG TAGATGGTAA CGTAGTAACA ATATCTTCTG GTGATGAGTT GGCAACTTGT ATACCAGTAT CTGGTGCTAA GGTTACAATA GAAGTTAAGG ATTATACTGG GCAGACAGCA AAATTTGAAA AATACGTTGT TCCAACAATT GACACAGAAA GACCTACTGT TAAATCAGTT ACAATAGCTG ATTCTACTAC AGTAAAGGTA ACATTTAGTG AAGATGTACG TGTTCCTAAT CCTTCTGACA ACAAGATTAT TGTGAAAGAT AAAGATGGTA ACCAAAAAGT TATCAGTGTA ATTACCTGGG ATACAGACAG TAGTGGTAAT CAGATTAAGA ATACATTGAA AGTTGTATTG TCTTCACAAC TGCCAGCAGG TGTTGTAACA GTTCAAGTAA GTGGCATTGA AGACCTCACA CCACTTAAGA ATGCAAGCTT GCCGCAGACA GTTACAGCTA CACTTTCTGA TACAAGTGCA CCGGATGTAG TTGCTCCAAT AGTATACAAT GATGTTTCTG GCGACACAAC AGAACTCTAT ATAACATTTA ATAAGACATT GAATGCAGCA AGCGCTAATA TAGCTACAAA TTATAAGTAT CTGGATAGCA GTTATGTACT TAAGGATTTC AGTGGTGCAA CTGCAAGCGT GTTGGCAAAT GGTAAAACTG TGAAACTTGT AATTAAAGAT TCTGAATTTG CTAATATGAC ATACTTGCAG ATAATTGGTG TTGCAGATAC AAATGGTAAT AAGGCAACAT TGGCAATTCA GAAAGATGCT ACTAAATTTG TATCTTCATC CACTGCAATT GTTACTATTG ACTCAACTAA CGGTGTTCAA GCTGTTTCAA CAACACAGTT AAAAGTATTC TTAACAGGCA ATATTAATGA ATATACACTT TATGCTGGGG ATTTTGAGGT CAAAGCAGGG TCAAATACAA TAGGAGTTCT TTATGCGACA TGGGATGCAG GTAGCAAAGC TGTTGTTTTG AACCTTGCAA CAGCAATTGG TGCTGATGCT AAGAAAGATG GAAATGGTGT GACTGTAACT ATCAAGGCTA ACTCAATTAC AAAAGACCTG CTTGGAAGAT CAATTAATAG TGGCAGTGCA GTAGGACCAG TGACTGCAAG TGATAAGATT GCACCAACAA TTACTGCAGT TGAAGCTGTA TACAATGCAA CATATAATGT TACAGAGGTT ACAGTTAAAT TCAGTGAAGC AATTGTTGTA GATTCAACAT ATGTTCTTGA CCAATTCAAG GTGTATATCG GAGGAGCAAT AACAAATCCT GATGCAGGCG TTGATGTTAA GACCGACAAT ATAGTCTTCA AGTTCAGTGG TGACAAGAGA TACAGCACAA TTAAGGTAGA CTATGTACCA GCATATGACA CAAGTAAAAG AGTAAAAGAT TCATTAACAA ATAATAATGA ACTTGCACAA ACAAGTGTGA GTGGAACATG GAAATAA
|
Protein sequence | MKKFKRLIAI VTVLLFALSI IAPVFAQDEA TTEETAGSVY DQAAKILQDK GILKGNEQGD LMLDKQLTRA EILAMIIRAT GQEDVVKDYV YAEQSFTDVP QDHWAFAYVE AGKDLGIVNG YPDGTFKPDK PVKFEELCKM LVAAKGESPA AGKWPLNYVR KALELGFFNG IEDEVGIGDV VIRGQAAVAF ANAFFPPEKT IVVKDVKAVA NDTIEVYVDA YLGNEPATLE DGDVIPLDFE IKDASDASKT IAVTLIDSQA SDFGAGKLVL KTAAQTEGAT YKLYYKGNDT GKTFVAVPVQ LQVAKVEVPN LKQVVVTFNR DVKDVYAVDK NNYEIKVGDA TKSIGAVRLS EDKKKVTLIL KDDLANQDKV KVTLKTGLGL KEAYTTEIGP VQDGTAPAIV KVTAENPQKL RVEFSEPVKN YANPANYTLN GMYLVKEVKG FNEDSSIDPN VIVLNLYIPL NVGNNTLSVT GVTDIAGLLV VNPSMSFSVA EDKSPIELKN VTATLSQVKL EFSKAIQSIV SVSLSNGIIA GTSVDGNVVT ISSGDELATC IPVSGAKVTI EVKDYTGQTA KFEKYVVPTI DTERPTVKSV TIADSTTVKV TFSEDVRVPN PSDNKIIVKD KDGNQKVISV ITWDTDSSGN QIKNTLKVVL SSQLPAGVVT VQVSGIEDLT PLKNASLPQT VTATLSDTSA PDVVAPIVYN DVSGDTTELY ITFNKTLNAA SANIATNYKY LDSSYVLKDF SGATASVLAN GKTVKLVIKD SEFANMTYLQ IIGVADTNGN KATLAIQKDA TKFVSSSTAI VTIDSTNGVQ AVSTTQLKVF LTGNINEYTL YAGDFEVKAG SNTIGVLYAT WDAGSKAVVL NLATAIGADA KKDGNGVTVT IKANSITKDL LGRSINSGSA VGPVTASDKI APTITAVEAV YNATYNVTEV TVKFSEAIVV DSTYVLDQFK VYIGGAITNP DAGVDVKTDN IVFKFSGDKR YSTIKVDYVP AYDTSKRVKD SLTNNNELAQ TSVSGTWK
|
| |