Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2722 |
Symbol | |
ID | 7408292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2873292 |
End bp | 2875178 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643717079 |
Product | protein of unknown function DUF303 acetylesterase putative |
Protein accession | YP_002574548 |
Protein GI | 222530666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTAA GAATGCCTTA TTTAATTAGC GACGGTATGG TGCTTCAAAG AAACAAACAA ATAAATATCT GGGGCTGGGC TGAACCATGC AAGATGGTAA CAGTAAATTT CCTGGGAAAG TCATACACGG CAGTAGCTGA CCACTTGGGA AAATGGAAAG TTACTTTGCC ACCCATGGAT GCCGGCGGTC CATACTTTAT GGAAATCAAA TGCCAACATC ATGCTGTTAC AATCAAAGAC ATTCTCATTG GAGATGTGTG GGTATGCTCT GGACAGTCTA ATATGGTTTT GCCGATGGAG AGAGTCATCG ATTTATATCC TGAAGAGCTT GATGACTGCA ATATTCCGCT TATCAGACAG TTTACAGTCC CCGAAAAATA CAATTTTAAA GGTCCTCAGG AAGAGTTAGA AGGTGGTACT TGGGATGTTC TCAGCAAAGA AACACTTCTT AAGTTTTCTG CCGTAGGATA CTTTTTTGCA AAAGCGCTCT ATAAAAAATA CAATATACCA ATTGGATTGA TTAAATCATG TGTTGGTGGA ACACCAATTG AGGCGTGGAT GAGCAGTGAT ATAGTATACA AATTTCTTGA GAATCCCGAT GAACTTGAAA AACTCAAAGA TGACAGTTAT ATAGAAGCTG TATCCAAAGA AGAGGAAGCT AAAATAAAAG CTTGGTTTGA TTATTTAAAT GCAAACGATA CTGGTCTTAA TAGCAATCCT CCATTTTTTG ATGAAAATTG TTCTACATCG GACTGGAAAG CTATAACCAT ACCAGCTACA TGGAAAGAGA TGGGGCTTGA TTCAACAATA GGTGTTGTAT GGTTCAAAAA AGAAATAAAC ATACCTTCTT GCATGGTTGG AAACCCAGCA AGGTTATATC TTGGGACAAT TGTTGACAGC GACTTTACAT ATGTCAACGG AAAACTTGTT GGTTCAACTT CATATCGATA TCCGCCAAGA AAGTATAATA TACCTGCTGG TCTTCTAAAA GAAGGAAAAA ATACAATTGT TGTAAGAGTT ATAAGCAATG ATGGAAATGG TGAGTTTGTA AAAGGAAAGG AATACAAGTT GTTTACAGAA GATTGCAAGA TAGACCTCAA AGGTCAGTGG CTGTGCAAGG TGGGTGTTAG AAGCCCAGAA CCTTTGCCGC AACAAACTTT TTGGCAGTAC AAACCTACAG GTCTTTTTAA TGGAATGATT GCACCACTAT TAAATTACAG TATAAAAGGT ATAATATGGT ATCAAGGTGA ATCCAATACA GACAGGCCTG AGGATTACTG TAATAAACTG TGTAATCTTG TTGATGATTG GAGAAAAAAA TGGGGTGATA GCAGTCTACC TTTTCTATAC GTACAGCTTG CGAACTTTAT GGAACCAAAA CCGCAGCCTT GTGAGAGCAA CTGGGCAAGG TTGAGGGAAG AACAAAGAAG AGCACTTTTG CATCTTGACA ATGTGGGAAT GGCAGTTGCA ATTGACCTTG GTGAGTGGAA CGACCTTCAT CCATCGAACA AAAAAGATGT GGGTGAAAGA CTTGCTTTAC TTGCACAAAA AGTTGCATAC GGTGAAAAGG ACTTGGTTGC ATCCGGGCCT TTATACAAGT CTATGAAAAT TGAAGGAAAT AAAATTATTT TGGAATTTTC AGAAGTCGGA AGCGGACTTA TTGCAAAAGG TGATAGTATA CTTAAACATT TTGCAATTGC TGAAAAAGAT AAAAGATTTA TTTGGGCAAA TGCTATTATT GAAGGTAACA AAGTTATTGT TTGGAACGAC AGTATTAAAA ACCCTGTTTA CGTGCGATAC GCTTGGGCTG ACAATCCAGA AGGCGCAAAT CTTTATAACA AAGAAGGATT GCCTGCATCA CCTTTTACAA CCGAAGATGA GATTTAA
|
Protein sequence | MSLRMPYLIS DGMVLQRNKQ INIWGWAEPC KMVTVNFLGK SYTAVADHLG KWKVTLPPMD AGGPYFMEIK CQHHAVTIKD ILIGDVWVCS GQSNMVLPME RVIDLYPEEL DDCNIPLIRQ FTVPEKYNFK GPQEELEGGT WDVLSKETLL KFSAVGYFFA KALYKKYNIP IGLIKSCVGG TPIEAWMSSD IVYKFLENPD ELEKLKDDSY IEAVSKEEEA KIKAWFDYLN ANDTGLNSNP PFFDENCSTS DWKAITIPAT WKEMGLDSTI GVVWFKKEIN IPSCMVGNPA RLYLGTIVDS DFTYVNGKLV GSTSYRYPPR KYNIPAGLLK EGKNTIVVRV ISNDGNGEFV KGKEYKLFTE DCKIDLKGQW LCKVGVRSPE PLPQQTFWQY KPTGLFNGMI APLLNYSIKG IIWYQGESNT DRPEDYCNKL CNLVDDWRKK WGDSSLPFLY VQLANFMEPK PQPCESNWAR LREEQRRALL HLDNVGMAVA IDLGEWNDLH PSNKKDVGER LALLAQKVAY GEKDLVASGP LYKSMKIEGN KIILEFSEVG SGLIAKGDSI LKHFAIAEKD KRFIWANAII EGNKVIVWND SIKNPVYVRY AWADNPEGAN LYNKEGLPAS PFTTEDEI
|
| |