Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2106 |
Symbol | |
ID | 7408815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2233160 |
End bp | 2234242 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716472 |
Product | protein of unknown function DUF362 |
Protein accession | YP_002573955 |
Protein GI | 222530073 |
COG category | [S] Function unknown |
COG ID | [COG2006] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000135032 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACA ACATTTACAT TATCTACGGT AAAGATGCTA AATCTATGAC AAAACAGCTT CTTGAGTATG CCGATGTGAA AAGTTATATT CCTCAGGGGA GCAAAATTGC TATAAAACCC AACTTGGTTG TTGCAAAGCC GTATACCTCA GGTGCTACAA CAAATCCACA TATTGTTGAA GGAATCATAG AGTACTTGAG GGAAAATGGG TTTGAAAACA TTGCAATTTT AGAGGGTGCA TGGCTTGGCG CTTCCACAAA AAGGGCGTTT GAAGTTTGTG GGTATACTGA GATTGCAAAA AAATACGGTG TAAAGCTTAT TGACACAAAA GACGATAGGC CTTTGAAGAT AAATGTTGAT GGGTTTGAGC TGAACATTTG TACCCAGGTC TATAGCTACG ACTTTTTAAT AAACGTTCCA CTTTTGAAAG GGCACTGCCA GACACAACTT ACCTGTGCTT TGAAAAATCT CAAAGGGCTT ATTCCTGACA GTGAAAAGAG AAGGTTTCAC ACACTTGGTC TTCACAAACC AATTGCATAC TTGAACAAAG CAATAAAAAC GCATCTTGTA GTGGTAGACA GTATTATGCC AGACCCTGAC TTTGAAGAGG GAGGAAATCC TGTTGAGAAG GATTTTATAG CCCTTGGCTT TGACCCAGTT TTGATAGACA GCTTTGCCGC CGAGCATTTG GGTTACAACC CATATGACAT TGAATACATA AGATTAGCAG AAAAATTAGG TGTTGGGAAA GCAGGTGAGT ATAATCTCAT AGAAATAAAT TCTGATAAAA AACCTACAGG AGTTTCAAAA AGGTCTTCAA TTGTTTCAAG ATACACAAAA TATATTGAAG AAAAAGACGC ATGTTCTGTA TGTTATGCAA ACCTCATAAG TGCTCTTATG AGGTTAGACG AGCAGGGGGT TTTAAAAAGA CTTTCGAAAA AACTCTACAT TGGACAAGGC TATAAAGGGA AAGTTATGGA TGGAATAGGA ATTGGGAGCT GTACAAGCGA TTTTAATATA TGCAAACAGG GATGTCCTCC AAAGTCAAAC GAGATTGTTG AATTTTTAAA ACAGAATTTA TAG
|
Protein sequence | MNNNIYIIYG KDAKSMTKQL LEYADVKSYI PQGSKIAIKP NLVVAKPYTS GATTNPHIVE GIIEYLRENG FENIAILEGA WLGASTKRAF EVCGYTEIAK KYGVKLIDTK DDRPLKINVD GFELNICTQV YSYDFLINVP LLKGHCQTQL TCALKNLKGL IPDSEKRRFH TLGLHKPIAY LNKAIKTHLV VVDSIMPDPD FEEGGNPVEK DFIALGFDPV LIDSFAAEHL GYNPYDIEYI RLAEKLGVGK AGEYNLIEIN SDKKPTGVSK RSSIVSRYTK YIEEKDACSV CYANLISALM RLDEQGVLKR LSKKLYIGQG YKGKVMDGIG IGSCTSDFNI CKQGCPPKSN EIVEFLKQNL
|
| |