Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0449 |
Symbol | |
ID | 7407526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 511043 |
End bp | 512188 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714836 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002572354 |
Protein GI | 222528472 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACCT TTTGGCTGTT TATAATATGC TCTTTATTAT TTGCTCTAAA TTTTTTCGTT ACTAAAAAAT TTGGACTAAA AAACGTTGAG TATGAAATCT ACTTTGAAGA AAATAAAAAA ACTGAAGGTG ATGAGATTCA TATTGTTGAA AGAATTTACA ACGGTAAAGC TCTGCCACTT CCATGGGTAA AATCCGAATT TGAAGTATCA GCATCATTTT TCATGGAAAA TGCAAAAAAT TATGTAGTAG GAAATAAGCT AAGGTACATC AGCATTTTCT TTCTTCTACC TTACCAGCAA ATAGTAAGGC GGCACAGATT TGTTGCAACA AAAAGAGGAT TTTATAAACT TGATAAAATA TATCTTGTCA CTGGTGACCT TTTCGGTCTT TCAATGGATG ACAGGTGTTA CTATGTAAAT TCCAATATTA CCATTTACCC AGCATTTTTG GACCTGAAAA AACACCTTCT GCCCCGTTCA AGCCTCTCAG GCGAAGTTGT GATAAAAAGA CATTATTATG AAGATATATT TCACTTTGCA GGAATAAGAG AGTATCAGTC TTTTGATTCT TTCAATAGAA TAAACTGGAA CGCAACTGCA AAGTATAATA CTTTGATGGT AAACAAGTAC GAATACACCT CATCAGGTGA TGCTTTAATA CTTTTGAATG TCCAAAGTTC AGAGTATGAA AGAAAAGAGG TTTTTAACAA AAACGCAATC GAACTTGGAA TAAAGATTGC AGCAAGCCTG ACAAAAGAAT GCTTAGATAA TCACATTCCA GTTGGTTTTG TTTGCAATGG CATAGACGAA GAAACTCTTG AGCCGCTTGA AATCTTGCTG CCATCACAAG ATTCAAATCA GCTTTTAAAA ATTCTCGAAA CACTTGCACA CATTAAAATT CAGGTAAACG AATACTTTGA AGCTTTGCTT TATCAAGTTT TAAGAAGTTA CAACTTTCGT GAGCTTTTTA TAATAACTTC TTTTGTTAAC AAGGAGATGG AAGACTCTAT CCTTCTTTAT TCCTCACTCG GAGTTAAGTT TACTATTATT CTTCTTGAAT ATGATGAAAA ACCTTTCAAA TTAGAATCAG AAAATGTAAG AATTTTTCTG GCAAAACAGC ATCTTTTAGA AAACGTTAGA ACTTGA
|
Protein sequence | METFWLFIIC SLLFALNFFV TKKFGLKNVE YEIYFEENKK TEGDEIHIVE RIYNGKALPL PWVKSEFEVS ASFFMENAKN YVVGNKLRYI SIFFLLPYQQ IVRRHRFVAT KRGFYKLDKI YLVTGDLFGL SMDDRCYYVN SNITIYPAFL DLKKHLLPRS SLSGEVVIKR HYYEDIFHFA GIREYQSFDS FNRINWNATA KYNTLMVNKY EYTSSGDALI LLNVQSSEYE RKEVFNKNAI ELGIKIAASL TKECLDNHIP VGFVCNGIDE ETLEPLEILL PSQDSNQLLK ILETLAHIKI QVNEYFEALL YQVLRSYNFR ELFIITSFVN KEMEDSILLY SSLGVKFTII LLEYDEKPFK LESENVRIFL AKQHLLENVR T
|
| |