Gene Information |
Locus tag | Athe_1762 |
Symbol | |
ID | 7408549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1836541 |
End bp | 1838406 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643716139 |
Product | hypothetical protein |
Protein accession | YP_002573628 |
Protein GI | 222529746 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
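The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 1836541
END_BP = 1838406
PROTEIN_LENGTH_AA = 621


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1866
print(expected_protein_length(length_bp))  # 621
```

Under these assumptions the computed values agree with the Gene Length (1866 bp) and Protein Length (621 aa) fields.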
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00346887 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGAA AGTTAAAAAC AGTAATTTGT ATTTTTACAA TAGTTGCTTT TGTGCTGACA TCTTCTATCT TTGCGCTTGC ACAAGAAGAT AGTGGCGTAA TTGACTTTTA TACAGTCTTT TCAAAAGATG ACGTCAATAA GAACAGAGTT GGAAACAGTG TTTACAACTG GTCTATCTAT ATGCCACAGG ATGCCTATAT AAACAAAGAC CCAAAAGGAA GCTATTTTTC AATGTCAAGC AATAGTTACA AGGCAAATAT TAATGTTGAG GCAATTTTAA ACAAAGAAGG CTACACATCA TTAGATGAGA TTTTGCTGTA TGGTCAAGAT TTAATTTCAG GTTATTATTC GGGTTCAAAG CTATATTCGC TTAAAAAAGG AAAAGACAAG CAAGGCCAAG AATATATCGA AGCCACAAGT GTTTATACCG ACTCTTTTTA TGTATTTGTT GACGAAGAAG AAAGTTCTGG TACTTTCAAT CTCATACGAA TTTATTTGAG CAAAAATAAA AAATACAACT ACATTTACAG ACTTACGATT AGCATGGATT TAAATTTCTA TTCACAACAC CAAAATCTTT TATACAAGAT TGCTGATTCT TTTGAAACAA ACTTTGACAG AAACAATCCC AACATAAAAG ATTTAGCAGA CAACGTAACA TCGTGGAGAG TTCACAAAAA CACAAGTTAT GGATGGCAGA TTGATCTTCC ACCCTATTGG AAATCTACAG ACATTTATAA TCTTGAGTAC AATTCATCAA CACAATCATT TGCTCCACTT TATACAGATG AGGAGATGGG AATCTCAACC CAAAAACAGC AGGATACCTC AAGTATAATA CCTTCAGATA GTCAGCAGGA ATATGACGAA TATCTTTCTG TCAGTTTTGT GACAAACTCG AATGTCAGTT TTGACAAATG GGTTTCTCAG GAAATAAAAA GTATTGAGCT TTACAATAAA GAACTTTTAA AAGTCATCTC ATCCAAAAGT TTAAATATTG GAACAAGCAA AGCAAAAATT TATGAACTCA GAATCAGAAA AAGCCTTAAC AAGACGTTTG TTGAAAAAAG ACTTTATGTT GACTCAAACA ATAATAAATA TGTGGTAAGA CTGTACGCTG CAGAAGAAAA ATATAACAAA GATAAGCAAA AATATGAGAG GATAATAAAT TCATTCAAAG TATTGCCGGC AAAAAGCAGA TACTTTGATT CTATTCTTTG GTCTGGAGAT TTGAAACCAC AAAGCAGTTT GAAAACCATA AAACTTAACA AAGCTCCATT TGAAATGAAG ATTTCAAAGG ATTACAAAAC AAATATGCCA TATTACTACT ATAGTTACTT TAGTCAGATA ATAGGAACCT TAATTCCATC TGCACAGGGT ATTTCAGACA TTGAAACAGT GATGCTTTAC AATACTCCAT ACTCAATCTT GACAATAAAT GGTGGAATAA ACGTAGACCC TGCCGAAAAA ATTATAAAAA ACACTATGCA AGCATGGGTT GAAAGCAATG AATACAAGAG CAAAACTGTA AATATGAGCT GTTTGAAGTA TATTGATAAA AATCTTTCTA TTTATAAGTT TACTTATGTT TATAACATTT CTAAGCTTTC AGAACTCGCA AAGGGAAATC CAAATAGAGA CTTTAATTTC ATGAATCTTC AGAATAGAAT TATTTATATG ATAAAATACA ATCAATATTA TTACACCATC GACCTGTCAA TTCCTGTTTT GTATTATAAT AGCTATACAG TTTCTGACTT TGAGAACTTT GTGAAAAGCA TAAAGATAGA CAAGATTGAA 
TTTTCAAAGC TGAATATAAA ATTTGTAAAA GAGGATTTGG AAAAATTTAA GAAAAAAGAG GAATAA
|
Protein sequence | MSRKLKTVIC IFTIVAFVLT SSIFALAQED SGVIDFYTVF SKDDVNKNRV GNSVYNWSIY MPQDAYINKD PKGSYFSMSS NSYKANINVE AILNKEGYTS LDEILLYGQD LISGYYSGSK LYSLKKGKDK QGQEYIEATS VYTDSFYVFV DEEESSGTFN LIRIYLSKNK KYNYIYRLTI SMDLNFYSQH QNLLYKIADS FETNFDRNNP NIKDLADNVT SWRVHKNTSY GWQIDLPPYW KSTDIYNLEY NSSTQSFAPL YTDEEMGIST QKQQDTSSII PSDSQQEYDE YLSVSFVTNS NVSFDKWVSQ EIKSIELYNK ELLKVISSKS LNIGTSKAKI YELRIRKSLN KTFVEKRLYV DSNNNKYVVR LYAAEEKYNK DKQKYERIIN SFKVLPAKSR YFDSILWSGD LKPQSSLKTI KLNKAPFEMK ISKDYKTNMP YYYYSYFSQI IGTLIPSAQG ISDIETVMLY NTPYSILTIN GGINVDPAEK IIKNTMQAWV ESNEYKSKTV NMSCLKYIDK NLSIYKFTYV YNISKLSELA KGNPNRDFNF MNLQNRIIYM IKYNQYYYTI DLSIPVLYYN SYTVSDFENF VKSIKIDKIE FSKLNIKFVK EDLEKFKKKE E
|
| |
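The GC content field could in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown in the record, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, and the demonstration uses only the first block of the gene sequence rather than the full CDS.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence in the record (not the whole gene).
first_block = "ATGTCCAGAA"
print(gc_content(first_block))  # 40.0
```

Applied to the full gene sequence, the same function would give the value to compare with the reported 31%; that comparison is left unstated here because it has not been computed.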