Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2353 |
Symbol | |
ID | 7407772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2494947 |
End bp | 2496950 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716717 |
Product | hypothetical protein |
Protein accession | YP_002574196 |
Protein GI | 222530314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG AACTGGTGCT TGTCCGCAAC GAAACATTTA AAGACTTTCC CATTGGTAAT TTCCCATTTG ACCCTAATCA CTCTGCCATG GGTGAATATC ATTGTTATAT TCCTGAAGGT TACCGTGGTA GATGGTACGA CCCTGTTGTA TATCATGGTT GGAACGGAAG TGGCCCTACC TGGATTGTGA CAGAGGAAGA TGGTAAAAAG TTTATGGAAC AGACAAGGGT GCTCGCCGCC TTAAAGAATT TGTGGCCAAT GCTTGTAACA GGAGATGAGT TTTGGCAAGA CTATATAGTA GAGGCTCAAG TTAGGATGCT TTCAACCTTG CCTTCAGCCC ATGCAGGAAT GATGTTTAGA TATGTACATG CACGCAGTTT TTATGCTCTT TTGCTTCATG ACAAAAAGCT CAAACTTATC AAAAAAGACC AGGAAAATCT TGAAGTCTTG GCTTTCTGCA ATTTCAATTA TGATTGCGAT ACATTTTATA ATCTAAAAGT GGAATGTTGT GGCAGCAAAC TAATTGGATA TGTAGACAAC AAAAAGATGG TAGAAGCAAT TGATAGCTCA TATACAAGAG GAAAAATCGG GATTACAGCA ACAATGCCTG CACAATTTAC AGATGTCAGG GTGTTGATGA ACCATCAAGC TTATAAAAAT TATTTGATAG CAAAAAACAA TTGGGTTCAA GTTGAAAAAA ATCTTCAATG CGAGTATCCT CAGCCGGTAT TATGGAAAAT AATTGATTTT AAAAACTTTG GTGCAGGCAG GCAAATAAGA TTTGGAAATT TACCAGAAAA CAATCAGAAG TTTATGGTAA TAGCCCAGCA TCAAAAGCGG GTGCACAGAG ATGCATACGC CAATATCAGC TGCCTTACCG CAATTGATTT AGATGGCAGA ATTCTTTGGC AGATAGGTGA AGCATCTTCA CAAAAAGACC ATGCATATCT TACAGCTGAC CTTCCTTTTC AGGTGTGCGA TGTTGACCTT GACGGGGTTG ATGAGGTAAT TGTTGCAAGA AACTTTAAAA TCATGATTTT AGACTCTCTT ACAGGAAAAA TAAAAAAGCA AATTCCTGCG CCCTTTTCTT ATGAAACTGA CAAACTGTAC TCTGTGCCAT TTGGCCAATA TGCCTTTGAT AGGGTCAATA TTGACTCAAT CAGAATTGCA AACCTAACTG GCAAATCTAA GCCAACAGAT ATAATTGTAA AAGATAGGTA CAGCAGGCTT TGGGCATATG ACAACAATCT TGATCTTCTG TGGAAGTTCA ATGGCAAAAA CACAGGACAT TTTATGTTCA CCAAAGACAT TGATAATGAT GGCAAAGAAG AGGTTGTTGT AGGTTATCAT CTTTTAGACC ATGATGGAAA GCTTATCTGG ACTTTACCCA TTGAATCTGA CCACACTGAT GAGATTGTAA TAGGTCCAAT TGACCCTGAG AGAAATGAAG ATATCATTGC GATGGCATGT GGTGATGAAG GCTTTATACT CTCTGACCTT AAAGGCAATA TTATAAAGCG ACATTTAATA GGTCATGCTC AAAGAATAAG TGTTGGGAAC TACCGACCTG ATCTTAAAGG CTATGAGATA TGCGTAACTA CATACTGGGG ATATCAAGGA ATAATATACA TCTTTGATTG TAAAGGGAAT CTTCTGCATC AGTTCGAATC ACCAGTACCT GGAAATATAA TAACACCTGT CAACTGGACT GGGGACGGAA AAGATTTAAT ACTTCTGAGC GGTCATATTC AGTATGGTGG GCTGATTGAT GGTTTTGGAA GAAGGGTTGT AACTTTCCCT GATGATGGTC ATCCAATCCT TTGTGCAGAT AGCATTGATT TGACAGGTGA TGGCAGAGAT GAGATATTGC TGTGGGACGA AAAGAAAATG TATATTTATA CCCAGCAAGA CAATGGCATT AAATCAGACG TGCTTGTTCC AAATAAATAC CCTACTATAA ATGGCTCAAA CTATAGAGGA GAGTACTCTT TTTATAACAT TTAA
|
Protein sequence | MKEELVLVRN ETFKDFPIGN FPFDPNHSAM GEYHCYIPEG YRGRWYDPVV YHGWNGSGPT WIVTEEDGKK FMEQTRVLAA LKNLWPMLVT GDEFWQDYIV EAQVRMLSTL PSAHAGMMFR YVHARSFYAL LLHDKKLKLI KKDQENLEVL AFCNFNYDCD TFYNLKVECC GSKLIGYVDN KKMVEAIDSS YTRGKIGITA TMPAQFTDVR VLMNHQAYKN YLIAKNNWVQ VEKNLQCEYP QPVLWKIIDF KNFGAGRQIR FGNLPENNQK FMVIAQHQKR VHRDAYANIS CLTAIDLDGR ILWQIGEASS QKDHAYLTAD LPFQVCDVDL DGVDEVIVAR NFKIMILDSL TGKIKKQIPA PFSYETDKLY SVPFGQYAFD RVNIDSIRIA NLTGKSKPTD IIVKDRYSRL WAYDNNLDLL WKFNGKNTGH FMFTKDIDND GKEEVVVGYH LLDHDGKLIW TLPIESDHTD EIVIGPIDPE RNEDIIAMAC GDEGFILSDL KGNIIKRHLI GHAQRISVGN YRPDLKGYEI CVTTYWGYQG IIYIFDCKGN LLHQFESPVP GNIITPVNWT GDGKDLILLS GHIQYGGLID GFGRRVVTFP DDGHPILCAD SIDLTGDGRD EILLWDEKKM YIYTQQDNGI KSDVLVPNKY PTINGSNYRG EYSFYNI
|
| |