Gene Information |
Locus tag | Athe_2567 |
Symbol | |
ID | 7409518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2686781 |
End bp | 2688931 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643716929 |
Product | Hedgehog/intein hint domain protein |
Protein accession | YP_002574406 |
Protein GI | 222530524 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region; [TIGR01445] intein N-terminal splicing region |
|
|
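The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; both are assumptions about this record's conventions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2686781
END_BP = 2688931
GENE_LENGTH_BP = 2151
PROTEIN_LENGTH_AA = 716


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_cds_length(protein_aa: int) -> int:
    """Three bases per residue, plus three for an assumed single stop codon."""
    return (protein_aa + 1) * 3


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_cds_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions both comparisons hold: 2688931 - 2686781 + 1 = 2151, and (716 + 1) x 3 = 2151.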
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00023287 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGCG GAGGGCTTGC AGGTGGTTCT GCAGCTATTG ATGTTATGGT TGCGCTGGTT AGTTTAACTA CACTTGCTTG GGCTATATTT GATGTACTGC AAATATCTTG TGCTTTAATT GCTGCAACCA ATCCGCTTTT AGCAATTATA TTCTATTCAT TGTTTGTAAT CAGCACAATC AATCTCATTA TGACGTTAGA AACAATAATT TTGTACTGGG AGACAAGAGA TTATGAATAT GCAAGCCAAC TGTTTGGCGA GTTGATACTA AATATTGCTA CATTTGGAGT ATTTAAGGTA ATTGAATATT TAGTACCGGG TATTATGACG TTATTCAAGA CGGTAAAAAA TCAACTGGAT GAGATAGCCC AGATAGCTGA ACAATTTGGA GATGAAGTTG CTGAAGTTGC AGCAAGATAC GGTCCTGATG CGATTGAGGC TATAAAGAGG TATGGTCCAG ATGCTGCGAG GGTGATCAAC AATTATGGTG ATAGTGCAGT AAAAGCTATG GCAAGAGGTA TTGACCCTGC GCTTATTGAA AAAATGGACA GTTTAGCTGT TAAGGTAAAT AAGCTTGAAA AATTCAAGAT ACTATCTCGT GAAGCAGCTC TCAAAGTTGT TGAGGTAGTG GAAACAATAA AGGACTACTT GAAAACATCT GTGGGAAGAG TTTTTGAAAA GATTAGGTCT GTGTACAGGA TTGAAGATGA GTTAGATTTA ACAACAGTGG ATGGATGTGA ATTTGGCTTA TCAAGGGCAA AACTGAAGAA GAAATTAATA GAAGAGGGAA TGAGTGAGGA TTCGGCTGAA GAATCACTAA AATTTTTAGA AGAAGGTTGC TTTACCGGGG ACACAATTGT TATTACAAAA GAGGGTAAAA AGAGGATAGA TGAGATAAAG ATAGGTGATT TTGTTTTTGC GAAGGATGTC AATACAGGTA AGACAGCTTA TAAGAAAGTT AAACAGATTT ATGTCAAGAG TGCGGAAGAG ATTGTTCATA TCAAAGTTGG AGATGATGAA GTAAAAACTA CGAAATCGCA CTTATTCTTT ACAGATTCGG GCTGGTGGGA AGCAGCTGAG GATATAAAAA GTGGTGACAA AATAGTAACA CAAGATGGTA TAATGAAAGT AGTATATGAA GTAGAAGTTG AGAAGTTAAG CGCACCTGTA AAAATTTATA ATCTCAACAT AGAAGATTAT CATACTTACT TTGTTGGAAG CTCTGGATTG CTTGTGCATA ATGACTGCAC ACCTGAGGAG ACTAAAATTA TTTCAGAGGC GCAGCAGCAA TACGAAAAAT GTATAGAAGA AGGTATTGCT TCACTTGACA CAGCAAGAAA ATGTTTTAGC GACGAATTGG ATGATGTTCT AAAAAATCTT GAAGATGAGA TAAAAATTTC TAAAGATGAG TTTGTGGAGT ATAAATCATC AGTTGTTCAT GACAATCCAA AATTACCTGA AAATAAAAAA GAACTTTCTA TTAAAGAAAG GAAAGCATTG TTAAAAATAA GAGGTGATAT ACCACAACCA GATGAGAATA CAGTATTAGT AAAAGTGCTA AACCCTCAGG AAGATGTAGA AAAATACTAT TTACGTGGGG AGAGTCCATA TGTAAAAGGA TTTATAGCGC GTGCAGTTGA TCTTAAATAC GCTCGAACTT ATGAGGAAAT TGTAGCAAGT TTAAGGTTAG ATTATAAAAA TTCACCATTT CCGAAACGAA ATGATGTGTC CGAAAATAAA GTATCATGCT ATATTGTGAT ATTTAAAACG AAGGACGTAG ACAAGATAAA AATTCCAGTT 
AGCAAAAATA TGTTTGAAGA CATTTCGGAT GAGGAGCTTG CTAATTTGGG TATTCGCAAA GAACATTTTA TTTTATTTGA GGAGGATGAA GAGTTAGCGA AAAGGTATCT TTCCAAACCA TTCACAGGGA CTGGATTTAC AGCAACAGGT GCTGGAGACG ATGAGTATAT AACCGATGGG GTAATGAATC AAATGAAAGA CAAGGCAATA CCGGAGTATT TTGTTCAGTT TGAAAATCCC CTAAGTTTAA AAGATGGTGC ATTTTTGATA GAAATGAGAG GAAATAAGAA TATGAAAGTG ATAGCAAGAT ATTCTGAAGC AGAAAAAAGA TTTATACATT TTGAGGAGTA A
|
Protein sequence | MISGGLAGGS AAIDVMVALV SLTTLAWAIF DVLQISCALI AATNPLLAII FYSLFVISTI NLIMTLETII LYWETRDYEY ASQLFGELIL NIATFGVFKV IEYLVPGIMT LFKTVKNQLD EIAQIAEQFG DEVAEVAARY GPDAIEAIKR YGPDAARVIN NYGDSAVKAM ARGIDPALIE KMDSLAVKVN KLEKFKILSR EAALKVVEVV ETIKDYLKTS VGRVFEKIRS VYRIEDELDL TTVDGCEFGL SRAKLKKKLI EEGMSEDSAE ESLKFLEEGC FTGDTIVITK EGKKRIDEIK IGDFVFAKDV NTGKTAYKKV KQIYVKSAEE IVHIKVGDDE VKTTKSHLFF TDSGWWEAAE DIKSGDKIVT QDGIMKVVYE VEVEKLSAPV KIYNLNIEDY HTYFVGSSGL LVHNDCTPEE TKIISEAQQQ YEKCIEEGIA SLDTARKCFS DELDDVLKNL EDEIKISKDE FVEYKSSVVH DNPKLPENKK ELSIKERKAL LKIRGDIPQP DENTVLVKVL NPQEDVEKYY LRGESPYVKG FIARAVDLKY ARTYEEIVAS LRLDYKNSPF PKRNDVSENK VSCYIVIFKT KDVDKIKIPV SKNMFEDISD EELANLGIRK EHFILFEEDE ELAKRYLSKP FTGTGFTATG AGDDEYITDG VMNQMKDKAI PEYFVQFENP LSLKDGAFLI EMRGNKNMKV IARYSEAEKR FIHFEE
|
| |
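The GC content and protein sequence listed in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the strand is "-") and that the space-separated blocks of ten are purely cosmetic. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon table is built from the amino-acid assignments of NCBI translation table 11, which match the standard code; alternative start codons are not handled here.

```python
from itertools import product

# Codon order used by NCBI genetic code listings: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    prefix = "ATGATAAGCG GAGGGCTTGC"
    print(translate(prefix))  # MISGGL, matching the start of the protein sequence
```

Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison with the reported GC content and protein sequence; that full-length check is left to the reader.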