Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0814 |
Symbol | |
ID | 4486302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 901764 |
End bp | 904742 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639729587 |
Product | hypothetical protein |
Protein accession | YP_872573 |
Protein GI | 117928022 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0772637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGGC AAAACAACCT TTTTGAAAAC CAGGAGGCCA AAACCAGGGA CCAGGAACCT GTCGAATGCC TTGGCAAGAC CTTCGACTCC GACGAGGCGC GGCGGCAGCA CTTCCTCCGC CTCTTGCGCG AAGGGCTGGA GGAACTCCAC GCCAAGCTCG GCGGTGTGCC CTTCACCACG GTGGAAGACG CCGTGGAACG GATGAAGTCG GTCGAGAAAT GGCCGATGGG AGGTGAGACG CGCCTGCGTG AGCTTGCCGA GCGCATGCGC CATGCCGATT CAAGCAAGGA CCTGCTCCAG CGCTGGAAGG ACGAGGTCGG CTTCCCGCAC GGTGAGATCG AAGACATCCT GAACCTATCC GATCCGCCCT ATTACACGGC GTGCCCGAAC CCCTTCATCG CGGACTTCAT CAAGCACTAC GGCAAGCCAT ACGACCCGAA CGTGCCTTAC AGCAAGGAGC CGTTCGCGGT CGACGTGAGC GTGGGGAAAA CGGACCCGCT TTACAAAGCC CATTCGTATC ACACCAAGGT ACCGCATCTG GCCATCGTGC CTTCCATTCT CCACTACACA GAGCCGGGAG ACGTGGTCTT GGATGGGTTC TGCGGCTCTG GTATGACGAG CGTCGCGGCA CAATGGTGTA GCTCCGCACC GGAGAGCTAC AAGCGTGATG TCGAGGCATC ATGGGAGAAG GAAGGCCGGA AGAAGCCCAA CTGGGGCCTA CGCCGTGTGG TGCTGGGCGA TCTCGCGCCT GCCGCCACCT TCATCGCTGC CAACTACAAC TTGCCCTTCG ATGTGGATGG ATTTGCACGC GCGGCGCGCC AGATCCTCGA CGAGGTCGAG CAGGAGATCG GCTGGATGTA CGAGACCCTG CATACCGATG GCAAGACCAA GGGGCGCATC GAATATACAG TCTGGAGCGA AGTGTTCACC TGCTGGGACT GCGCGGGCGA GGTGGTGTTT CTGGAGCAGG CCCTGGACCC GGAGACCAAG AAGGTAAGGG AGACCTTCCC TTGCCCCCAT TGCGGTGCGG AGCTGACAAA GAAGCGGCTG GAGCGCCTCT ATGAAACCAA GCTAGACAGG GCGCTGAACA CCATCATTCG CATCCCCAAG CGCCAGCCCG CTCTCATCGT CTACAGAATT GGCAAGACAC GTTACGAGAA GAAGCCGAAT CAGACCGATC TTGAAACCCT CTCCAGGGTG GAATCTCTTG CCTGGCCGCA TGAGGTTCCG ATAGATGCCT TGCCCTATAT GCACATGACC CATGAACGGG CGCGGATGGA CAATGCAGGC ATCACCCACA TCCACCACTT CTTTCTCCCC CGCGCCGCCC ACGTCCTGGC GGCGCTGTGG CGCAAGGCCC AGGCATGGCC GGACAAGCGC AACCGGCACA TGTTGTTGTT TTTCGTGGAG CAGGCGATTT GGGGGATGTC CGTTCTGAAT CGTTATTTGC CGACGGCGTT TTCGCAAACA AATCGCCAGT TGACGGGGGT ATATTACATT GCATCCCAAA TTGCCGAAGT CTCCCCGTCC TACAACCTTG AAGGAAAGCT CAAGGGTTTG GCGAAGGCCT TTCGAGGGCA TCGAACATCA TCCGGAAGCG CCATCGTCAC CACCGTCACC ACCGCACGTC TCGACCTCCC TGACAACTCT ATTGATTACA TCTTCACCGA CCCGCCCTTC GGCGAGAACA TCTATTACGC CGACCTAAAT TTCCTGGTGG AGTCCTGGCA CCGGGTGCTG ACCAACGCCA CCCCCGAAGC CATCGTGGAC AAGGCCAAGA AGAAAGGCCT GCCCGAGTAC CAGCACCTGA TGCGGCAATG CTTCGCAGAG TATTGCCGAG TACTCAAGCC CGGCCGTTGG ATGACGGTCG TTTTCCACAA CTCCCGCAAC GCCGTCTGGA ACGCTATTCA GGAAGCGATA CTGGCCGCGG GCTTCGTGGT GGCCGATGTC CGCACCCTCG ACAAGCAACA GGGTTCCTAC CGTCAGGTCA CCAGCACGGC GGTCAAACAG GACCTGGTCA TCTCCGCCTA CAAACCCAAC GGGGGCCTTG AGGAAAGGTT CAAGCTGACC GCCGGCACCG AGGAAGGTGT CTGGGACTTC GTCCGCACGC ACCTGAGGCA GCTGCCGGTG TTCGTCTCCA AGGACGGCGA GGCGGAAGTC ATCGCAGAGC GGCAGAACTA CCTGCTATTC GACCGCATGG TGGCTTTCCA TGTCCAGCGG GGCGTGACGG TGCCTCTCTC CGCGGCGGAG TTCTACGCCG GTCTCGCGCA GCGCTTCTCG GAGCGCGACG GGATGTACTT CCTGCCCGAG CAGGTGGCGG AGTACGACAA GAAGCGCATG AAGGTTGGTG AGGTCCTGCC GCTGCAGCTC TTCGTCACCG ACGAGGCCTC CGCCATCCAG TGGCTCAAGC AGCAGCTCAC CAGGAAGCCG CAGACCTTCC AGGAGCTTCA TCCGCAGTTT CTCAAGGAGA TTGGAGGCTG GCAGAAGCAC GAGAAGCCGC TCGAGCTGTC CGAGCTGCTG GAACAGAACT TCCTCCGCTA CGACGGCAAG GGTCCGATTC CGAAGCAGAT CGTTTCCTGG ATGCGTCAGA GCGCGGAGCT TCGGAAGATC ATCGATGAGC AGCTTGCGTC TGGACAGGCT CAAGAGGATG ACAGCGGACT CGTGACCCGT GAGCCACGAC TCATCGACCG CGCGAAAGCG CGCTGGTACG TCCCGGACCC CAACAAGGCC GGCGACCTGG AGAAACTCCG CGAGCGGGCC CTGCTGCGCG AGTTCGAGGA GTACCGCGAG TCCAAGCAGA AGCGCCTGAA GGTCTTCCGC CTCGAGGCCG TCCGCGCCGG CTTCAAGAAG GCATGGCAGG AACGGGACTA CGCCACCATC ATCGCCGTGG CGCGCAAGAT CCCAGAGGAC ATCCTCCAGG AAGACCCCAA GCTCCTCATG TGGTACGACC AGGCGCTTAC CCGTTCTGGG GAGGAGTGA
|
Protein sequence | MNRQNNLFEN QEAKTRDQEP VECLGKTFDS DEARRQHFLR LLREGLEELH AKLGGVPFTT VEDAVERMKS VEKWPMGGET RLRELAERMR HADSSKDLLQ RWKDEVGFPH GEIEDILNLS DPPYYTACPN PFIADFIKHY GKPYDPNVPY SKEPFAVDVS VGKTDPLYKA HSYHTKVPHL AIVPSILHYT EPGDVVLDGF CGSGMTSVAA QWCSSAPESY KRDVEASWEK EGRKKPNWGL RRVVLGDLAP AATFIAANYN LPFDVDGFAR AARQILDEVE QEIGWMYETL HTDGKTKGRI EYTVWSEVFT CWDCAGEVVF LEQALDPETK KVRETFPCPH CGAELTKKRL ERLYETKLDR ALNTIIRIPK RQPALIVYRI GKTRYEKKPN QTDLETLSRV ESLAWPHEVP IDALPYMHMT HERARMDNAG ITHIHHFFLP RAAHVLAALW RKAQAWPDKR NRHMLLFFVE QAIWGMSVLN RYLPTAFSQT NRQLTGVYYI ASQIAEVSPS YNLEGKLKGL AKAFRGHRTS SGSAIVTTVT TARLDLPDNS IDYIFTDPPF GENIYYADLN FLVESWHRVL TNATPEAIVD KAKKKGLPEY QHLMRQCFAE YCRVLKPGRW MTVVFHNSRN AVWNAIQEAI LAAGFVVADV RTLDKQQGSY RQVTSTAVKQ DLVISAYKPN GGLEERFKLT AGTEEGVWDF VRTHLRQLPV FVSKDGEAEV IAERQNYLLF DRMVAFHVQR GVTVPLSAAE FYAGLAQRFS ERDGMYFLPE QVAEYDKKRM KVGEVLPLQL FVTDEASAIQ WLKQQLTRKP QTFQELHPQF LKEIGGWQKH EKPLELSELL EQNFLRYDGK GPIPKQIVSW MRQSAELRKI IDEQLASGQA QEDDSGLVTR EPRLIDRAKA RWYVPDPNKA GDLEKLRERA LLREFEEYRE SKQKRLKVFR LEAVRAGFKK AWQERDYATI IAVARKIPED ILQEDPKLLM WYDQALTRSG EE
|
| |