Gene Acel_0814 details

Gene Information

Locus tag: Acel_0814
Symbol:
ID: 4486302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 901764
End bp: 904742
Gene Length: 2979 bp
Protein Length: 992 aa
Translation table: 11
GC content: 61%
IMG OID: 639729587
Product: hypothetical protein
Protein accession: YP_872573
Protein GI: 117928022
COG category:
COG ID:
TIGRFAM ID:
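
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length_bp = gene_length(901764, 904742)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the two results match the listed Gene Length (2979 bp) and Protein Length (992 aa).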


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0772637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGGC AAAACAACCT TTTTGAAAAC CAGGAGGCCA AAACCAGGGA CCAGGAACCT 
GTCGAATGCC TTGGCAAGAC CTTCGACTCC GACGAGGCGC GGCGGCAGCA CTTCCTCCGC
CTCTTGCGCG AAGGGCTGGA GGAACTCCAC GCCAAGCTCG GCGGTGTGCC CTTCACCACG
GTGGAAGACG CCGTGGAACG GATGAAGTCG GTCGAGAAAT GGCCGATGGG AGGTGAGACG
CGCCTGCGTG AGCTTGCCGA GCGCATGCGC CATGCCGATT CAAGCAAGGA CCTGCTCCAG
CGCTGGAAGG ACGAGGTCGG CTTCCCGCAC GGTGAGATCG AAGACATCCT GAACCTATCC
GATCCGCCCT ATTACACGGC GTGCCCGAAC CCCTTCATCG CGGACTTCAT CAAGCACTAC
GGCAAGCCAT ACGACCCGAA CGTGCCTTAC AGCAAGGAGC CGTTCGCGGT CGACGTGAGC
GTGGGGAAAA CGGACCCGCT TTACAAAGCC CATTCGTATC ACACCAAGGT ACCGCATCTG
GCCATCGTGC CTTCCATTCT CCACTACACA GAGCCGGGAG ACGTGGTCTT GGATGGGTTC
TGCGGCTCTG GTATGACGAG CGTCGCGGCA CAATGGTGTA GCTCCGCACC GGAGAGCTAC
AAGCGTGATG TCGAGGCATC ATGGGAGAAG GAAGGCCGGA AGAAGCCCAA CTGGGGCCTA
CGCCGTGTGG TGCTGGGCGA TCTCGCGCCT GCCGCCACCT TCATCGCTGC CAACTACAAC
TTGCCCTTCG ATGTGGATGG ATTTGCACGC GCGGCGCGCC AGATCCTCGA CGAGGTCGAG
CAGGAGATCG GCTGGATGTA CGAGACCCTG CATACCGATG GCAAGACCAA GGGGCGCATC
GAATATACAG TCTGGAGCGA AGTGTTCACC TGCTGGGACT GCGCGGGCGA GGTGGTGTTT
CTGGAGCAGG CCCTGGACCC GGAGACCAAG AAGGTAAGGG AGACCTTCCC TTGCCCCCAT
TGCGGTGCGG AGCTGACAAA GAAGCGGCTG GAGCGCCTCT ATGAAACCAA GCTAGACAGG
GCGCTGAACA CCATCATTCG CATCCCCAAG CGCCAGCCCG CTCTCATCGT CTACAGAATT
GGCAAGACAC GTTACGAGAA GAAGCCGAAT CAGACCGATC TTGAAACCCT CTCCAGGGTG
GAATCTCTTG CCTGGCCGCA TGAGGTTCCG ATAGATGCCT TGCCCTATAT GCACATGACC
CATGAACGGG CGCGGATGGA CAATGCAGGC ATCACCCACA TCCACCACTT CTTTCTCCCC
CGCGCCGCCC ACGTCCTGGC GGCGCTGTGG CGCAAGGCCC AGGCATGGCC GGACAAGCGC
AACCGGCACA TGTTGTTGTT TTTCGTGGAG CAGGCGATTT GGGGGATGTC CGTTCTGAAT
CGTTATTTGC CGACGGCGTT TTCGCAAACA AATCGCCAGT TGACGGGGGT ATATTACATT
GCATCCCAAA TTGCCGAAGT CTCCCCGTCC TACAACCTTG AAGGAAAGCT CAAGGGTTTG
GCGAAGGCCT TTCGAGGGCA TCGAACATCA TCCGGAAGCG CCATCGTCAC CACCGTCACC
ACCGCACGTC TCGACCTCCC TGACAACTCT ATTGATTACA TCTTCACCGA CCCGCCCTTC
GGCGAGAACA TCTATTACGC CGACCTAAAT TTCCTGGTGG AGTCCTGGCA CCGGGTGCTG
ACCAACGCCA CCCCCGAAGC CATCGTGGAC AAGGCCAAGA AGAAAGGCCT GCCCGAGTAC
CAGCACCTGA TGCGGCAATG CTTCGCAGAG TATTGCCGAG TACTCAAGCC CGGCCGTTGG
ATGACGGTCG TTTTCCACAA CTCCCGCAAC GCCGTCTGGA ACGCTATTCA GGAAGCGATA
CTGGCCGCGG GCTTCGTGGT GGCCGATGTC CGCACCCTCG ACAAGCAACA GGGTTCCTAC
CGTCAGGTCA CCAGCACGGC GGTCAAACAG GACCTGGTCA TCTCCGCCTA CAAACCCAAC
GGGGGCCTTG AGGAAAGGTT CAAGCTGACC GCCGGCACCG AGGAAGGTGT CTGGGACTTC
GTCCGCACGC ACCTGAGGCA GCTGCCGGTG TTCGTCTCCA AGGACGGCGA GGCGGAAGTC
ATCGCAGAGC GGCAGAACTA CCTGCTATTC GACCGCATGG TGGCTTTCCA TGTCCAGCGG
GGCGTGACGG TGCCTCTCTC CGCGGCGGAG TTCTACGCCG GTCTCGCGCA GCGCTTCTCG
GAGCGCGACG GGATGTACTT CCTGCCCGAG CAGGTGGCGG AGTACGACAA GAAGCGCATG
AAGGTTGGTG AGGTCCTGCC GCTGCAGCTC TTCGTCACCG ACGAGGCCTC CGCCATCCAG
TGGCTCAAGC AGCAGCTCAC CAGGAAGCCG CAGACCTTCC AGGAGCTTCA TCCGCAGTTT
CTCAAGGAGA TTGGAGGCTG GCAGAAGCAC GAGAAGCCGC TCGAGCTGTC CGAGCTGCTG
GAACAGAACT TCCTCCGCTA CGACGGCAAG GGTCCGATTC CGAAGCAGAT CGTTTCCTGG
ATGCGTCAGA GCGCGGAGCT TCGGAAGATC ATCGATGAGC AGCTTGCGTC TGGACAGGCT
CAAGAGGATG ACAGCGGACT CGTGACCCGT GAGCCACGAC TCATCGACCG CGCGAAAGCG
CGCTGGTACG TCCCGGACCC CAACAAGGCC GGCGACCTGG AGAAACTCCG CGAGCGGGCC
CTGCTGCGCG AGTTCGAGGA GTACCGCGAG TCCAAGCAGA AGCGCCTGAA GGTCTTCCGC
CTCGAGGCCG TCCGCGCCGG CTTCAAGAAG GCATGGCAGG AACGGGACTA CGCCACCATC
ATCGCCGTGG CGCGCAAGAT CCCAGAGGAC ATCCTCCAGG AAGACCCCAA GCTCCTCATG
TGGTACGACC AGGCGCTTAC CCGTTCTGGG GAGGAGTGA
 
Protein sequence
MNRQNNLFEN QEAKTRDQEP VECLGKTFDS DEARRQHFLR LLREGLEELH AKLGGVPFTT 
VEDAVERMKS VEKWPMGGET RLRELAERMR HADSSKDLLQ RWKDEVGFPH GEIEDILNLS
DPPYYTACPN PFIADFIKHY GKPYDPNVPY SKEPFAVDVS VGKTDPLYKA HSYHTKVPHL
AIVPSILHYT EPGDVVLDGF CGSGMTSVAA QWCSSAPESY KRDVEASWEK EGRKKPNWGL
RRVVLGDLAP AATFIAANYN LPFDVDGFAR AARQILDEVE QEIGWMYETL HTDGKTKGRI
EYTVWSEVFT CWDCAGEVVF LEQALDPETK KVRETFPCPH CGAELTKKRL ERLYETKLDR
ALNTIIRIPK RQPALIVYRI GKTRYEKKPN QTDLETLSRV ESLAWPHEVP IDALPYMHMT
HERARMDNAG ITHIHHFFLP RAAHVLAALW RKAQAWPDKR NRHMLLFFVE QAIWGMSVLN
RYLPTAFSQT NRQLTGVYYI ASQIAEVSPS YNLEGKLKGL AKAFRGHRTS SGSAIVTTVT
TARLDLPDNS IDYIFTDPPF GENIYYADLN FLVESWHRVL TNATPEAIVD KAKKKGLPEY
QHLMRQCFAE YCRVLKPGRW MTVVFHNSRN AVWNAIQEAI LAAGFVVADV RTLDKQQGSY
RQVTSTAVKQ DLVISAYKPN GGLEERFKLT AGTEEGVWDF VRTHLRQLPV FVSKDGEAEV
IAERQNYLLF DRMVAFHVQR GVTVPLSAAE FYAGLAQRFS ERDGMYFLPE QVAEYDKKRM
KVGEVLPLQL FVTDEASAIQ WLKQQLTRKP QTFQELHPQF LKEIGGWQKH EKPLELSELL
EQNFLRYDGK GPIPKQIVSW MRQSAELRKI IDEQLASGQA QEDDSGLVTR EPRLIDRAKA
RWYVPDPNKA GDLEKLRERA LLREFEEYRE SKQKRLKVFR LEAVRAGFKK AWQERDYATI
IAVARKIPED ILQEDPKLLM WYDQALTRSG EE
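
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: the function names are hypothetical, and the codon lookup uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ only in permitted start codons, which does not matter here because this gene starts with ATG). It translates the first and last lines of the gene sequence so they can be compared with the corresponding ends of the protein sequence.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon is shown as '*'."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = join_blocks(
    "ATGAACCGGC AAAACAACCT TTTTGAAAAC CAGGAGGCCA AAACCAGGGA CCAGGAACCT"
)
last_line = join_blocks("TGGTACGACC AGGCGCTTAC CCGTTCTGGG GAGGAGTGA")

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Passing the full joined gene sequence to `gc_fraction` would give a value to compare with the GC content field, and passing it to `translate` would give the full protein followed by a `*` for the stop codon.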