Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1461 |
Symbol | |
ID | 4484906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1634835 |
End bp | 1638056 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639730245 |
Product | cell wall binding repeat 2-containing protein |
Protein accession | YP_873219 |
Protein GI | 117928668 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGGC GGTCGGCCGT TGCGGCGCTG CTCGCAGGGG CAGTTCTGGC ACTCCTCACC GCATTTCCCA CCACTGCGTC CGCCGCGCAA CCCGTCGTTA CCGTTGGCGC CGCGCCGACA CCGCCTGCTG ACGCCCAGTT CCTTGGGCCC ATATCCGGTG CGCAAACCGG GACGGTGATC CAGGTAAACG TCGTCTTACG GTCCAGGGAT TCGCTGGGCC TTGACCGCTT CATGGCCGAG GTCACAAATC CGCACTCGCC TGAATACCGG CACTTTCTCA CGCCTGGGCA ATTCCAGCGG CGGTTCGGCC CCACCACGCA GGCAATCAAC GACGTTACCG CGGCGTTCCG AAATCTCGGA TTGCAGCGAA CCTCCGCCCT CGGCTCGGTA CTCGGTTTCT CCGGATCGCT TGCCCAGTTT TCGTCGGCAC TGCACATTGG CTTTGCGCGC TACCGACTCC ATTCCGGGCG AATTGCCCGC ATCAACACCG CCGCTCCGCA ACTGCCGGTG TCCGTCGCCC GGTACATCAG CGGCATCGTC GGGCTCGACG ATCTCGCGCG CGCCGCGCCC GCCATCCCTC GGCACGCCGC GCCGACGCGC GTGCCGGTGT GGTCTGCCGG GACGGCCGCT ACGCCGGGGT CGTCGACCTC CTGCCTGTCG ACGTCACCGA CGAACGCCTA TTACTTGCCG GATCAGCTTG CCAGATATTA CGGCCTCGAC ACACTGTTTA CCGTCGGCGC TCTCGGCGCA GGGGTGACGG TTGCGCTGCC TGAATTCGAG CCTTTTCTGT CGTCCGACAT CGCGGCATTC AACCAATGCT TCGGTCTGAC AGCAACACCT CAGGTGGTCA CGGTGGACGG CGGAGCCGGC TCCGGCCCCG GGAGCGGCGA GGCGGCATTG GACATTGAGA CCGTCGCCGC GCTGGCGCCG AGCGCTACCA TCAAGGTTTA CGAGGGACCG AACGGCGCAG GAATTTTGTC CGTCTACAAC GCCATTGCGA ATGACGCGAC AGTAAACATT GTCTCCGTCA GTTGGCTTCT CTGTGAGGCG GACTTGCCGT CAGGATATGG CGCCAGCCTC CGTGCCATAT TTCAACAGAT GGTTGCCGAG GGCCAGACGC CCCTCGCGGC CAGCGGTGAC TGGGGATCGG CCGGTTGTTA TCCAGACATT ACTCCGTCAT ATCCAAACGG CGGCAACACC GCCCTGTCTG TCGAAACCCC GGCAAGCTTT CCCGAAGTCA CCGGCGTTGG CGGCACCAGT CTCCCGGATC TGAGCGGAAC CTCAGAAGTC GCATGGGCGG GCGCCTGCAC GGACGGCTCG GGCGGCTCGT CTCCCTGCGG CACCGGCGGC GGAATCTCCG CACTGTGGAC CATGCCGACA TGGCAAGTCC CACTGGCAAT CACCGCACAA TCATCCGGCA CTCCCTGCGG CGCTCCGGCC GGTACGTACT GCCGGGAAGT GCCTGACGTC AGCGCGTCCG CGGATCCGTC ACACGGATAT GTCATCTACT GGACGTACGT CGATCCGGTT ACGAGTACGT CGAACCCCGG CTTCTATGTC ATCGGCGGAA CGAGCGCCGC GGCGCCGCTG TGGGCGGCGT TGCTCGCTGA CATTTCATCC AGTTGTCGGC CTCAACAGTG GGGCTCAATA AATTCCACAC TCTATGGCCT CGTTGGTGCG GGAATCACCC TGTTCCGTGA CATCACCAGC GGGAACAACG ACCTCTTTGG CGCAGGGGGA TACGCAGCGG GCACGGGATA TGACATGGTG ACCGGATTGG GAAGTCCCCA AGGCGGCGCG CTCAGCGCGT ATCTCTGCCC GGCGGCGGCG GACGGCGCCG GCTCCATCAC CGCAACGCCA GCGTCCGTGA CGGCGAGCAG CACCCAGGAT TTCTCGTTCA CGTACGCGGC GCCTGCCGGT CAGACCTTGA GAGCCGGAAA GCTCCAACTA ACCGTGCCTG CCGGCTGGCC AAGCCCTTCG ACCACGTCAG GCACACCCGG CTATGTGACC GCTACCGCCG GCACCGTAAC GATAAGCAAT TCGACCATCA CGGTAAGCGG TCTCACGTTA GCGGCAGGCG CCACGGTCAG CCTCACGTAC CACACTGTCA CCGCGCCTGC CACAGCGGGT ACCTACCCGT TCGCCGCGGC GATGGGAAAT TCGGGCGACA CACCGCAGTC GCTGACCAGC CCAGCGACGA TCGTCGTAAC AGCCCCTCCG CCGCCAGGCG GCGGAGGCGG GGGCAGCGGC GGTAGCGGAG GTTCCGGGAG CGGGGGCGGC AGCGGAAGCG GAGGAGGCGG CGGTGGCGGA GGGGGCGGCG CTTTCGGTCA TCCGCAGCCA ATCCGTGTCG CCGGAGCAGA TCGGATAGCG ACCGCGATCG CGGCGTCACA GACCGCCTTT CCCATCGACG CCGGCGCGCA CGTCGTCGTC CTCGCCCGTT CCGACAACTT CCCCGACGCC CTCGCCGGAA CTCCCTTGGC CATCACCAAA GAGGGCCCCA TCCTCCTTAC GCCGACGACA TCGCTCGATC CACGGACGCT CGCTGAAATC GAGCGCGTCC TACCCCGCGG CGCAACCGTA TACCTCCTCG GCGGAACAGC GGCGCTCTCC CCAGCCATTG CAGACGTGCT GCTCAACGAC GGCTACGTCG TCGTCCGACT CGCCGGAGCG GACCGCTACG GAACCGCCGT CGCCATAGCC AATGCGCTCG GTGACCCGAC GACCGTCTTC GAAGCTGACG GCACCAACTT TCCAGACGCC CTGACCGCCG GAACCGCGGC CGCCCACGAA GGCGGGGTCG TGCTCCTGAC CGATGGCAGC CGGCTGCCCG CCGCCACCGC GTCCTACCTT GCGGCGCATC CCGGGACGCA CATCGCGGTC GGTGGTCCGG CAGCTCACGC TGACCCCACC GCCGTCCCGT ACGTCGGGGC GGACCGCTAC GAGACAGCCG CGCTCGTCGA TCGCGCATTT TTCGCCGTAC CGATCGTGAT TGGCTGCGCA AGTGGAGCGA ATTTCCCCGA TGCGTTGTCC GGAGGAGCGG TCGCGGGGTT GGCTGGTGGA CCGATCGTCC TCGTGCCTGC CATAGGCGAT CTGCCCGGAC CAACAGTGCA GTACCTCCAG CAGGCAGCCG CCTCCGCCAG CAAAGCATGG GTTTTCGGTG GGACGGCGGC CGTGTCCGAC GTCGTTTTCA ACGAAATCGC GGCCGCCCTC ACCGGCGGAT GA
|
Protein sequence | MNGRSAVAAL LAGAVLALLT AFPTTASAAQ PVVTVGAAPT PPADAQFLGP ISGAQTGTVI QVNVVLRSRD SLGLDRFMAE VTNPHSPEYR HFLTPGQFQR RFGPTTQAIN DVTAAFRNLG LQRTSALGSV LGFSGSLAQF SSALHIGFAR YRLHSGRIAR INTAAPQLPV SVARYISGIV GLDDLARAAP AIPRHAAPTR VPVWSAGTAA TPGSSTSCLS TSPTNAYYLP DQLARYYGLD TLFTVGALGA GVTVALPEFE PFLSSDIAAF NQCFGLTATP QVVTVDGGAG SGPGSGEAAL DIETVAALAP SATIKVYEGP NGAGILSVYN AIANDATVNI VSVSWLLCEA DLPSGYGASL RAIFQQMVAE GQTPLAASGD WGSAGCYPDI TPSYPNGGNT ALSVETPASF PEVTGVGGTS LPDLSGTSEV AWAGACTDGS GGSSPCGTGG GISALWTMPT WQVPLAITAQ SSGTPCGAPA GTYCREVPDV SASADPSHGY VIYWTYVDPV TSTSNPGFYV IGGTSAAAPL WAALLADISS SCRPQQWGSI NSTLYGLVGA GITLFRDITS GNNDLFGAGG YAAGTGYDMV TGLGSPQGGA LSAYLCPAAA DGAGSITATP ASVTASSTQD FSFTYAAPAG QTLRAGKLQL TVPAGWPSPS TTSGTPGYVT ATAGTVTISN STITVSGLTL AAGATVSLTY HTVTAPATAG TYPFAAAMGN SGDTPQSLTS PATIVVTAPP PPGGGGGGSG GSGGSGSGGG SGSGGGGGGG GGGAFGHPQP IRVAGADRIA TAIAASQTAF PIDAGAHVVV LARSDNFPDA LAGTPLAITK EGPILLTPTT SLDPRTLAEI ERVLPRGATV YLLGGTAALS PAIADVLLND GYVVVRLAGA DRYGTAVAIA NALGDPTTVF EADGTNFPDA LTAGTAAAHE GGVVLLTDGS RLPAATASYL AAHPGTHIAV GGPAAHADPT AVPYVGADRY ETAALVDRAF FAVPIVIGCA SGANFPDALS GGAVAGLAGG PIVLVPAIGD LPGPTVQYLQ QAAASASKAW VFGGTAAVSD VVFNEIAAAL TGG
|
| |