Gene Acel_1461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1461 
Symbol 
ID4484906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1634835 
End bp1638056 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content66% 
IMG OID639730245 
Productcell wall binding repeat 2-containing protein 
Protein accessionYP_873219 
Protein GI117928668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGGC GGTCGGCCGT TGCGGCGCTG CTCGCAGGGG CAGTTCTGGC ACTCCTCACC 
GCATTTCCCA CCACTGCGTC CGCCGCGCAA CCCGTCGTTA CCGTTGGCGC CGCGCCGACA
CCGCCTGCTG ACGCCCAGTT CCTTGGGCCC ATATCCGGTG CGCAAACCGG GACGGTGATC
CAGGTAAACG TCGTCTTACG GTCCAGGGAT TCGCTGGGCC TTGACCGCTT CATGGCCGAG
GTCACAAATC CGCACTCGCC TGAATACCGG CACTTTCTCA CGCCTGGGCA ATTCCAGCGG
CGGTTCGGCC CCACCACGCA GGCAATCAAC GACGTTACCG CGGCGTTCCG AAATCTCGGA
TTGCAGCGAA CCTCCGCCCT CGGCTCGGTA CTCGGTTTCT CCGGATCGCT TGCCCAGTTT
TCGTCGGCAC TGCACATTGG CTTTGCGCGC TACCGACTCC ATTCCGGGCG AATTGCCCGC
ATCAACACCG CCGCTCCGCA ACTGCCGGTG TCCGTCGCCC GGTACATCAG CGGCATCGTC
GGGCTCGACG ATCTCGCGCG CGCCGCGCCC GCCATCCCTC GGCACGCCGC GCCGACGCGC
GTGCCGGTGT GGTCTGCCGG GACGGCCGCT ACGCCGGGGT CGTCGACCTC CTGCCTGTCG
ACGTCACCGA CGAACGCCTA TTACTTGCCG GATCAGCTTG CCAGATATTA CGGCCTCGAC
ACACTGTTTA CCGTCGGCGC TCTCGGCGCA GGGGTGACGG TTGCGCTGCC TGAATTCGAG
CCTTTTCTGT CGTCCGACAT CGCGGCATTC AACCAATGCT TCGGTCTGAC AGCAACACCT
CAGGTGGTCA CGGTGGACGG CGGAGCCGGC TCCGGCCCCG GGAGCGGCGA GGCGGCATTG
GACATTGAGA CCGTCGCCGC GCTGGCGCCG AGCGCTACCA TCAAGGTTTA CGAGGGACCG
AACGGCGCAG GAATTTTGTC CGTCTACAAC GCCATTGCGA ATGACGCGAC AGTAAACATT
GTCTCCGTCA GTTGGCTTCT CTGTGAGGCG GACTTGCCGT CAGGATATGG CGCCAGCCTC
CGTGCCATAT TTCAACAGAT GGTTGCCGAG GGCCAGACGC CCCTCGCGGC CAGCGGTGAC
TGGGGATCGG CCGGTTGTTA TCCAGACATT ACTCCGTCAT ATCCAAACGG CGGCAACACC
GCCCTGTCTG TCGAAACCCC GGCAAGCTTT CCCGAAGTCA CCGGCGTTGG CGGCACCAGT
CTCCCGGATC TGAGCGGAAC CTCAGAAGTC GCATGGGCGG GCGCCTGCAC GGACGGCTCG
GGCGGCTCGT CTCCCTGCGG CACCGGCGGC GGAATCTCCG CACTGTGGAC CATGCCGACA
TGGCAAGTCC CACTGGCAAT CACCGCACAA TCATCCGGCA CTCCCTGCGG CGCTCCGGCC
GGTACGTACT GCCGGGAAGT GCCTGACGTC AGCGCGTCCG CGGATCCGTC ACACGGATAT
GTCATCTACT GGACGTACGT CGATCCGGTT ACGAGTACGT CGAACCCCGG CTTCTATGTC
ATCGGCGGAA CGAGCGCCGC GGCGCCGCTG TGGGCGGCGT TGCTCGCTGA CATTTCATCC
AGTTGTCGGC CTCAACAGTG GGGCTCAATA AATTCCACAC TCTATGGCCT CGTTGGTGCG
GGAATCACCC TGTTCCGTGA CATCACCAGC GGGAACAACG ACCTCTTTGG CGCAGGGGGA
TACGCAGCGG GCACGGGATA TGACATGGTG ACCGGATTGG GAAGTCCCCA AGGCGGCGCG
CTCAGCGCGT ATCTCTGCCC GGCGGCGGCG GACGGCGCCG GCTCCATCAC CGCAACGCCA
GCGTCCGTGA CGGCGAGCAG CACCCAGGAT TTCTCGTTCA CGTACGCGGC GCCTGCCGGT
CAGACCTTGA GAGCCGGAAA GCTCCAACTA ACCGTGCCTG CCGGCTGGCC AAGCCCTTCG
ACCACGTCAG GCACACCCGG CTATGTGACC GCTACCGCCG GCACCGTAAC GATAAGCAAT
TCGACCATCA CGGTAAGCGG TCTCACGTTA GCGGCAGGCG CCACGGTCAG CCTCACGTAC
CACACTGTCA CCGCGCCTGC CACAGCGGGT ACCTACCCGT TCGCCGCGGC GATGGGAAAT
TCGGGCGACA CACCGCAGTC GCTGACCAGC CCAGCGACGA TCGTCGTAAC AGCCCCTCCG
CCGCCAGGCG GCGGAGGCGG GGGCAGCGGC GGTAGCGGAG GTTCCGGGAG CGGGGGCGGC
AGCGGAAGCG GAGGAGGCGG CGGTGGCGGA GGGGGCGGCG CTTTCGGTCA TCCGCAGCCA
ATCCGTGTCG CCGGAGCAGA TCGGATAGCG ACCGCGATCG CGGCGTCACA GACCGCCTTT
CCCATCGACG CCGGCGCGCA CGTCGTCGTC CTCGCCCGTT CCGACAACTT CCCCGACGCC
CTCGCCGGAA CTCCCTTGGC CATCACCAAA GAGGGCCCCA TCCTCCTTAC GCCGACGACA
TCGCTCGATC CACGGACGCT CGCTGAAATC GAGCGCGTCC TACCCCGCGG CGCAACCGTA
TACCTCCTCG GCGGAACAGC GGCGCTCTCC CCAGCCATTG CAGACGTGCT GCTCAACGAC
GGCTACGTCG TCGTCCGACT CGCCGGAGCG GACCGCTACG GAACCGCCGT CGCCATAGCC
AATGCGCTCG GTGACCCGAC GACCGTCTTC GAAGCTGACG GCACCAACTT TCCAGACGCC
CTGACCGCCG GAACCGCGGC CGCCCACGAA GGCGGGGTCG TGCTCCTGAC CGATGGCAGC
CGGCTGCCCG CCGCCACCGC GTCCTACCTT GCGGCGCATC CCGGGACGCA CATCGCGGTC
GGTGGTCCGG CAGCTCACGC TGACCCCACC GCCGTCCCGT ACGTCGGGGC GGACCGCTAC
GAGACAGCCG CGCTCGTCGA TCGCGCATTT TTCGCCGTAC CGATCGTGAT TGGCTGCGCA
AGTGGAGCGA ATTTCCCCGA TGCGTTGTCC GGAGGAGCGG TCGCGGGGTT GGCTGGTGGA
CCGATCGTCC TCGTGCCTGC CATAGGCGAT CTGCCCGGAC CAACAGTGCA GTACCTCCAG
CAGGCAGCCG CCTCCGCCAG CAAAGCATGG GTTTTCGGTG GGACGGCGGC CGTGTCCGAC
GTCGTTTTCA ACGAAATCGC GGCCGCCCTC ACCGGCGGAT GA
 
Protein sequence
MNGRSAVAAL LAGAVLALLT AFPTTASAAQ PVVTVGAAPT PPADAQFLGP ISGAQTGTVI 
QVNVVLRSRD SLGLDRFMAE VTNPHSPEYR HFLTPGQFQR RFGPTTQAIN DVTAAFRNLG
LQRTSALGSV LGFSGSLAQF SSALHIGFAR YRLHSGRIAR INTAAPQLPV SVARYISGIV
GLDDLARAAP AIPRHAAPTR VPVWSAGTAA TPGSSTSCLS TSPTNAYYLP DQLARYYGLD
TLFTVGALGA GVTVALPEFE PFLSSDIAAF NQCFGLTATP QVVTVDGGAG SGPGSGEAAL
DIETVAALAP SATIKVYEGP NGAGILSVYN AIANDATVNI VSVSWLLCEA DLPSGYGASL
RAIFQQMVAE GQTPLAASGD WGSAGCYPDI TPSYPNGGNT ALSVETPASF PEVTGVGGTS
LPDLSGTSEV AWAGACTDGS GGSSPCGTGG GISALWTMPT WQVPLAITAQ SSGTPCGAPA
GTYCREVPDV SASADPSHGY VIYWTYVDPV TSTSNPGFYV IGGTSAAAPL WAALLADISS
SCRPQQWGSI NSTLYGLVGA GITLFRDITS GNNDLFGAGG YAAGTGYDMV TGLGSPQGGA
LSAYLCPAAA DGAGSITATP ASVTASSTQD FSFTYAAPAG QTLRAGKLQL TVPAGWPSPS
TTSGTPGYVT ATAGTVTISN STITVSGLTL AAGATVSLTY HTVTAPATAG TYPFAAAMGN
SGDTPQSLTS PATIVVTAPP PPGGGGGGSG GSGGSGSGGG SGSGGGGGGG GGGAFGHPQP
IRVAGADRIA TAIAASQTAF PIDAGAHVVV LARSDNFPDA LAGTPLAITK EGPILLTPTT
SLDPRTLAEI ERVLPRGATV YLLGGTAALS PAIADVLLND GYVVVRLAGA DRYGTAVAIA
NALGDPTTVF EADGTNFPDA LTAGTAAAHE GGVVLLTDGS RLPAATASYL AAHPGTHIAV
GGPAAHADPT AVPYVGADRY ETAALVDRAF FAVPIVIGCA SGANFPDALS GGAVAGLAGG
PIVLVPAIGD LPGPTVQYLQ QAAASASKAW VFGGTAAVSD VVFNEIAAAL TGG