Gene Acel_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2111 
Symbol 
ID4484964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2387266 
End bp2388903 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content67% 
IMG OID639730912 
Producthypothetical protein 
Protein accessionYP_873869 
Protein GI117929318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.233844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG ATCTCCCCTA CGGACAACCT GACGGCTCGC CGTCGGCTGG CGGCGGCTCG 
GGCGGAGAGC ACTGGGCACC TGGACCGTCG CCGGCGGACG CCGTATCGCC CGGTCTCACC
CCGGCCGTCC CGACGCCGTC GGGTCCAGGG GCGTGGTCCG CCGACGTCCT TCCGCCGGGG
CCCGAGGTAC CGGCGGGCAA GGGGAAGCGT CGATGGCGTT GGACGGCCGC GGTCGTTGGG
GTGGCCGTTG CGTTCGTCGC TGCGGGGACG GCAGTTGCCG CGTATCGTCT CACGGCGCAT
CAGGGTGGTG ACCGCATCGA TTCGCTCGCG CCGGCGACGA CGGTCTTCTA CGTCAAAGTG
AATGTGAGCC CGGGAGGTTC AGCCGGGACC GACGTCGGCG CGTTTGCCCG CCATGTTCCG
GCTCTTTCGG GGGTAACGGA TGCTCGGACG TTGCGTGATT GGGTCGTGCG CCGTGCTCTC
GGCGGATATG CCGCCGAGTA CGACGCCATG GTGAAACCGT GGCTCGGCGA TGAGGTTGCC
ATCGGTGTCT TTCCCTCCGG GCAGGAACGT CACAGTTTTG CCTTGCTCCA CGTGACGGAT
GCGGCGAAGG CCGAGCAGGC GCTCGGCGGT CTCGCCGCAC TGGTCAAGAG CCGGAACAAT
CCCGGCAACG ACCTGGCGTA TCGGATCACC GACGGTTTCG CGGTCGTCAC CGATTCACCG
GCGGCACTCA ATGACCTACT CGCGGACGTT CCGCACCAAT CCCTGGCCAA CCAGCGGACG
TACACCACCG ACGTCGCATC GCTGGGATCC GGTCACCTGG TAGTCGGTTG GGCCGATCTG
GCGGCCGCTT CGCGGCTCGC GATGGACGAA GCCAAGGCAC TGGGCGCGGC GGGCAGTCTG
CTGGGCAGTG TGGACACCGC AGCTGCACAG GGACGGCTCG TTTTTGCCGG AACAGTGCAT
GCCACGTCTC TCGATCTAGA CGCCCGCGTG CTCGGGGCAA CTGCGACCAC GGCGCCCCTG
GCGGATTTGA GCGGTATGTT GGGCCGCCTG CCGGCCGACA CCCAACTTGG TGTTGCGCTC
GCCGGACCGG ATCAAGTTCT CAAGAGCCTG TTCGGCCGCC TGTCCAGTGA TCCGCTTACC
TCCGCCTTCC TCGGAAATCA GCTCGGCAGC CTGCAGAGCC AGACCGATCT TCATCTTCCG
GACGACATTT ACGGTTACGT CGGCAGCGCG TTGGCGATCG GCATTTCCAG GCCGGCCGGT
GCGGCGGACA AGCCGCAGGA CGCCGACATC ACCGTCCTGA CCGAGCCGAC GGATTCTGCT
GCAGCGACTC GGGTCGCGGC CGCACTGCGT GGATGGGCGG CGGGCGGCCG TGCTCTTACC
GTCGCGCCTG GGCACCCCTT CGTCATCAGC ACGAAGCCAC AGCCGTTGGC AGGTCCGGCG
CTTCAGAATG ATCCGTTGTA CCGGGCGGCT ATCGACGGCA TGCCGTCGCG CGTGATCGCC
GCCGGATATC TCGCCGTCCC TCGGGAGCCT GGAACGAACA GCTCCACGGA CGCCGGGTCG
GGCGGTGGTC TCGGCTTCTA CGTGGTCCCG GGCGGGGATT CGAGCGTGGT CGCCCATCTC
CGTCTGGTAT TTCCCTGA
 
Protein sequence
MTDDLPYGQP DGSPSAGGGS GGEHWAPGPS PADAVSPGLT PAVPTPSGPG AWSADVLPPG 
PEVPAGKGKR RWRWTAAVVG VAVAFVAAGT AVAAYRLTAH QGGDRIDSLA PATTVFYVKV
NVSPGGSAGT DVGAFARHVP ALSGVTDART LRDWVVRRAL GGYAAEYDAM VKPWLGDEVA
IGVFPSGQER HSFALLHVTD AAKAEQALGG LAALVKSRNN PGNDLAYRIT DGFAVVTDSP
AALNDLLADV PHQSLANQRT YTTDVASLGS GHLVVGWADL AAASRLAMDE AKALGAAGSL
LGSVDTAAAQ GRLVFAGTVH ATSLDLDARV LGATATTAPL ADLSGMLGRL PADTQLGVAL
AGPDQVLKSL FGRLSSDPLT SAFLGNQLGS LQSQTDLHLP DDIYGYVGSA LAIGISRPAG
AADKPQDADI TVLTEPTDSA AATRVAAALR GWAAGGRALT VAPGHPFVIS TKPQPLAGPA
LQNDPLYRAA IDGMPSRVIA AGYLAVPREP GTNSSTDAGS GGGLGFYVVP GGDSSVVAHL
RLVFP