Gene Acel_2010 details


Gene Information

Locus tag: Acel_2010
Symbol:
ID: 4484948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2284627
End bp: 2285841
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 70%
IMG OID: 639730804
Product: hypothetical protein
Protein accession: YP_873768
Protein GI: 117929217
COG category:
COG ID:
TIGRFAM ID:
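
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence ends in TGA); neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2284627
end_bp = 2285841

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1215 bp) and Protein Length (404 aa).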


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.785171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGCC ATCTTTCCTG GATGGGTTGT GCGGCCGCCG CTCTTGTCTT GGTGACGGCG 
TGCGCCGCGC CGAATGGACA ATCCCCACGC GCCGCCGGAT CGGCGCCTGG TCTAGCGGCG
AGCAAGGCAC GTAACGGCTC CGCGTCCCCT GCCGGCGGGG CGAGTCGGGG CGGCGCCGTG
GGCGCCGCCG CGGTGCGGCA GCTCCGCCTG CGAGGACAGA TCGGCGTCCG ACTTGCCGGC
GAAACCTTGG CCGTCACCCA TGACGGTGGC CGGACCTGGG CGCCGGTATC GCTGCCGGCA
GGGCTGGCTC CGGCGAACAT CGCGGCCGTT GATTCAGCTC CCGACGGTGC TCCGCTGCTC
GCCGCGGTGG ACCCCACGGG GTCGGTGCAC GTCTACCGCT ACGCCGCCGA GTGGAGCGCG
GTACGGCTCG ATCCACGGTG GCCGGCCGGC ACCTCGACGG CGGACGGCAC CGAACAGGTT
GCATTCCACC CGGCCGCCGG CGGGATCGTC GCCGTCGTCG TCTCCCTCGC TGGCGGCGGA
TTCAGCGCCG AGCATGCGCT TTTCGTCTCC ACCGACAACG GCGGGCATTT CGGACCGCCG
GCGACGCCGA ACGGTCCGGA TGTCAACGTG CGCTGGTCCG GTCTGCTCAT GGTGACGCCG
CGGATCGGCG TCCTCGTCGC GGGTCCCACG CAGGAATTGC TCCTGCGCAC CACGGACGGC
GGTGCGTCGT GGCAACGACT GTCTGTCCCC GGCGTCGGCG GCCCCGGCAC GTTCGCCCTC
GGGGCTCCGG TGCTCAGCGG GTCACGTATC GACATTCCGG TGACCGTGCA GGCGGCGGCC
GACGGAAGCC GGGAGAAATT CTTCCTGCTC GCTAGTGACG ATCAGGGTGC GACCTTCACC
GTCCGGGGCG TCGCTCTGGA CATCCCAGCG GATTTCGCCC CGACCGGCGC CGTAACGGGA
AACACCGGGT CCACCTGGTG GGTGGTTGCG CCGTCCATCG GCACGGTGTA TGAGACGACC
GACGACGGCG CCAGTTGGCG GACGGTTCAC GAAACCGGAC TCGCGCTGAA CACCGTCGCG
GTCACCCTCA CCGGCCCAGC AGCCGCAACC GCGGTAATCG CGGTGAATTC ATGCGCCAAT
GACAAGAGTG AGTGCACGAT GACCGTCACG GTCGAACAGA CCACGGACGG CGGTGCGACC
TGGTCCCCCG CGTGA
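
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper, not part of the source database, and it assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# Example on the first 10-base block of the gene sequence only.
print(gc_content("ATGCGCGGCC"))
```

Passing the full gene sequence to the function gives a value that can be compared with the GC content field in the record.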
 
Protein sequence
MRGHLSWMGC AAAALVLVTA CAAPNGQSPR AAGSAPGLAA SKARNGSASP AGGASRGGAV 
GAAAVRQLRL RGQIGVRLAG ETLAVTHDGG RTWAPVSLPA GLAPANIAAV DSAPDGAPLL
AAVDPTGSVH VYRYAAEWSA VRLDPRWPAG TSTADGTEQV AFHPAAGGIV AVVVSLAGGG
FSAEHALFVS TDNGGHFGPP ATPNGPDVNV RWSGLLMVTP RIGVLVAGPT QELLLRTTDG
GASWQRLSVP GVGGPGTFAL GAPVLSGSRI DIPVTVQAAA DGSREKFFLL ASDDQGATFT
VRGVALDIPA DFAPTGAVTG NTGSTWWVVA PSIGTVYETT DDGASWRTVH ETGLALNTVA
VTLTGPAAAT AVIAVNSCAN DKSECTMTVT VEQTTDGGAT WSPA
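
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with the genetic code named in the record (translation table 11). This is a sketch under stated assumptions: the amino acid assignments are written out in the usual TCAG codon ordering, which table 11 shares with the standard code, alternative start codons are not handled, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from the record (60 bases, 20 codons).
first_line = "ATGCGCGGCC ATCTTTCCTG GATGGGTTGT GCGGCCGCCG CTCTTGTCTT GGTGACGGCG"
print(translate(first_line))  # MRGHLSWMGCAAAALVLVTA
```

The printed residues match the first 20 residues of the protein sequence above. This gene begins with ATG, so the simplification around alternative start codons does not affect the example.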