Gene Acel_1138 details

Gene Information

Locus tag: Acel_1138
Symbol:
ID: 4484626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1266142
End bp: 1267377
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 69%
IMG OID: 639729913
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_872896
Protein GI: 117928345
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
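
The coordinates and lengths listed in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1266142
end_bp = 1267377

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1236
print(protein_length)  # 411
```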


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.730169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0524008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTCG ACGTCGACGT CGTGCGGAAG GATTTTCCGA TCCTTGAGCG CACGGTGCGG 
GATGGACGGC CGCTGGTCTA CCTCGACAGT GCCAACACGT CGCAGAAGCC GCGCGCCGTC
CTCGACACGC TCACCGCCTT CTATGAACGG CACAACGCGA ACATCCACCG CGCGACCCAC
GCCCTGGGCG AAGAGGCGAC CGAGGCGTAC GAGACCGCGC GGATGAAGGT CGCGGACTTC
ATCGGTGCCG GTGCAGCGGA AGAGGTCGTC TTCGTCAAGA ACTCCTCGGA GGCGCTCAAC
CTGGTAGCCA ATGTGCTGAG CTGGGGGCCG CGGGCGGTCG GTCCCGGTGA CGAGATCGTC
ATCACCGAGA TGGAGCACCA CTCGAACATC GTGCCCTGGC AGATCCTCTG TGAGCGGACC
GGCGCCCGGC TCCGCTGGTT CGGCGTCACC GATGACGGCC GCCTCGACCT GGACGGCATG
GACGACCTGC TCACCGAGCG CACCCGGGTC CTCGCCGTCG TCCACGTTTC GAACGTCCTG
GGCACGGTGA ACCCGATCCC GCTCCTTGCC GAACGCGCCC ACCAGGTAGG GGCGCTCGTG
GTCGTCGATG CGTCGCAGTC CGTCCCGCAC ATGCCGGTGG ATGTGGCCGC GTTGGGCGCG
GACTTCCTGG CTTTCACCGG GCACAAGATG TGCGGACCGA CCGGCATCGG CGTGCTCTGG
GGACGCCGCG ACCTGCTCGA GGAGCTGCCG CCGTTCCTCG GCGGCGGCGA AATGATCGAG
ACGGTCACCA TGGAGAAGTC CACGTACGCC GCCGTGCCGC ACAAGTACGA GGCCGGCACA
CCGCCGATCG CGCAAGCGGT GGGACTCGGC GCCGCGGTCG ATTACCTGCG CAGCATCGGC
ATGGACCAGA TCGCGGCGCA CGAGCGGGAG CTCACCGCGT ACGCCCTCGG GCGGCTCACG
GAACTTCCCG GCGTGCGCAT CCTCGGCCCG ACGGAAGCGG TCGACCGGGG GAGCGCGATC
TCGTTCGTGG TGGACGGCGT CCACCCCCAC GACGTCGCCC AAGTGCTGGA TGCGCACGGC
GTGGCAGTCC GCGCCGGCCA CCACTGCGGC CGGCCGATCC ACCTGCGCTT CGGGGTTGCC
GCGTCGACCC GCGCATCCTC GTACCTGTAC ACCACCGAGG GGGAGATCGA CGCCCTCGTC
ACGGGATTGC ACGCGGTGCG GAGGTTCTTC GCCTGA
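
A GC content figure such as the one in the record can be recomputed from a sequence printed in 10-base groups like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases; `gc_content` is a hypothetical helper name, and the example call uses only the first 10-base group of the gene sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) used for display grouping is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base group of the gene sequence: 6 of 10 bases are G or C.
print(gc_content("ATGGCGTTCG"))
```

Pasting the full gene sequence into the call gives a value that can be compared with the GC content reported in the record.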
 
Protein sequence
MAFDVDVVRK DFPILERTVR DGRPLVYLDS ANTSQKPRAV LDTLTAFYER HNANIHRATH 
ALGEEATEAY ETARMKVADF IGAGAAEEVV FVKNSSEALN LVANVLSWGP RAVGPGDEIV
ITEMEHHSNI VPWQILCERT GARLRWFGVT DDGRLDLDGM DDLLTERTRV LAVVHVSNVL
GTVNPIPLLA ERAHQVGALV VVDASQSVPH MPVDVAALGA DFLAFTGHKM CGPTGIGVLW
GRRDLLEELP PFLGGGEMIE TVTMEKSTYA AVPHKYEAGT PPIAQAVGLG AAVDYLRSIG
MDQIAAHERE LTAYALGRLT ELPGVRILGP TEAVDRGSAI SFVVDGVHPH DVAQVLDAHG
VAVRAGHHCG RPIHLRFGVA ASTRASSYLY TTEGEIDALV TGLHAVRRFF A
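
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases only. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the set of permitted start codons); `translate` is a hypothetical helper written for illustration, not a tool used by the source database.

```python
from itertools import product

# Codon table built in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace used for display grouping is ignored; any trailing
    partial codon is dropped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGTTCG ACGTCGACGT CGTGCGGAAG"))  # MAFDVDVVRK
```

The printed peptide matches the first ten residues of the protein sequence above.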