Gene Acel_0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0237 
Symbol 
ID4485375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp257040 
End bp258035 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content72% 
IMG OID639729000 
Producthydroxymethylbilane synthase 
Protein accessionYP_871997 
Protein GI117927446 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAG CCGCCGCCCT CTCCGTGCTG CGGCTGGGCA CCCGGCGCAG CACGCTCGCC 
CGCGCGCAGA CCGAGGAGAT CGCCGGCGCC CTCCGTGCCG CCGGCTGCCG GGTGGAAATC
GTCGGTATCC AGAGCACGGG CGATCGGCAC GCCGACGTCC CCCTGCACGA ATTCGCCGGC
TCCGGCGTTT TCGTCGCCGA GCTCCGGGCC GCACTGCTCC GCGGCGAGGT GGACGTCGTC
GTCCATTCGA TGAAGGATTT GCCGACGGCG GAAATACCCG AGCTGGCCAT TGCGGCCATC
CCGCGCCGCG CGGATCCGCG CGATGCGCTC GTCACCGGCG CGGGATGCCG GCTGGCGGAA
CTGCCGACGG GTGCGATCGT CGGCACCGGA TCGCCACGGC GCGCCGCCCA ACTGCGGCTG
CTCCGGCCGG ATCTGGAAAT TCGCCCGATC CGCGGTAACC TCGATACCCG GCTCGGCAAA
CTCCACGCAG GCGGGTACGC CGCGTTGATT GTGGCGGCGG CCGGACTTGC CCGATTGCAC
CGGTCGGAAG AAGCCGCCGA ATTCTTCGAC CCGACGGTGA TGCTGCCGGC ACCCGGTCAA
GGCGCGCTCG CCGTCGAGTG CCGCCGGGCG GACATCGCGG ACGGCGGCCG GCTCGCCGGG
ATTCTCGCCG GCCTGGACGA TCCGGCGACC CGGGCGGCGG TCACCGCAGA GCGTGCGCTG
CTGGCCGCCG TGGGCGCGGG GTGCTCGGCG CCGGTGGGTG CGCTGGGCGT GGTCACCGCG
GACACCCTGC AGCTGGACGC CGTCGTCGTC GACCCGTCCG GCACGACCGC ATTCCGCCGG
TCGTTGACCG GGACGCCGGA CGACGCAAGC GACCTCGGGC GGCGGCTCGC CGCCGATCTG
ATCCGCGCGG GGGCGGATCA GCTGCTCCAG GCTCCGAAAC AAACGGGGGA ACCGCATGAC
CCCGACAGGC ACGACAAAGG AACAGGACGA CCATGA
 
Protein sequence
MTTAAALSVL RLGTRRSTLA RAQTEEIAGA LRAAGCRVEI VGIQSTGDRH ADVPLHEFAG 
SGVFVAELRA ALLRGEVDVV VHSMKDLPTA EIPELAIAAI PRRADPRDAL VTGAGCRLAE
LPTGAIVGTG SPRRAAQLRL LRPDLEIRPI RGNLDTRLGK LHAGGYAALI VAAAGLARLH
RSEEAAEFFD PTVMLPAPGQ GALAVECRRA DIADGGRLAG ILAGLDDPAT RAAVTAERAL
LAAVGAGCSA PVGALGVVTA DTLQLDAVVV DPSGTTAFRR SLTGTPDDAS DLGRRLAADL
IRAGADQLLQ APKQTGEPHD PDRHDKGTGR P