Gene Acel_1742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1742 
Symbol 
ID4484863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1967115 
End bp1968422 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID639730532 
Productbifunctional uroporphyrinogen-III synthetase/response regulator domain protein 
Protein accessionYP_873500 
Protein GI117928949 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.849504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATC AGGTCGGCGG TGTGCCCCCG CTGACCGGTT TCACCGTTGC AATTACCGCT 
GAGCGTCGCC GGGAGGAGCT TGCCGAGCTC CTGCGGAGGA GGGGCGCCCG GGTAAGAGTG
GTGAGCACGT TACGCACCGT CTTCGCGGAT GACGCCGAAA TTGCCGCGGC CACCGAGGAA
GCGCTCACTG GGCCTGTGGA CATTGTTATC GTGACGACAG CTGTCGGTTT TCGCCGTTGG
CTGGAGAGCG CGGACGGCCT TGGCTACGGC GACAGACTGC GGATGGCTCT GTGGAGCGCT
GACCTTGTCG CCCGCGGCGC GAAGGCACGC GGAGCTGTCC GCGCCGCCGG ATTGCCCGAA
ACGTCCCCCC CGGTCGAGAC GATGTCGGAG ATCGTGGAGT TTCTCCGGGC GCGTGGCGTC
TGCGGCAAAC GCGTCGTGGT GCAGGCGCAC GGTGATCCGG ACGACCGAAC AGCGTCGGCT
CTTTGCCGCG ATGGTGCCGA GGTGGTCGTC GTCCCGGTGT ATCACTGGTC GTCGGCTGAT
GCCGCACGTG CGGAGCGGCT TGCCGGCGAT ATCGTGACCG GCCGGCTCGA TGCGGTGGTT
TTCACCAGCC GACCCGCATG TGACGGGCTT CTGCGGGCGT CCGGCGACGC GCGCGAGCAC
GTGATTGACG CCTTTCGCGC GGGTGTTGTT GCCGCGTGCA TCGGCCCGGT ATGTGCGCAG
CCGCTTGTCG ACGCCGGCGT TGACGTCGTC ATGCCGCCGC GGGGCCGGCT CGGCGACCTG
GTGCGCGCCG TGACAGAAGA GGTGCCGAGG AGGCGGACGC TGTCCACGGC AATCGGCAAC
CAGCGCCTCG AGGTACGGGG TCACGGCGTC GTTTACGGGG AGGCCTTTAC CGTGCTACCG
CCCGCGCCGT TGGCGGTCTT GCGGGCCTTG GTCGCGAATG CCGGTCGTGT GGTCACGAAA
TCGGCGCTGG CCGCGGCGAT CGGTACGGCG GTAGGTCGCC AGCAACCGAG TCAGGCACTC
AAACGGCTGG AGCATCGGCG GGTAGAAATG GCAGTCGCTC GACTACGACG TGCAGTGCCG
CACATACCCG TTACCGCTGT TATCAAACGC GGGTATCGTC TTGCCCAGCC TGGGAGCGAA
GGTGTTGTTT TCGTCACCGG CAAGGGCGAC GTGGACCGGA GTCATGCGCC CATGCCGGAT
GAGGAAGCCA CGCCCGGGCG GCATCCGGCC GTGGCTTCTC GGCATGCGAA TCCCGAAGAG
GTCACCGTCG AGCGGACTAC GTGGGTCGAG GAGCACTCCG CTCCGTGA
 
Protein sequence
MINQVGGVPP LTGFTVAITA ERRREELAEL LRRRGARVRV VSTLRTVFAD DAEIAAATEE 
ALTGPVDIVI VTTAVGFRRW LESADGLGYG DRLRMALWSA DLVARGAKAR GAVRAAGLPE
TSPPVETMSE IVEFLRARGV CGKRVVVQAH GDPDDRTASA LCRDGAEVVV VPVYHWSSAD
AARAERLAGD IVTGRLDAVV FTSRPACDGL LRASGDAREH VIDAFRAGVV AACIGPVCAQ
PLVDAGVDVV MPPRGRLGDL VRAVTEEVPR RRTLSTAIGN QRLEVRGHGV VYGEAFTVLP
PAPLAVLRAL VANAGRVVTK SALAAAIGTA VGRQQPSQAL KRLEHRRVEM AVARLRRAVP
HIPVTAVIKR GYRLAQPGSE GVVFVTGKGD VDRSHAPMPD EEATPGRHPA VASRHANPEE
VTVERTTWVE EHSAP