Gene Acel_1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1274 
Symbol 
ID4486345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1420843 
End bp1421943 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content74% 
IMG OID639730054 
Productdiaminohydroxyphosphoribosylaminopyrimidine deaminase / 5-amino-6-(5-phosphoribosylamino)uracil reductase 
Protein accessionYP_873032 
Protein GI117928481 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.308593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.319876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGTG TGGAAGCGAC CGCACACGTC GCTGCGCACG AACTCGCCGC GATGCGACGG 
GCGCTTGCAC TGGCCGAGCG GGGCCGCGGG TCGACGAGCC CGAATCCGAT CGTCGGTTGC
GTGGTGCTGG ACGCCGCGGG TCGAGTCGTC GGGGAAGGTT TTCACCTTCG GGCGGGTGGG
CCGCACGCCG AGGTCGTCGC GCTGGCCGCC GCCGGTCCGG CGGCCCGTGG CGGCACCCTC
GTGGTCACCC TCGAACCGTG CCGGCACGTC GGCCGGACCG GTCCGTGCGT TGCCGAAATT
CGGCGCGCCG GGATTCGGCG TGTGGTGTAC GCCGTCGCCG ACCCGACAGC CGCCGGCGGT
GGCGGGGCGG AGCTCGCCGC TGCCGGCCTG GATGTCGTCG GCGGCGTGTT GGCTGCCGAG
GCCGCTGCAG CCAACCGCGC CTGGCTGCAC CGGGTTGCGA CCGGCCGGCC CTTCGTTACC
TGGAAGTACG CCGCGACCCT CGACGGCCGG GTCGCCGCGG CAGACGGCTC CAGCCGGTGG
ATCACCTCTG ACGAAGCTCG GCGCGACGTC CACCTGCTGC GCGCTCAGTC GGACGCGATC
GTCATCGGCA CCGGAACAGC GCTTGCCGAC GATCCCGCCC TCACCGTGCG GGTGGACGAC
GCTGCGCCGG ACCTGACCCA GCCGCTCCGG GTCGTGGTCG GCCGTCGCGA TCTCCCACCG
GGCGCGCGAC TACGCGACGA TACGGCGCCT ACGGTGCAGC TGCGCAAGCA CGATCCAGCG
GCTGTCCTCG CCCGGCTTGC GGACCGGGGC GTGCTGAGCG TGCTGCTCGA AGGCGGCCCA
ACACTCGCCG CCGCGTTCCT CCGAGCGCGC CTTGTGGACC GGATCGTCGC CTACGTCGCG
CCGATCCTCC TCGGCTCCGG CCCGCCGCTC GTTGCTGATC TCGGCATTGC CACTCTTGCC
GCCGGCCAGC GGTGGCGGAT CGACGAGGTC ACCCGTATCG GACCCGACCT GCGGCTCACC
CTGGCGCCGG TCTCGGCCGA CGCGACCGCG GCGGCGGCGC CGGGCGCGAC CGTGACCCCA
GCGGCTGTCG GCGTGGCCTA G
 
Protein sequence
MQGVEATAHV AAHELAAMRR ALALAERGRG STSPNPIVGC VVLDAAGRVV GEGFHLRAGG 
PHAEVVALAA AGPAARGGTL VVTLEPCRHV GRTGPCVAEI RRAGIRRVVY AVADPTAAGG
GGAELAAAGL DVVGGVLAAE AAAANRAWLH RVATGRPFVT WKYAATLDGR VAAADGSSRW
ITSDEARRDV HLLRAQSDAI VIGTGTALAD DPALTVRVDD AAPDLTQPLR VVVGRRDLPP
GARLRDDTAP TVQLRKHDPA AVLARLADRG VLSVLLEGGP TLAAAFLRAR LVDRIVAYVA
PILLGSGPPL VADLGIATLA AGQRWRIDEV TRIGPDLRLT LAPVSADATA AAAPGATVTP
AAVGVA