Gene HS_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1050 
SymbolribD 
ID4240548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1157428 
End bp1158558 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content40% 
IMG OID638104611 
Productdiaminohydroxyphosphoribosylaminopyrimidine deaminase / 5-amino-6-(5-phosphoribosylamino)uracil reductase 
Protein accessionYP_719262 
Protein GI113461193 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.411969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCAA CATTTTCAGC TCAAGACGAA TATTTTATGC AGATAGCTTT AGAACTTGCG 
AAAAAAGGCA TATTTACCAC AACACCTAAT CCTGCTGTGG GCTGTGTTTT AGTCAAAAAT
GGTGAAATTG TTGGGAGAGG TTTTCATTTT AAAGCGGGGC AACCCCATGC GGAAGTTATG
GCATTACGTG ATGCCGGTGA TAGAGCAAAG GGGGCAACTG CTTATGTGAC TTTAGAGCCT
TGTTCTCATT TCGGAAGAAC GCCTCCTTGT GCACAAGGAC TAATTGAAGC AGGTATTCGC
AACGTTATTG TGGCAATGAA AGATCCTAAT CCGCAAGTTG CAGGCAAAGG GTTAGCTATG
TTGCAAGCAG CAGGTATTGA AAGTGCGGTA GGATTATTAC AAGAAAAAGC GGAATTATTA
AACAAAGGTT TTTTAACACG AATGAGAACA CAGAAACCTT TTGTTATTTT GAAAATGGCA
ATAAGCCTTG ACGGTCGAAC TGCTATGGCT AATGGCGAGA GTAAATGGAT TAGCGGAGAA
CAGGCACGGC AAGACGTACA GCAAGAGCGG GCTAAAGTTT CTGCAATTTT ATCGACAGCT
AAGACGGTGT TAGCCGATGA TCCCTTGTTA AATCTTCGTT GGGAGCAATT TCCATCGGAA
TTACAACAAA ATTATAAAGT TGAAGAAGTG CGTCAACCTG TCAGAATTAT TCTTGATCGT
CTGCATCTTG TGACACCTCG TCATAAATTA TTTCAATGTC AATCTCCCGT TTGGCTGGTG
GGTGATAAAG AGCGTGATAT GTCAGCGTTT CCTGATTATT GTCAATATAT AAAATTATTC
CCAAGTAACG AGCATTCTTA TTTGGAAAAC TTGTTGATTG AATTGGCAAA GCGTCAGATT
AATAGTCTTT GGTTAGAAGC GGGAGAAACT TTAGCCGGTG CATTTATTGA GGAAAATTTA
GTGAATGAGC TCATTATTTA TATGGCACCA AAATTACTGG GTAATGAAGC TCGTGGTTTT
TGCCATTTGC CACATTTGAA GCGTTTAGCC GATGCACCAA AGTGGCAATT GTTATCTTTG
GCACAAATTG GCGAAGACAT TAAGTTGAAT TATCAGCGTC ACATTTTATG A
 
Protein sequence
MQPTFSAQDE YFMQIALELA KKGIFTTTPN PAVGCVLVKN GEIVGRGFHF KAGQPHAEVM 
ALRDAGDRAK GATAYVTLEP CSHFGRTPPC AQGLIEAGIR NVIVAMKDPN PQVAGKGLAM
LQAAGIESAV GLLQEKAELL NKGFLTRMRT QKPFVILKMA ISLDGRTAMA NGESKWISGE
QARQDVQQER AKVSAILSTA KTVLADDPLL NLRWEQFPSE LQQNYKVEEV RQPVRIILDR
LHLVTPRHKL FQCQSPVWLV GDKERDMSAF PDYCQYIKLF PSNEHSYLEN LLIELAKRQI
NSLWLEAGET LAGAFIEENL VNELIIYMAP KLLGNEARGF CHLPHLKRLA DAPKWQLLSL
AQIGEDIKLN YQRHIL