Gene Cpha266_0944 details


Gene Information

Locus tag: Cpha266_0944
Symbol:
ID: 4570640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1086002
End bp: 1087102
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 52%
IMG OID: 639765545
Product: 5-amino-6-(5-phosphoribosylamino)uracil reductase / diaminohydroxyphosphoribosylaminopyrimidine deaminase
Protein accession: YP_911416
Protein GI: 119356772
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
        [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
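
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1086002
end_bp = 1087102
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print True for this record.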


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCACG AAAGTCATGA ATGGTATATG AACCGCTGCT TTGAGCTTGC TTTGCAGGGC 
TCTGGTATGG TGAGCCCCAA TCCTATGGTT GGCTCTGTTA TTGTCCATAA TGGAGAGATT
GTCGGCGAAG GGTATCATGA GCGGTTTGGC GGGCCTCATG CCGAGGTCCA TGCAATTGCA
TCGGTTGGCA ATGCCGAGGT GCTGCAGAAT TCCACGCTTT ACGTCAATCT TGAACCTTGC
TCTCATTTTG GAAAAACGCC TCCCTGCGCG GATCTTATTC TTGCAAAACG GATACCCCGC
GTCGTGATCG GTTGCCGTGA TCCTCATGAA AAAGTGGCGG GAAAAGGTAT TGAAAGGCTC
ACGGCCGCAG GGGTTCAGGT TACCGAGGGC GTGCTTATGC CCGAAGCTCT TAAACTCAAT
GAGGCTTTTA TTAAAAGCTG TACCGTCGGA TTGCCTTTTG TGACGCTCAA GCTTGCCCAG
ACTCTTGACG GTAAAATTGC TACTGCCGGG GGAGCTTCAC GATGGATTAC CGGCGAGGAG
TCGAGAACAC AGGTGCATCG CCTGAGATGC AACTGTGATG CGGTAGTTGT CGGAGAGGCG
ACCGTACAGG CTGACGATTC GGAGCTCACC GTCAGGCATT GTGCCGGGCG AAATCCTCTT
CGTGTTTTAC TTGACCGCAG GCTCTCTCTT TTCGCCGACG CCAGGATTTT CAGTACTGAA
GCCGCGACGC TCGTTTTTAC AACAAGAGCA TCCTCCTGTT CAGTAAAAGC GGAGCAGCTT
CGAAAGAGAG GGGTGGAGGT GATTGGTGTT GACGAAGATG CGACAGGTCT TGATTTGAAA
CGAATTCTTC AGGAACTGCA CAGGCGTGGA ATTATCTCCC TTCTTGTTGA AGGAGGAAGC
CGTCTGTCGG CATCGTTTCT TGGTGCGGCC CTTGTCGATA AACTTCTTGT GTTTATTGCT
CCCAGGCTTT TCGGTGGTGA CGGGTTGAGC GCTTTTGCTC CGCTTGGCGT GGAACTTCCG
GATCAAGCTG TTCAACTGCG TTTTCAGCCG CCGGTTTTTT TTGGCCGGGA TCTTCTGCTT
GAGGCCTATG TTGTATCGTG A
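
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first row of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed above (60 bases, illustration only).
first_row = "ATGCATCACG AAAGTCATGA ATGGTATATG AACCGCTGCT TTGAGCTTGC TTTGCAGGGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the whole gene sequence into `clean_sequence` gives a value that can be compared with the rounded GC content listed in the record.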
 
Protein sequence
MHHESHEWYM NRCFELALQG SGMVSPNPMV GSVIVHNGEI VGEGYHERFG GPHAEVHAIA 
SVGNAEVLQN STLYVNLEPC SHFGKTPPCA DLILAKRIPR VVIGCRDPHE KVAGKGIERL
TAAGVQVTEG VLMPEALKLN EAFIKSCTVG LPFVTLKLAQ TLDGKIATAG GASRWITGEE
SRTQVHRLRC NCDAVVVGEA TVQADDSELT VRHCAGRNPL RVLLDRRLSL FADARIFSTE
AATLVFTTRA SSCSVKAEQL RKRGVEVIGV DEDATGLDLK RILQELHRRG IISLLVEGGS
RLSASFLGAA LVDKLLVFIA PRLFGGDGLS AFAPLGVELP DQAVQLRFQP PVFFGRDLLL
EAYVVS
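
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which start codons are permitted, which this sketch does not handle). The `translate` function and the compact table layout are illustrative, not the database's own method.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: amino-acid assignments of table 11 equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCATCACG AAAGTCATGA ATGGTATATG"))  # MHHESHEWYM
```

The output matches the first 10 residues of the protein sequence. The final row of the gene sequence, `GAGGCCTATG TTGTATCGTG A`, translates to `EAYVVS` followed by the TGA stop codon, which matches the end of the protein.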