Gene P9303_19231 details


Gene Information

Locus tag: P9303_19231
Symbol:
ID: 4776867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1691301
End bp: 1692407
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 59%
IMG OID: 640087433
Product: putative diaminohydroxyphosphoribosylaminopyrimidine deaminase and 5-amino-6-(5-phosphoribosylamino)uracil reductase
Protein accession: YP_001017930
Protein GI: 124023623
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
        [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
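
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The snippet below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information block.
start_bp = 1691301
end_bp = 1692407

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1  # the stop codon encodes no residue

print(gene_length)     # 1107
print(remainder)       # 0
print(protein_length)  # 368
```

Under these assumptions the computed values agree with the listed Gene Length (1107 bp) and Protein Length (368 aa).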


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.533116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTCA ACCCCTCTGC AAGTGCTGTT TGGCTGCCAT GGATGCGGCG GGCGTTGCAG 
TTGGCTGCGC TCGCAGATGG TCGCACCAGT CCCAACCCTC TTGTTGGGGC CGTTGTTCTC
GACAAGGCTG GCAAGCTTGT CGGAGAGGGT TTCCATGCAT GTGCTGGTGA GCCCCATGCT
GAGGTAGGCG CTCTCGCACA AGCTGGTGAA CAGGCCAGCG GTGGAACCCT GGTGGTCACC
CTGGAACCCT GTTGCCATCA AGGCCGCACG CCTCCCTGCA CGCAGGCCGT CATCGCTGCC
GGACTTCGCC GAGTTGTGGT GGCAATGCAG GACCCCGACC CACGCGTTGC TGGGGCCGGC
ATCACTCGTT TGCGCGATGC CGGCCTCGAG GTGATCACTG CTGTGTTGGA GCCGGAAGCC
GCACATCAGA ACCGGGCTTT TGTGCATCGT GTTTCTACTG GGCGCCCCTG GGGGATTCTC
AAATGGGCGA TGAGCCTCGA TGGACGCACG GCTCTGCCCA ATGGCGCCAG TCAGTGGATC
AGTGGTGGTG AAGCGCGTAG CTGGGTGCAT CGCTTACGTG GCCAATGTGA TGCTGTGATC
GTTGGCGGCG GCACTGTGCG TGCGGACGAT CCGTTGCTGA CCAGTCGTGG GCACTCTGAC
CCCGAACCAA AGCGGGTGGT GCTGAGTCGC AGCCTTGATT TGCCTCAACA AGCTCAGCTT
TGGGATATTG CAGTGGCTCA CACCCTCGTT GCTCATGGCC TAGAGCCTGG CCATGAACAG
TTGGCTCATT TGCCTGAGGG GCCTGAGCTA CTTGCTTTGC CTGCCTCTGA ACCGCTTGAG
TTGCTGCAGG CCTTAGCTCA ACAAGACTGC AATCGTGTGT TGTGGGAATG CGGGCCAGCT
TTAGCAGCCG CAGCATTGCA GCAAGGCTGT GTTCAAGAAT TGGCGGTGGT GGTAGCTCCC
AAGCTGTTGG GTGGCTTGCC GGCCAGAACA CCATTTGATG ATCTTGGCTT CACAAGCATG
AAAGAGGTTG TTGGGCTCGC GTCTGGCTCA TTGCAGCAGT TGGGGGCTGA CTGGCTCTTG
CAATATGAGC TTTCTAAGCA TTGCTGA
 
Protein sequence
MNVNPSASAV WLPWMRRALQ LAALADGRTS PNPLVGAVVL DKAGKLVGEG FHACAGEPHA 
EVGALAQAGE QASGGTLVVT LEPCCHQGRT PPCTQAVIAA GLRRVVVAMQ DPDPRVAGAG
ITRLRDAGLE VITAVLEPEA AHQNRAFVHR VSTGRPWGIL KWAMSLDGRT ALPNGASQWI
SGGEARSWVH RLRGQCDAVI VGGGTVRADD PLLTSRGHSD PEPKRVVLSR SLDLPQQAQL
WDIAVAHTLV AHGLEPGHEQ LAHLPEGPEL LALPASEPLE LLQALAQQDC NRVLWECGPA
LAAAALQQGC VQELAVVVAP KLLGGLPART PFDDLGFTSM KEVVGLASGS LQQLGADWLL
QYELSKHC
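
The relationship between the gene sequence and the protein sequence can be illustrated by translating the nucleotides codon by codon. The following is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first and last lines of the gene sequence are used here; the last line begins at base 1081, so it is in frame.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments of table 11
# match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAACGTCA ACCCCTCTGC AAGTGCTGTT TGGCTGCCAT GGATGCGGCG GGCGTTGCAG"
last_line = "CAATATGAGC TTTCTAAGCA TTGCTGA"

print(translate(first_line))  # MNVNPSASAVWLPWMRRALQ
print(translate(last_line))   # QYELSKHC
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the listed protein sequence and GC content.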