Gene P9303_28831 details


Gene Information

Locus tag: P9303_28831
Symbol:
ID: 4778957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2549409
End bp: 2551256
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 55%
IMG OID: 640088406
Product: flavodoxin:flavin reductase-like domain-containing protein
Protein accession: YP_001018878
Protein GI: 124024571
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0426] Uncharacterized flavoproteins
        [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family
TIGRFAM ID:
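
The coordinate and length fields in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions and the protein length is taken to exclude the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that check, with illustrative variable names that are not part of the record:

```python
# Values copied from the Gene Information fields.
start_bp = 2549409
end_bp = 2551256

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the Gene Length (1848 bp) and Protein Length (615 aa) fields.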


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a


Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGTCGATAT CCTCCATCCA CGAGCCTGCA GCAGCAGCGC AGCGAACGGT GATCACACTC 
CCAATCGAAA AGGGCCTGAT CAGCTTGCGC GGCCTTAGTC CACAACGTCT GCGCTTTGAG
CTGGAATATG CCCTGGAGCG AGGCAGTACC GCCAATAGCT TTCTTTTCTC TGCCGGTGAC
GACTCACATG GGCAACCTCA ATCGGCTGTC CTCGTACACC CCCCTGGCGA CGCCTATGCC
GAGGTTTTCA TGCCGGCACT CGCCAAGGCT CTACCTTCAG ATACCACAAC GTTGAAGGTG
GTCGTTGGTC ACATCAACCC CAACCGAGTT GCGCTACTCA AAAAGCTGGC CAACAGCTAC
CCCAAGCTGG AGTTAATCAG TTCCAATCCC GGCGCCAAAT TGCTCAAAGA GCTTTGGGAA
CAACGCAAAC CAGCAACACC CAACAACAAT GAACAGGAAG AGTCATCCCT TCCGTCTCTT
CCATCCATTG AGATTGTTCG ACAAGAACAG AAGCTCTCCC TCAGCAACGA ACACGCATTG
TGGCTGCTAC CAGCGCCAAC AGCTCGCTGG CCAGGCGGCC TACTGGCCTT CGAGGAAAGC
CTTGGCTTGT TGATGAGCGA CAAACTATTC GCCGCCCACC TCTGCACAAG CGAATGGGCA
GAAGCCAATC GCATCAGCAC AGAAGAGGAG CGTAGGCATT TCTATGACTG CCTGATGGCT
CCCATGGCCA GCCAGGTAGA TACCTTAGTA GAGCGGCTTG AAGAGCTAGA CATCCGCACG
ATCGCCCCAT GCCATGGGCC AGCCATAGAA ACGAGCTGGC GGAGCCTGCT GAATGACTAC
CGCCGCTGGG GTGAAAGCCA ACAACAAGCC CCTTTAAAGG TCGTTCTTCT TTTCGCCAGC
GCCTACGGCA ACACAGCGGC GATTGCTGAC GCACTCGCAA AAGGAGTCTC CAGTACTGGT
ATTCAAGTAG AAAGCCTCAA CTGCGAATTC ACACCTGCGA ATGAATTGGT AAATGCAATC
CAACAAGCTG ATGCCTACTT GATTGGATCG CCAACCCTTG GAGGGCATGC ACCAACCCCA
ATCGTATCGG CCCTAGGAAC CTTGCTGGCC GAAGGTGACC GCAACAAAAA GGTAGGCATA
TTCGGCAGCT ATGGCTGGAG TGGAGAGGCA TTGGAACTTC TCGAAAAGAA GCTCCGTGAT
GGTGGGTTCT CCTTTGGATT CGAGCCAATC AAAGTGAAGT TCAGTCCCGA TGCTGCCATG
GTGAAAACCC TGGAAGAAAC AGGCACACTC TTTGGCCGAA AACTCCTCAA GCAACAACAA
CGCGAGCAAC CACGAGCAAG CAGTGGCATG AGTGCAAGCC GTAGTGATCC AGCCGTGCTT
GCCCTTGGTC GGGTAGTGGG CTCACTATGC ATCTTGACGG CTCGTAAAGG TGAAGGGAAT
ACAGCGCTTA GCGGCGCAAT GGTCGCAAGC TGGGTCAGCC AAGCCAGCTT TTCACCGCCA
GGGCTGAGCG TGGCCGTCGC CAAAGACCGA GCCGTTGAAG CGTTGCTGCA TCGGGGCGAC
CACTTCGCTC TCAATGTGTT GGCAGCAGGA AGGCAACACG AACTGATGAA ACATTTCCTG
CAACCATTCC CAGCTGGTTC AGACCGGTTC GCAGGGCTAG ACCTTGACGC CAGTCCCGCA
GGTCAACCGC TGCTTAAAAA TGCGCTGGCA TGGCTTGAAG GATGCGTACA GCAACGCATG
GAATGTGGAG ACCACTGGCT GCTATATGCC GAGATCAGCC ATGGTGCCCT ACTGGAGCGA
GAAGGCACGA CGGCTGTGCA TCAGCGCCGC AGCGGGGCGA ACTACTGA
 
Protein sequence
MSISSIHEPA AAAQRTVITL PIEKGLISLR GLSPQRLRFE LEYALERGST ANSFLFSAGD 
DSHGQPQSAV LVHPPGDAYA EVFMPALAKA LPSDTTTLKV VVGHINPNRV ALLKKLANSY
PKLELISSNP GAKLLKELWE QRKPATPNNN EQEESSLPSL PSIEIVRQEQ KLSLSNEHAL
WLLPAPTARW PGGLLAFEES LGLLMSDKLF AAHLCTSEWA EANRISTEEE RRHFYDCLMA
PMASQVDTLV ERLEELDIRT IAPCHGPAIE TSWRSLLNDY RRWGESQQQA PLKVVLLFAS
AYGNTAAIAD ALAKGVSSTG IQVESLNCEF TPANELVNAI QQADAYLIGS PTLGGHAPTP
IVSALGTLLA EGDRNKKVGI FGSYGWSGEA LELLEKKLRD GGFSFGFEPI KVKFSPDAAM
VKTLEETGTL FGRKLLKQQQ REQPRASSGM SASRSDPAVL ALGRVVGSLC ILTARKGEGN
TALSGAMVAS WVSQASFSPP GLSVAVAKDR AVEALLHRGD HFALNVLAAG RQHELMKHFL
QPFPAGSDRF AGLDLDASPA GQPLLKNALA WLEGCVQQRM ECGDHWLLYA EISHGALLER
EGTTAVHQRR SGANY
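
The sequences above are printed in space-separated blocks of 10 residues, so they need to be normalized before any length or composition check. The following is a minimal sketch in plain Python of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: the first line of the gene sequence as printed in this record.
first_line = "ATGTCGATAT CCTCCATCCA CGAGCCTGCA GCAGCAGCGC AGCGAACGGT GATCACACTC"
seq = clean_sequence(first_line)

print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Running the same two helpers over the full gene sequence would give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.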