Gene P9301_00461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_00461 
Symbol 
ID4912066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp47319 
End bp49094 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content34% 
IMG OID640159610 
Productflavoprotein 
Protein accessionYP_001090270 
Protein GI126695384 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.551818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAGCCT CTGCCCAGAC AAGTAATTCC AAATTGGCAC AAATAAATAA CAAGTTGACA 
GTTCAATCTC AAAACTTTGC TGATGATTCT TGTGCCATAA GATCACTGGA TTGGGATCGC
AGTAGATTTG ATATTGAATT TGGCTTAAGA AATGGCACTA CCTACAATAG TTTTATTATT
AAAGGCGAAA AACTAGCAAT AATTGATACT AGTCACGCAA AATTCGAAGA ATTATGGTTT
GAAGAATTAC TTAAAGAGGT AAATCCACAA GAAGTGGATT ATCTAATTAC AAGCCATACA
GAACCTGATC ATTCTGGTTT AATAGGTAAT CTTTTACAAT TAAACAAAAA TATCACAGTA
GTAGGATCAA AATTAGCCCT TAAGTTTATT GAAGACCAAA TACATGTTCC CTTTAAGCGA
CTAGAAGTTA AAAGTGGAGA GTTTTTAAAT CTTGGAACTA ATCCTAATAG TGGCTTAGAA
CATAATATTG AATTTATAAG CGCACCAAAT TTACATTGGC CCGATACAAT TTTTTCATAT
GACCACAGCA CTCATGTTCT CTACACATGC GATGCATTTG GACTCCATTA TTGTTCTGAT
GAATTTTATG ACACTGATCA AAAAGAAATA TATGATGATT TTCGTTTTTA TTACGATTGC
CTAATGGGTC CAAACGCTAG AAGCGTTCTC CAAGCAATTA AAAGAATAGA TAAACTACCT
GAATTAAAAA CAATAGCTGT AGGTCATGGG CCTTTGCTTC ATAATCAAGT TAATTTTTGG
AAAGGGAAAT ATCAAGAATG GAGTAGCAAT AAAAGCAAAG GTAATGATTT TGTATCAGTT
TGCTATATCA GCGACTATGG TTATTGTGAT CGACTAAGTC AAGCGATATC TCATGGAATA
AGTAAAGCAG ATGCACAGGT TCAATTAATT GATTTAAGAT CTTCTGACCC CCAAGAATTA
ACAAGTTTAA TTTCAGAGTC AAAAGCAGTA GTCATCCCCA CATGGCCAGT AGATTCAGAT
AATGAATTAA AAGAATCTCT TGGTACTTTA TTTGCAGCAC TAAAATCAAA ACAATTTACA
GCTGTCTATG ATGCATTTGG TGGAAATGAT GAACCAATAG ATTCCTTAGC AAATAAATTA
AGAGAACTTG GTCAAAAAGA AGCTTTCTCT CCTTTAAGAG TAAAAAACAT TCCAGATCCC
ATTGTTTATC AACAATTCGA AGAAGCTGGA ACTGACTTAG GACAATTGAT CAATAAAAAG
AAAAATATTG CCTCTATGAA GAGCCTTGAT TCAAATTTAG ATAAAGCTTT AGGTAGGTTA
AGTGGAGGAT TATATGTAGT TACAGCAAGC CAGGGAGAAG GTTCGACATT TAGACAAAGT
GCGATGGTAG CAAGTTGGGT TAGTCAAGCA AGTTTTTCTC CACCAGGCAT TACAGTTGCA
GTAGCAAAAG ATAGAGCTAT TGAATCATAT ATGCAAGTTG GCAAAGGTTT TGTTGTGAAT
GTCTTGAGAG AAGATAATTA TCAAAAAATG TTCAGACATT TTTTAAAAAG ATTTGCTCCT
GGAGCTGATA GATTTGCAGA TGTAGATATA ATTAGCAACA TCGCAGATGG AGGACCAGTC
CTCTCAGATT CACTCGCTTT TTTAGATTGT AAAGTTAGTT CCAGACTGGA AACTCCAGAC
CATTGGATAA TTTACGGAAT TGTTGAAAAT GGTAATGTCT CTGACTTATC ATGCAAGACA
GCAGTTCATC ACAGAAAAGT TGCTAATCAC TATTAG
 
Protein sequence
MLASAQTSNS KLAQINNKLT VQSQNFADDS CAIRSLDWDR SRFDIEFGLR NGTTYNSFII 
KGEKLAIIDT SHAKFEELWF EELLKEVNPQ EVDYLITSHT EPDHSGLIGN LLQLNKNITV
VGSKLALKFI EDQIHVPFKR LEVKSGEFLN LGTNPNSGLE HNIEFISAPN LHWPDTIFSY
DHSTHVLYTC DAFGLHYCSD EFYDTDQKEI YDDFRFYYDC LMGPNARSVL QAIKRIDKLP
ELKTIAVGHG PLLHNQVNFW KGKYQEWSSN KSKGNDFVSV CYISDYGYCD RLSQAISHGI
SKADAQVQLI DLRSSDPQEL TSLISESKAV VIPTWPVDSD NELKESLGTL FAALKSKQFT
AVYDAFGGND EPIDSLANKL RELGQKEAFS PLRVKNIPDP IVYQQFEEAG TDLGQLINKK
KNIASMKSLD SNLDKALGRL SGGLYVVTAS QGEGSTFRQS AMVASWVSQA SFSPPGITVA
VAKDRAIESY MQVGKGFVVN VLREDNYQKM FRHFLKRFAP GADRFADVDI ISNIADGGPV
LSDSLAFLDC KVSSRLETPD HWIIYGIVEN GNVSDLSCKT AVHHRKVANH Y