Gene A9601_03031 details

Gene Information

Locus tag: A9601_03031
Symbol:
ID: 4716990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 279616
End bp: 281100
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 33%
IMG OID: 640078005
Product: retinal pigment epithelial membrane protein
Protein accession: YP_001008698
Protein GI: 123967840
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
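
The coordinate and length fields above agree with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state the convention) and assuming the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 279616
end_bp = 281100


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1485
print(protein_length(length_bp))  # 494
```

Both results match the Gene Length and Protein Length fields of the record.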


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTAATT TACAAGAGAA AAAAATTGAT AAATTTAATA TTTTTAAAAA AGAAGATTGG 
TCAAGTGCTT ACAAAAATGT AGAAAAGGAG TTAACTAAAG TGCCTCTAAA AGTTAGCAAA
GGTAATAATA TTAAAAATTT AAATGGAACA TTATTAAGAA ATGGACCAGG AATATTAGAG
AGAGGTGGAC AATGGGTTCA TCATCCATTT GATGGTGATG GGATGATAAC ATCCATAAAA
TTCAAAGATG GTCAGCCATT CTTAACAAAT AGATTTGTTA AGACTAAAGG CTATTTGGAA
GAAGAAAAAA TAAATAAATT TATTTATAGA GGTGTTTTTG GAACACAAAA AAATGGAGGA
ATTTTAAATA ATGCATTAGA TCTAAAATTC AAGAATATCG CTAATACACA TGTTATTAAA
TTAGGAGATG AAATTCTCGC GTTATGGGAA GCAGCAGGTC CACATGCAAT GAATCCTGAT
AATCTTGACA CTATTGGTTT AACAACATTA AAAGGGGTAC TCAAACCTAA CGAAGCATTC
AGTGCTCATC CCAAAACAGA CCTAAACTCA AATGCATCTT CAGAACTTTT AGTCACTTTT
GGAGTACAAA CCGGGCCCAA AAGTACCATT AGATTAATGG AATTTGATAA TGCTGGTACA
AATTCTGGAG AGCTCATTTT TGACAGAAAA GATACTTTTA ATGGCTTTGC ATTTCTTCAT
GATTTTGCAA TTACAACTAA TTGGGCAATA TTTTTACAGA ATGCTATTGA TTTCAATCCT
CTTCCATTTG TAATGGGTCA AAGAGGAGCA GCACAATGTC TAAAGTCAAA CCCAAATAAA
AAGGCAAAGT TTTTTATCAT CCCAAGAGAA AGTGGATTAT TTAGAGGACA GCCTCCTTTA
ACAATAGATG CTCCAGAAGG ATTCGTTTTT CATCATGTAA ACGCATTTGA AAAAGATTCC
AAAATCGTAT TAGATAGTAT TTTTTATGAT GATTTCCCAT CAGTTGGTCC AGATGAGAAT
TTTAGGGATA TTGACTTTGA TAAATATCCA GAAGGAAAAC TGAAAAGATC AATTATCGAT
CTGAAAACAA AAACTTGTGA ACTTGAAACT TTCAGTGAGC AATGTTGTGA ATTTGCTGTT
GTTAATCCTA AAAACTTAGG ATTAAAAGCA ACTTTTAGTT GGATGGCAAG CACATCTCAA
AAGCTGGGGA ACGCTCCACT TCAAGCAATA AAAAAAATAA ATTTAACTTC TAAGGAAGAG
ATTTCTTGGT CAGCCGGTCC AAGTGGATTT GTTAGTGAAC CAATTATGGT CCCATCAGAA
AACTCTTCAA AAGAAGATGA GGGATTTTTA TTTATACTTC TATGGAACGG AGAAAGAAGA
GGAAGCGATT TAGTGATATT AGACGCAAAA GACTTAAAAG AATTAGCTGT TTATGAATTA
CCCATTTCAA TTCCTCATGG CCTTCATGGA TCTTGGGTCA ATTGA
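
A GC content figure like the one reported for this gene can be recomputed from the sequence listing. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; how the database rounds the value is not stated in the record. The function name `gc_content` is illustrative.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, used here only as a short demonstration input.
first_line = "GTGACTAATT TACAAGAGAA AAAAATTGAT AAATTTAATA TTTTTAAAAA AGAAGATTGG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1485 bp listing instead of the first line gives the value for the whole gene.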
 
Protein sequence
MTNLQEKKID KFNIFKKEDW SSAYKNVEKE LTKVPLKVSK GNNIKNLNGT LLRNGPGILE 
RGGQWVHHPF DGDGMITSIK FKDGQPFLTN RFVKTKGYLE EEKINKFIYR GVFGTQKNGG
ILNNALDLKF KNIANTHVIK LGDEILALWE AAGPHAMNPD NLDTIGLTTL KGVLKPNEAF
SAHPKTDLNS NASSELLVTF GVQTGPKSTI RLMEFDNAGT NSGELIFDRK DTFNGFAFLH
DFAITTNWAI FLQNAIDFNP LPFVMGQRGA AQCLKSNPNK KAKFFIIPRE SGLFRGQPPL
TIDAPEGFVF HHVNAFEKDS KIVLDSIFYD DFPSVGPDEN FRDIDFDKYP EGKLKRSIID
LKTKTCELET FSEQCCEFAV VNPKNLGLKA TFSWMASTSQ KLGNAPLQAI KKINLTSKEE
ISWSAGPSGF VSEPIMVPSE NSSKEDEGFL FILLWNGERR GSDLVILDAK DLKELAVYEL
PISIPHGLHG SWVN
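
The protein sequence corresponds to the gene sequence translated with translation table 11, in which alternative start codons such as GTG initiate with methionine. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. It assumes an unambiguous A/C/G/T sequence, and as a simplification it always reads the first codon as M without checking that it is a valid start codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence):
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Simplification: the first codon is always emitted as methionine.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACTAATT TACAAGAGAA AAAAATTGAT"))  # MTNLQEKKID
```

The output matches the first ten residues of the protein sequence listed above.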