Gene P9303_24071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_24071 
Symbol 
ID4775947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2115761 
End bp2117629 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content45% 
IMG OID640087928 
Producthypothetical protein 
Protein accessionYP_001018405 
Protein GI124024098 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCAAG AGGAGATTAT GCAGCAGCTG CAGGCAGCGG TTGCATTGCA TAACCAGGGT 
GAGCTTGATC AGGCAGAAGC GATTTATAGG CAAGTGCTTG CTGTTGATGA AAATAATTTT
TATGCACTTA ATTTCTGCGG ATGTATTCAG CGCGAAAAGA AGAGATTCGA TGACGCGATT
ACCTTGCTGA GCAGTGCAGT CTCCGCTCAG CCAGGTAATC CAGATGCTAA CTACAATCTT
GGAAATGTCT TTAAGGACGC TGAGCGATGG GATGAAGCTA TCTCTTGCTA CGAGAAGACG
CTTGACTTAA AAGCAGAGTA TCCAGAAGCA CTGAATAACC TGGGAATTTG TTTAAAGGAG
ACTGAGCAAT ATGAGCATTC AGAGATTGTC CTGAAGCGTG CTATTTCGAG GCAGCCTAGG
TTTGCAGCTG CCTGGCTCAA CCTAGGTAAT ACGCTCAAGG AGCAGAAAAA GTATTCAGAA
GCGATTGTGA GTTATCGGAA CGCGATCGAG GTGAAGCCTG ATTTTGCGGA GGCTTATCTA
AATTTAGGGA ATGTGTTGAA GGAGGAGGGA GAAGTTGAGG AAGCGATTGT AAGTTATCGG
AAGGCAATTG AAGTTAAGCC TGATTGTGCT GGTGCGTATT TTTCTCTTGG TTTGGTGTTG
AAGGGAGAGG GAGAAGTTGA GGAAGCGATT GTGAGTTATC GGAACGCGAT CGAGGTGAAG
CCTGATTTTG CGGAGGCTTA TCTAAATTTA GGGTATGTGT TGAAGGAGGA GGGAGATGTA
GAGGAAGCGA TCGCAAGTTA TCGGAAGGCA ATTGAGGTGA AGCCTGACTT TGTGAAAGCG
TTTTTGGGAT TAGGGGCTGT ATTGACAGAG AAAGGTGAGA TTGATGACGC GCGACAAGTT
GTTTCTGCTC TTTTCGAAAT GATTGCAATT GAGGAGTCAT ATATGCTTCC TTTCCCTTCT
AGTAATTTGG TTTTTGAGTG GCATCACAGG CTGGCATTAC ATCTTTCTTG GGAATTGGAG
TTTGCTGCCC TTTCTGGTTC TTCTGTGCCA TTCTCTGCAT TCGAAGCTGA GAAAAAAGTC
GATGCTCAGC ATTTCCCTCC TTTGTTTTTG AAAGGAGAAG GCGATAGGGC AAGTAAACAG
CGCTTGTATC GAAATGGATA TTTGGTGGAA GATCAGATAT TGTCTGAAAA TTTATGTGCT
GAGTTTGTCA ATGAGTTTGA GGGTGTCAGG ATGATGACTG CAGGGTTGAT CCGTGCGGTA
TCTGAGAAGG GCGTCTTAGG GTCTGTCTTG GATAAAATTT TCAAACACAC TGGTTTTCCG
CACTTTGTAT GGGATTGTTT TTGCTTCGCA AAAGGACCTG ATACTGAGTC TGTGTCTGAT
GCTTGGCATT ATGACAATCA TTACAATATT TGGACTCCCA AATTAATGGT TTATCTTAAT
TCTCAGCGAG AAGAGGCTGG GGCAACGCAG TTTGTTGATG CGACTTTGTC CCAAAGGATT
TCTGAGAAAT CTGATTATAT GGGCCTTGTT TGTCAGCGTA AATATTACAC AGAATACGTT
AAGGCTTTGG AGGGCGAGCT AAGGCTTGAT CCTGTCACTT TTGATCCTCC TCATTACACC
TTTTGCCCCG ACCGGGCCGG CACGGGTGTT TGGTTTTGTC CTGCACGAGT GCTGCATCGT
GGCGTGAGTC CTAAGAAAGG ACTTCGCTAT GTGCTTACGT TTTCCTTGAC ACCTCTTCCT
AGAGATTGTC AGTGGTCCAT GGAGCAGTGC GTCGAGAAGT CAGTAGAGAT ATTGAGGGAC
AAGATTAAGC AAGGGATGAC AGAAATTGAT ATCAACCCCT TTTGGTCTGT ATCAAATAGG
TCTGTATAG
 
Protein sequence
MNQEEIMQQL QAAVALHNQG ELDQAEAIYR QVLAVDENNF YALNFCGCIQ REKKRFDDAI 
TLLSSAVSAQ PGNPDANYNL GNVFKDAERW DEAISCYEKT LDLKAEYPEA LNNLGICLKE
TEQYEHSEIV LKRAISRQPR FAAAWLNLGN TLKEQKKYSE AIVSYRNAIE VKPDFAEAYL
NLGNVLKEEG EVEEAIVSYR KAIEVKPDCA GAYFSLGLVL KGEGEVEEAI VSYRNAIEVK
PDFAEAYLNL GYVLKEEGDV EEAIASYRKA IEVKPDFVKA FLGLGAVLTE KGEIDDARQV
VSALFEMIAI EESYMLPFPS SNLVFEWHHR LALHLSWELE FAALSGSSVP FSAFEAEKKV
DAQHFPPLFL KGEGDRASKQ RLYRNGYLVE DQILSENLCA EFVNEFEGVR MMTAGLIRAV
SEKGVLGSVL DKIFKHTGFP HFVWDCFCFA KGPDTESVSD AWHYDNHYNI WTPKLMVYLN
SQREEAGATQ FVDATLSQRI SEKSDYMGLV CQRKYYTEYV KALEGELRLD PVTFDPPHYT
FCPDRAGTGV WFCPARVLHR GVSPKKGLRY VLTFSLTPLP RDCQWSMEQC VEKSVEILRD
KIKQGMTEID INPFWSVSNR SV