Gene P9303_24081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_24081 
Symbol 
ID4775935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2118119 
End bp2120413 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content43% 
IMG OID640087929 
Producthypothetical protein 
Protein accessionYP_001018406 
Protein GI124024099 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCAAG AGGAGATTAT GCAGCAGCTG CAGGCAGCGG TTGCATTGCA TAACCAGGGT 
GAGCTTGATC AGGCAGAAGC GATTTATAGG CAAGTGCTTG CTGTTGATGA AAATAATTTT
TATGCACTTA ATTTCTGCGG ATGTATTCAG CGCGAAAAGA AGAGATTCGA TGACGCGATT
ACCTTGCTGA GCAGTGCAGT CTCCGCTCAG CCAGGTAATC CAGATGCTAA CTACAATCTT
GGAAATGTCT TTAAGGACGC TGAGCGATGG GATGAAGCTA TCTCTTGCTA CGAGAAGACG
CTTGACTTAA AAGCAGAGTA TCCAGAAGCA CTGAATAACC TGGGAATTTG TTTAAAGGAG
ACTGAGCAAT ATGAGCATTC AGAGATTGTC CTGAAGCGTG CTATTTCGAG GCAGCCTAGG
TTTGCAGCTG CCTGGCTCAA CCTAGGTAAT ACGCTCAAGG AGCAGAAAAA GTATTCAGAA
GCGATTGTGA GTTATCGGAA CGCGATCGAG GTGAAGCCTG ATTTTGCGGA GGCTTATCTA
AATTTAGGGA ATGTGTTGAA GGAGGAGGGA GCTGTAGAGG AAGCGATCGC AAGTTATCGG
AAGGCAATTG AAGTTAAGCC TGATTGTGCT GGTGCGTATT TTTCTCTTGG TTTCGTGTTG
AAGGGAGAGG GAGAAGTTGA GGAAGCGATT GTGAGTTATC GGAACGCGAT CGAGGTGAAG
CCTGATTTAG CGGAGGCTTA TCTAAATTTA GGGTATGTGT TGAAGGAGGA GGGAGATGTA
GAGGAAGCGA TCGCAAGTTA TCGGCAGGCA ATTGAGGTGA AGCCCGAATT TGCGGATGCG
TATTTGAATT TAGGGAATGT CTTGGAAGAG GAGGGAGAGA TTGAGGAAGC GATTGCAAGT
TATCGGCAGG CGATTGAGGT GAATCCAGAT TTTGTAGAAG CCTACTCCGA TCTGGGAAAG
TTATTCTATG AGGGAGGGGA CTACATGTCT AGTATTGAAT TTTTTCAAAA AGCACTGTCG
CTAGATAAAA ACCATCTAAA GAGTGCAGCT ACTTTGGGAT TTAGTTTTTT CAGATGTGGT
CAGATTGATG CTGCTATATC CTATTATTCA GCCAAGCTAC ATTCGGGATT TTGCAGTATC
TCAGCATATT ACTTGTTTCG AGCCGTAGAT CAAATAATTC CTTCCAAAAT TAATTCAGAT
TTTGACATTG CATCTGAGTT TTGTGCTTCT GCTATGCAGT ATATGGGCGT TGCTTCGATT
TTGGCTTTTG GAGACTCGCA CGTCAATGTC TTTTCTGGTA TTGAGGGCAT TGATGTCCAT
CATGTCGGTG CATCTACTGC TTACAACTTA ATGTCAGACA AATCTAGTAG TGGAGGCTCG
AAAGAAGTTT GGTCTAGACT AAAGAAAGTA GGTCGTAATA AATCTTCGAC GGCTGTTTTG
CTTTGTTTTG GCGAGATTGA TTGTCGTAAT CATGTCGTGA TGCAGTGCTA TTCAAGAGGT
ATTTCTATAG AAGAATCGGC TTCTAATGTG GTTTCTAGAT ATTTACAGTT TATTAATAGT
CTGCTTAGTG CTGGTTTTAA AGTAGTCGTC TATGGATGCT ATGGATCGGG CTCTCATTTC
AATGCTGTTG GCTCTGAAGT CGAAAGGAAT CTTGCGGCTC TAGAAGTAAA TCGTCTGTTG
GGGAAGGGCT GTGATGACCT CAACTCTCCG TTCTTTTCCT TGAATGATTC GTTTGTTGAC
GAATTTGGGC TTACAAGAAG GTACCTCATG CAAGATGATT CGCATTTACC AGAAAAAGGG
AGAGCTGGTA TTGAGATTAG GTCACTTCTA ATGGCTAATT TGATGAAATC CTGTAAAGCG
ATGTTTTCCT TATCGTCTAT ACATTCCTTA CATTCTTCTT CGGAATGGGA GGTGTCTTAT
TGTAATTCTG TAATTGTCAA TATGGGGGAG GCCTCTCAGT CTTTTTGTCA TTGGGATGAG
AATGGATTTC TTGCTTTGGG GGTGCATTCC GTCTCTAGTC TTTGGTTTGA TATGGGGGCT
CAGCTTGTAG TTAAAGGTTT TGGGTTGGAA TTTGACTCCA AGCCGAGTTT CCCGCCTGAC
TCATCGTGCC CATTCTTTTT CAGGCTTGAT GGGCAAGAGT TGTTAGTAAA GAAGGCTGAA
TATAGCTCTG GAGAGTTAAA GGTTGAAATC GGCAATTCTT ATGGTAGGTA TATCGAGATT
TTGCCGTCCG TTGCTGGTGG AGGAATTCTG GACTGTGTTT CCCGAATGCT TCCTGCTGTT
GGCAGGGCTA TTTGA
 
Protein sequence
MNQEEIMQQL QAAVALHNQG ELDQAEAIYR QVLAVDENNF YALNFCGCIQ REKKRFDDAI 
TLLSSAVSAQ PGNPDANYNL GNVFKDAERW DEAISCYEKT LDLKAEYPEA LNNLGICLKE
TEQYEHSEIV LKRAISRQPR FAAAWLNLGN TLKEQKKYSE AIVSYRNAIE VKPDFAEAYL
NLGNVLKEEG AVEEAIASYR KAIEVKPDCA GAYFSLGFVL KGEGEVEEAI VSYRNAIEVK
PDLAEAYLNL GYVLKEEGDV EEAIASYRQA IEVKPEFADA YLNLGNVLEE EGEIEEAIAS
YRQAIEVNPD FVEAYSDLGK LFYEGGDYMS SIEFFQKALS LDKNHLKSAA TLGFSFFRCG
QIDAAISYYS AKLHSGFCSI SAYYLFRAVD QIIPSKINSD FDIASEFCAS AMQYMGVASI
LAFGDSHVNV FSGIEGIDVH HVGASTAYNL MSDKSSSGGS KEVWSRLKKV GRNKSSTAVL
LCFGEIDCRN HVVMQCYSRG ISIEESASNV VSRYLQFINS LLSAGFKVVV YGCYGSGSHF
NAVGSEVERN LAALEVNRLL GKGCDDLNSP FFSLNDSFVD EFGLTRRYLM QDDSHLPEKG
RAGIEIRSLL MANLMKSCKA MFSLSSIHSL HSSSEWEVSY CNSVIVNMGE ASQSFCHWDE
NGFLALGVHS VSSLWFDMGA QLVVKGFGLE FDSKPSFPPD SSCPFFFRLD GQELLVKKAE
YSSGELKVEI GNSYGRYIEI LPSVAGGGIL DCVSRMLPAV GRAI