Gene P9303_12041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_12041 
Symbol 
ID4777660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1050014 
End bp1051939 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content48% 
IMG OID640086713 
Producthypothetical protein 
Protein accessionYP_001017218 
Protein GI124022911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.681037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCC GGTACAGCTC TCTAGCCGAT CCTCTGATAG CCTACATAGA CAAAGATCAA 
GAAAATACAT CGGGCAGAGC ATCAAGCTTA CTGACCCCAA TCAACTACAA CGCAGATCTA
TTTGCTAACG AGGCAAAGTT TTTCACTGGT AACTATGGAT TTGACGGATA TATTGGTGTA
CCGGGCTTAC TCGGGCCAGG GGGCCAGCAG GCAGCTGCGC ATCACAATGT TGCCTGGGAA
AGTGTTGATC CAGATCTGAG TCCAACGCTC CGCGCGCTAA CAAGTGCAGC GAGCGTTGTT
GGATCTAAAG CTGTTTATGG CATGAATCTT TTGTTGCAAG ACACCATGCC ATTGGTGTTC
AGTTATCCAG TCTTACCAAC AACCTTAGAT GGAAATGGAA GCGATTTTGA GATCACCCTG
AACGATGGTT CGATCGTATC GCCTGCCCTT GCAGGATTCT TGCCAAACCT CGAATACAAT
GAACGCCAAA CGATAGTGGT GGCAGGTGAT TTCGGCAATC GCCTCAACCC CGAAAGTGAA
GGAGCCCGCT ATCCGGTATC GGTTCGAATC GTCAATGACG GCACACCCCT GCAGATGCTC
TCAGCAAAAG GGCCTGTCTT TGCTACAGAA CTGTCCGTCG ATAGCAGCAA CTCCTATGTT
CAAGGGGATG GCCCGAAGCT TGTTGCGGCC AAACTCAATA CCTTCAGTCC GCTTGGAGAA
GGCGGACCTA TTGGCGTTGG CGCCACTTCA GCTAGCAACA GCGGGAGCGA TCTTTATGGG
GATCAAGCCC AATATCGCCT ACGCCTTTAT ACCAGTGCAG GATTCTCACC AGATGGCATT
GCAAGCCTGC AACCCAGCGA GTTCAATAAA TACTTTATTC TTGAAGCGAA AGGCGATAAC
GGCGAAAAAA TTTCACTTAC TAAATCGAAT CAGGATTATC TTATTGGCAA ATATGGTTCC
ATCAAAGTTG TAGGAATAGC AGATTTAGCA CCAGCAGGAA CAATAGAAAA TGCCGCTTAT
GTGGAAGATC ATGATAATTA CTACGACATC ATCCTAGAAG GTGATCTCAG TGCAATTACG
AGACTAAAAA GTGTTCGTAT GCCATCAAGG GGTAATTACC AGGCCGTTTA TAACCCTGGT
GGACCCGGTA ATAATCCGGA CGCACAGGCT GCAGCACCAG GCCCGTTTAC GATGCCGAGC
GTCGATCACA CCATTGCCAT CATCAACGAC CTTAATGGTG CGATGACTGC CACCTATGTC
GAAATCGAAG GAGACGTACT GACAAATCCA TTGAGTAACT TGCCGGTTGG AAAGCTGCTA
GGAGTAGCAG TGGAAGACAC GATCAGCGGC CAGCAAATTT ACGCGTACGA AGATCCCTAT
GGACGTCGTT TCTACACAAG TTTCGAAGCC TCCAAAGACG TAGCCTCGGT ACTTCCTAGC
AACCTCCTGA AGCCGAAGCC GATTGACCTG ATCGACACAA CAGGTTTTGC GCCCGACTCC
AGCGTCATTA TTTCCGGGTC ATTTAGTCGC TCAGCATCTC ACAGCTCAAC ATTGCAGTTT
TATGAGGTAG CAGGACCCGA TGGTGGTGTA GTTGACCCTG TTACTGGCAG AACGCTGATG
CCCAATGAAA GCGGCTACAA CTATGTGGCG AGGAGCAATC TGCTTACCAG CCAAAATAGT
TCATTGAAAA TCGAAAATAA GGAAATTAAT AAATTTCAGT TCAATGCCGA GGCCGGAAAG
ATTTATGCAC CATTGCTGAT CAACGAAGTA ACTGGTGAAC AGTATTTTGC CTTCACTGGC
GCTAACTCTG ACAAGTCCGC GCATTTCACT GCTCTTGGTC CCAATGGCTT TGGTATAGAG
GATCTATTTG GTGGTGGAGA TAAGGATTTT GCCGACATGA TCGTGCAATA TACGATCACA
GTCTGA
 
Protein sequence
MAIRYSSLAD PLIAYIDKDQ ENTSGRASSL LTPINYNADL FANEAKFFTG NYGFDGYIGV 
PGLLGPGGQQ AAAHHNVAWE SVDPDLSPTL RALTSAASVV GSKAVYGMNL LLQDTMPLVF
SYPVLPTTLD GNGSDFEITL NDGSIVSPAL AGFLPNLEYN ERQTIVVAGD FGNRLNPESE
GARYPVSVRI VNDGTPLQML SAKGPVFATE LSVDSSNSYV QGDGPKLVAA KLNTFSPLGE
GGPIGVGATS ASNSGSDLYG DQAQYRLRLY TSAGFSPDGI ASLQPSEFNK YFILEAKGDN
GEKISLTKSN QDYLIGKYGS IKVVGIADLA PAGTIENAAY VEDHDNYYDI ILEGDLSAIT
RLKSVRMPSR GNYQAVYNPG GPGNNPDAQA AAPGPFTMPS VDHTIAIIND LNGAMTATYV
EIEGDVLTNP LSNLPVGKLL GVAVEDTISG QQIYAYEDPY GRRFYTSFEA SKDVASVLPS
NLLKPKPIDL IDTTGFAPDS SVIISGSFSR SASHSSTLQF YEVAGPDGGV VDPVTGRTLM
PNESGYNYVA RSNLLTSQNS SLKIENKEIN KFQFNAEAGK IYAPLLINEV TGEQYFAFTG
ANSDKSAHFT ALGPNGFGIE DLFGGGDKDF ADMIVQYTIT V