Gene P9303_20961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_20961 
Symbol 
ID4776828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1851334 
End bp1857582 
Gene Length6249 bp 
Protein Length2082 aa 
Translation table11 
GC content48% 
IMG OID640087604 
Producthemolysin-type calcium-binding domain-containing protein 
Protein accessionYP_001018096 
Protein GI124023789 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAA TCACCAACTT CACAGCAAAC GCATCTGATC GATGGGGTTG GTGGAGATGG 
AACAGGTGGG GAACAACAGC CACAGCCATT GAAGATCAAA GCCTGCGTAT CGATAGCGAC
GAACTGCGTA TCTCAGCCAC AGCCAACTCT GGCTGGCGTA CAGCACGTGC AACGGGTCTG
CTGAATAGCG AAGTTGAATA CACCGGCAAA GCCGTAAGCA TTAACACCTA CGCGCAAGGG
CGAGATGCCT ACTCCATTGG TATCGATGGC TCCCAGCTTT CTATCATTGG CAATGACAAC
AAGCTGATCA ATATCAACGC CACCTCAAGG GTCTTTGGAT ATGACCCAGC CTGGGGCCTT
CGTAATTCCA CACTCAGCAC CCAGGGTGGC AATGACATCA TCAGCATCAG AGCTGATGCT
GGAATCCTTG GTGGCAGATT CGTCAACTCT CCGATAGTGG CAACTGGCCT TGAGAGCAGC
TCTGTCAATA CTGGCGCGGG CAATGATTTC CTTAGCCTTC AATCAATCGC AAGCGGTAGA
AATACTGCCA CGGCTAGCGG CACAACTGAC AGTTCGATTG ATCTCGGTTC AGGCAACGAT
TCCCTATTCA TCAATGCTTC AGCAACCGGT TCAGGCGGAT GGTGGGGTGG CGGCAAAGTT
GCTGCCTATG GAGCTCTTAA CACCACAATC AACACAGCGG GGCAAACCTC TGAAGGAGTG
CCCCAGATGG GGCATGAGAA CAACTCTTTT CTCTTATTAG GTTTTGAGGA AAACCCTGCA
GTTGGCAATG ACAACGACTC CATTGTCATT AATGCCTCTG CCACCAACTG GGGACGAGGA
CGCCGTGGTT ATGCAGAAGC GATAGGCCTT GGTGATCGCT CTTCCATCAA CACTGGTAGT
GGCGACGATT TAATCAACAT CACAGCAAGA GCTGTTGGCG CATCGACCAA TGCCTGGGCC
ATGCGAGACA GCAATATAAG TGCAGGTTCA GGCGATGATT CCGTCATCCT CAACGCCTTC
ACTCAATCTA GAGTTTGGGA TCCGGCCTAC GGCGCAAGCA ACAGCTCCAT CGAACTCGGT
AGTGGCAATG ATCAGCTAAC AATCAATGCC AATGCACGTG GCAATGGCAG AGAGATCAAA
GCTTATGGCC TCGAAGACAG CCTTGTAGAA GCCGGGTGGG GCAATGACGA GATCAAGATT
AATGTTTCTG CAGAAGGTGG CAACAGCAGA ATCAACAAAC ACTCTTGGGA GCATATCTAT
TCATACGAGT CCTCAGGCGA TAGAAGTCAC AACAATCAGA GGGAATATTC ATCTACCAAT
CAATACGGCT ATAACTCCGG CGGAAAATAC AGCAACAGCT ACGACTACAA CAACTCTAGC
TCCTACGATT ACGACTCCAA CTCACAAGGA AGCTACAACA GAGAAAACGC TTATTCCTAC
AAAAATTCCT ATAGCTACGG CAACAGCTGG TATAACAACA GCCGTGACTA CGAGCGCTCA
CATCTCAACA GCTACGACTA CGACTACGAC AACAATTCAA GCAGTTCCAG CAACCACAGC
AATTCCTACT CGCGCTCTTA CAGCTCCGAC AACAGCCGCA GTAGCAGCCA TTCCAGCGAT
TACCTCAACA GCTACAGCAG TTCCTATCAA TCCTCCTACG ACAACAACCG CTTCAATCAA
TCCTCCTACG ACTACGACCA CTCACGCATT CACTCCTACG GTGAGGCCAC AGGCGCTGAT
GGGTCCACCA TCTTCACAGG TTCTGGCAAT GACTCCGTCA ACCTGCAAAT CAATGCCGGC
TCTTCTGCCA TAGGCCTTAA CGATTCAATT CTCTCCACCA GCGATGGTTC TGATGACATC
ACCATTGATG TCAACGCTTT TGGCGAAAAC AGCTTCTACA ACAGATCATC GCGCTCTTAT
CTCAATCTCG ACGATTCATC TGGCAACCGC AGCAGCGACT ACAGCAGAAT CTATTCTCGC
TCTTCCAGTT ACAGCAACGA CTACACCAAC AGCTACAGCA ACTCCTACGA TCACAGCTCT
TCCAGCTCCT ACGATTACAG CCACGCTTCA CAAGGCAGCG GTAACAGCAT CAATCAATCG
ACTTACAAAA ATTCCTATAG CCACTCAAAT CGATGGGGAT ACAACAGCAG CAGTAACTAT
GAAAACTCCT ACCTCAATAA CTACGACTAC GACTACAGCA ACAGCTACAG CAGATCCAAC
AGCAGAGATA CTTCCTTCTC TCGCTCTTAC AACTCCGACA TCAGCCGCAG CAGCAGCCAT
TCCAGCGATT ATCTCAACAA CTACAGCAGA TCCAATCAAT CCTCCTACGA CAACAACCGC
TTCAATCAAT CCTCCTACGA CTACGACCAC TCACGCATTC ACTCCTACGG AGAAGCTACT
GGTGCTGATG GCTCAACCAT CTCCACAGGC TCTGGCAATG ATTCCGTCAA CCTGCAAATC
AATGCCGGCT CTTCTGCCAT AGGCCTTAAC GATTCAATTC TCTCCACCAG CGATGGTTCT
GATGACATCA CCATTGATAT CACCGCTATA GGTGAAAACA GCTTCTATAA CAAATCATCG
AGCTCACGCT CCTCCTTCGA TGATTCCACT GGCAGCAACA GCAGCGACTA CAGCAGCAAC
TCGTCGCGCT CAAGCAGTTC AACTTACGAT TACACCAACA CCAACAGCAG CAACTACGAA
CGCTCTGACT CCAGCTCAAG CTCTTTCAAC CGCGACTCAA CAAATAGCCG CAGCGGCATC
AATCAATCCA CTTACAAAAA TTCCTATAGC TACGGCAACA ACTGGTATAA CAACAGCCGT
GACTACGAGC GCTCACATCT CAACAGCTAC GACTACGACT CCAGCAACAG CTACAGCAGA
TCCAACAGCA GAGATACTTC CTTCTCTCGC TCTTACAACT CCGACATCAG CCGCAGCAGC
AGCCATTCCA GCGATTATCT CAACAGCTAC AGCAGATCCA ATCAATCCTC CTACGACAAC
AACCGCTTCA ATCAATCCTC CTACGACTAC GACCACTCAC GTATTCACTC CTACGGAGAA
GCTACTGGTG CTGATGGTTC AGCCATCTCC ACAGGCTCTG GCAATGATTC CGTCAACCTG
CAAATCAATG CCGGCTCTTC TGCCATAGGC CTTAACGATT CAATTCTCTC CACCAGCGAT
GGTTCTGATG ACATCACCAT TGATATCACC GCTATAGGTG AAAACAGCTT CTATAACAAA
TCATCGAGCT CACGCTCCTC CTTCGATGAT TCCACTGGCA GCAACAGCAG CGACTACAGC
AGCAACTCGT CGCGCTCAAG CAGTTCAACT TACGATTACA CCAACACCAA CAGCAGCAAC
TACGAACGCT CTGACTCCAG CTCAAGCTCT TTCAACCGCG ACTCAACAAA TAGCCGCAGC
GGCATCAATC AATCCACTTA CAAAAATTCC TATAGCTACG GCAACAACTG GTATAACAAC
AGCCGTGACT ACGAGCGCTC ACATCTCAAC AGCTACGACT ACGACTCCAG CAACAGCTAC
AGCAGATCCA ACAGCAGAGA TACTTCCTTC TCTCGCTCTT ACAACTCCGA CATCAGCCGC
AGCCGCAGCC ATTCCAGCGA TTATCTCAAC AGCTACAGCA GATCCAATCA ATCCTCCTAC
GACAACAACC GCTTCAATCA ATCCTCCTAC GACTACGACC ACTCACGTAT TCACTCCTAC
GGAGAAGCAA CTGGTGCTGA TGGCTCAACC ATCTCCACAG GCTCTGGCAG TGATTCCGTC
AACCTAAAAA TCAATGCCGG ATCCTCTGCT ATAGGCCTTA AAGATTCAAT TCTCTCCACC
AGCGAGGGTG CTGATGACAT CACCATCGAT GTCAGCGCTT TTGGTGAAAA CAGCTTCTAC
AACAGATCAT CCAACTCACG CTCATACTTC GACGATTCAT CTGGCAACCA AAGCAGCGAC
TACAGCAGAA TCTATTCTCG CTCTTACAGT TACAGCAACG ACTACAACAA CAGCTACAGC
AACTCCTACG ATCACAGCTC TTCCGGCTCC TACGATTATA ACCACGACTC AACAAATAGC
CGCAGTGGTA TCAATCAATC GACTTACAAA AACTCCTATA GCTACGGCAA TAGGTGGTAC
AACTACGGAC GCGACTACGA GCGCTCATAT CTCAACAGCT ACGACTACGA CTCCAGCAAC
AGCTACAGCA GATCTGGAAA CTACGATCAC TCCTCCTCAG GATCTTATAA CTACGACTAC
AACAGGAGTT ATAATTATGG CTACGACTAC AACAACAACT ACAACGGCTC TCATCAATCT
AATTACGACA ACAACCGCTT CAATCAATCC TCCTACGGCT ACGACCACTC ACGTATTGAG
ACCATTGGCA TTGCTATTGG GGCGGAGGGT TCCAACATCT CTACAGGCTC TGGTAGTGAT
TCCGTCAACT TGCAAATCAA TGCTGGATCC TCTGCCATAG GCCTTAAGAA TTCTGTTCTC
GAAACTGGCG AGGGACAAGA TCAAGTCAAC ATAAGTGTTA ATGCCCTAGG TCATTCCTTC
TACCACGGCA AATCAACTGC CACAGGTGAT GCCAAAGCTC TGTTGAATTC CACCATCTCA
ACAGGAGGTG GAGATGACCA GATCTTTGTG AACGCCATCG GCAACCGTTC TGGAGCTGCT
CTCACCAACA GCACAATCAA AACAGGAACA GGTGATGACT CTGTCTTGTT GAGTGGCGAC
CTTGTCAACA GCAGCATCCA CGGCGGAGAT GGTGACGACC AGATCATGCA TTCAGGTGGA
GGCGACGCCG AGCTCTACGG TGATGATGGT AATGACACCA TTATCGGTGG GTCAGGAGCA
GATAAAATCA GTGGTGGCAA GGGTGATGAT GACATCTTTG CTGGCGATGG AGGTGACACA
ATTGATGCTG GAGATGGCAA GGACAACATC ACCATTACCG GCGGTAATGG TATAAGCGTT
ACGACTGGTT TCGGCTCAGA TACAATATTT TTTACAGCCA ATTACTACAG AAGTTTGTTA
AGCGATACAA GCATAGGAGG AAACATTTCC GATGCCAGAT CAATCAGTTC AGCAACAAAT
CTAGCCATTA CAGACTTCAC AGTCTATGAG CCTTTTAAAG ACATCGAGTT AGATTTGATC
CCAGAGTGCA GCGGAAAGTT AACATTGAGC AGCGGCATTA ACTTCATCGA AGATGAAGCA
TCTGATGCAG AGGCAAAAGA CATTTCAGCA AAGCTTGGCC ACCCTCCCGC ACCAACCACA
TCAGGAGTGG AAGATCAGAT TATTTTAGAG AGCGTCGCAA GTTCAAAGTC AGAGCAAATT
ATAGAAGTTT CTTTAGGAGA AACTAAAGAC AATTCAGCAA AGCTTGGCCA CCCTCCCGCA
CCAACCACAT CAGGAGTGGA AGATCAGATT ATTTTAGAGA GCGTCGCAAG TTCAAAGTCA
GAGCAAATTA TAGAAGTTTC TTTAGGAGAA ACTAAAGACA ATTCAGCAAA GCTTGAGCAC
CCTCCCGCAC CAACCACATC AGGAGTGGAA GATCAGATTA TTTTAGAGAG TGTCGCAAGT
TCAAAGCCAG AGCAAATTAT AGAAGTTTCT TTAGGAGAAA CTTCTGAGAT CACGACACAA
GAAGTTCTGG GTTTTCAAGT CAGCGAGGAT CTTGTGTCTC TACCAGAAGA GAATCCAGCT
CGTCAGATAT CAATCAATCA TGACATGATG AACTTTGATG ACCTCCTCAA TAATGTTGCA
GTCAACTACG ATGGGGGCAA CGCGATAGCA TCAGGGTACA TTGACTTTGC ACAAGATGGA
GTCAATACAT TGGTGAGTTT TGACGCAGAT GGTCTTGCTG GAGACAAAAA TACCGGTGTT
GTCATAGCAA CACTCCAGAA CGTGGATACT TTAGACTTGG GTACAGACAA TATTGATTCG
GATTATTCAC TAGAGGTAGG ATTAGATCCA ATCACAGGGG CAGCAGAAAC ACAAATCGAA
GAAGATTTAA CAACGATCAC TGGTGAAACA ACTGATGCAA TCATCAATCC AAGCAATGAC
TCACTTGCAA AATTAACAAC ATCAGACGAG ACTGATTCTA TGGCGATTTC AGAAGCCATC
AACGCCCTAA CAACATCAGA AGATGTCGCC TCTGTGGAAT CCCCTACATC AACAACATCA
GAAGCAGCAA GTTCAACCGA TATCGTCACA GCAGCGATAG CCACTGCAGT TGATCAAAAC
ACCATTTAA
 
Protein sequence
MASITNFTAN ASDRWGWWRW NRWGTTATAI EDQSLRIDSD ELRISATANS GWRTARATGL 
LNSEVEYTGK AVSINTYAQG RDAYSIGIDG SQLSIIGNDN KLININATSR VFGYDPAWGL
RNSTLSTQGG NDIISIRADA GILGGRFVNS PIVATGLESS SVNTGAGNDF LSLQSIASGR
NTATASGTTD SSIDLGSGND SLFINASATG SGGWWGGGKV AAYGALNTTI NTAGQTSEGV
PQMGHENNSF LLLGFEENPA VGNDNDSIVI NASATNWGRG RRGYAEAIGL GDRSSINTGS
GDDLINITAR AVGASTNAWA MRDSNISAGS GDDSVILNAF TQSRVWDPAY GASNSSIELG
SGNDQLTINA NARGNGREIK AYGLEDSLVE AGWGNDEIKI NVSAEGGNSR INKHSWEHIY
SYESSGDRSH NNQREYSSTN QYGYNSGGKY SNSYDYNNSS SYDYDSNSQG SYNRENAYSY
KNSYSYGNSW YNNSRDYERS HLNSYDYDYD NNSSSSSNHS NSYSRSYSSD NSRSSSHSSD
YLNSYSSSYQ SSYDNNRFNQ SSYDYDHSRI HSYGEATGAD GSTIFTGSGN DSVNLQINAG
SSAIGLNDSI LSTSDGSDDI TIDVNAFGEN SFYNRSSRSY LNLDDSSGNR SSDYSRIYSR
SSSYSNDYTN SYSNSYDHSS SSSYDYSHAS QGSGNSINQS TYKNSYSHSN RWGYNSSSNY
ENSYLNNYDY DYSNSYSRSN SRDTSFSRSY NSDISRSSSH SSDYLNNYSR SNQSSYDNNR
FNQSSYDYDH SRIHSYGEAT GADGSTISTG SGNDSVNLQI NAGSSAIGLN DSILSTSDGS
DDITIDITAI GENSFYNKSS SSRSSFDDST GSNSSDYSSN SSRSSSSTYD YTNTNSSNYE
RSDSSSSSFN RDSTNSRSGI NQSTYKNSYS YGNNWYNNSR DYERSHLNSY DYDSSNSYSR
SNSRDTSFSR SYNSDISRSS SHSSDYLNSY SRSNQSSYDN NRFNQSSYDY DHSRIHSYGE
ATGADGSAIS TGSGNDSVNL QINAGSSAIG LNDSILSTSD GSDDITIDIT AIGENSFYNK
SSSSRSSFDD STGSNSSDYS SNSSRSSSST YDYTNTNSSN YERSDSSSSS FNRDSTNSRS
GINQSTYKNS YSYGNNWYNN SRDYERSHLN SYDYDSSNSY SRSNSRDTSF SRSYNSDISR
SRSHSSDYLN SYSRSNQSSY DNNRFNQSSY DYDHSRIHSY GEATGADGST ISTGSGSDSV
NLKINAGSSA IGLKDSILST SEGADDITID VSAFGENSFY NRSSNSRSYF DDSSGNQSSD
YSRIYSRSYS YSNDYNNSYS NSYDHSSSGS YDYNHDSTNS RSGINQSTYK NSYSYGNRWY
NYGRDYERSY LNSYDYDSSN SYSRSGNYDH SSSGSYNYDY NRSYNYGYDY NNNYNGSHQS
NYDNNRFNQS SYGYDHSRIE TIGIAIGAEG SNISTGSGSD SVNLQINAGS SAIGLKNSVL
ETGEGQDQVN ISVNALGHSF YHGKSTATGD AKALLNSTIS TGGGDDQIFV NAIGNRSGAA
LTNSTIKTGT GDDSVLLSGD LVNSSIHGGD GDDQIMHSGG GDAELYGDDG NDTIIGGSGA
DKISGGKGDD DIFAGDGGDT IDAGDGKDNI TITGGNGISV TTGFGSDTIF FTANYYRSLL
SDTSIGGNIS DARSISSATN LAITDFTVYE PFKDIELDLI PECSGKLTLS SGINFIEDEA
SDAEAKDISA KLGHPPAPTT SGVEDQIILE SVASSKSEQI IEVSLGETKD NSAKLGHPPA
PTTSGVEDQI ILESVASSKS EQIIEVSLGE TKDNSAKLEH PPAPTTSGVE DQIILESVAS
SKPEQIIEVS LGETSEITTQ EVLGFQVSED LVSLPEENPA RQISINHDMM NFDDLLNNVA
VNYDGGNAIA SGYIDFAQDG VNTLVSFDAD GLAGDKNTGV VIATLQNVDT LDLGTDNIDS
DYSLEVGLDP ITGAAETQIE EDLTTITGET TDAIINPSND SLAKLTTSDE TDSMAISEAI
NALTTSEDVA SVESPTSTTS EAASSTDIVT AAIATAVDQN TI