Gene P9303_01291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01291 
Symbol 
ID4776298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp138714 
End bp141674 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content45% 
IMG OID640085628 
Producthypothetical protein 
Protein accessionYP_001016149 
Protein GI124021842 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR01846] type I secretion system ABC transporter, HlyB family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.838373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTC CCGCCATGCG CTCAGAAACG ATTAAAATAT TGAGAGAGTT GCCAACATTT 
GCGGCGGCTA GACTCAAAAG CATTGAATTA TTAGCAGACG CAGCAGAAAA AGTCACCCTG
AACCAAGGTC AAACTCTATT GCGTGCAGGT GAAGTTGAAA GCCATTGTTT CCTGCTGCTT
GATGGTTCAC TCCGTTTGCT TGCGCAGACT CCCTTCTACA ACGATCTCTT CACCGTTGGC
AAGCTTGAAA AAGGAGAACT CATAGGTTTT ATTGATCTCC TGCGACAAGG ATCCTGCGAA
GCAGCAATTG GCCGGAGGCC TTGCTCATTA CTCAGCTTCC CTGGGTCTCT GATTCTGAAA
CTGCTGCAAG ACGATTCAGG GTTGCGAAAC GGACTGGAAA AACTGCAGAG TCCGTGTGAA
GGCGCTTGTG TCCTTCAAAC AGTAATCAAG CAATTAAATC CCCCACCATT GGATGGTCAA
GCCTGGATAA TGGATCAACT CAAGGCATCT AACACAACGC TGAAGGGTCA TAACCTGCTA
AGTACAATAA TAGTAGGTTA TGAAGAATGC GTTGGGCAAC AAATCAGTGC AGAAAAGCAT
GAGATACTTG TCAACAAAAG CATCTTGCCG CTGCGGTTCT GGCATTGGAC GCCCGCCAAA
CCTGAGGGGA AGTTGACGAA TATTCAAGAA CAGCAAAATA GTGGTGAAGT AGAAGCAGAA
AACTCAATTT CTACTCGTAA ATGGGAAGCA AACAAAAGCT TAGATCTCAC TGAGTTAGGC
CTAGGTCTAC GTGAAGCACA CTCCGATTCT GATCTACAAG GATTCAAACT TTTGCGCGGC
CAAGGTCAGG TTGGCGCGAA CTTGGCAACC CTGAGGATGG TGGCTCGTGC TTACGACACG
CCCTGTCCAG TCGATGTAAT AGAGAGGGTG CTCGAAGGTG CTGTGGACCG CGCAGGGTCA
ATACCGATCC AGGTAATGGG TCAGCTCGCT GAAAGCATGG GTCTTCAAAC CCAAGTTGGC
TCAATAAATT TTGCACAATT ACATAAACTA GAATTACCTG TACTTGTCAA AAATAAAAGA
CATTATGCTC TGCTAACCGA AGTACGAGAG AAAACGTTAC TGCTTGCAGA TCCAGTGAAG
GGATTGATTA AGTTGCCCTT TGAAGAAGGA AAGGAGCAAT GGGGCGATCA AGTAGAGGTA
GTGCTTCTTA AACGTCTAAA TGACACTCCT TTTCGTCAGT TTGGATGGAA CTGGTTCACA
CCTGTTGTGA GACGTTTCCG CTGGCCTCTC ATTCAGGTCG TGCTGGCTTC GCTGTTTATT
CAACTGTTCC AGCTAGCCAA TCCGCTACTG CTGCAGCAAA TCATTGACAA AGTTATCAAC
CAAAGCAACC TATCTGCCTT ACAAGTCCTA GGAGCAGCGA TGGTTGCGTC AGCACTCTTC
CAAGGTTTGC TTACAGCTGT ACGGACCTGG CTGCTCATTG ATACGACAGA CCGCATGGAC
CTTGTCTTAG GAACTCAAGT CATCGACAAA CTGCTAAGAC TGCCATTGCG GTTTTTCGAA
AAAAGGCCAG TAGGCGAGCT CTCACAAAGG TTAGGCGAAC TAGGCAACCT TAGAGGCTTC
CTAACAGGTA CAGCTATCAC CAGTTTACTA GATCTTCTAT TTGCAACGAT CTATATCCTG
ATCATGCTGA TATATAGCCC ACTACTTACG GCTGTGGCAC TTGGCACGAT TCCGATATAC
ATCATGATGG TCTTATTTAT TGTACCCATT TATCGTAGAT TGATACGACG TCAGGCTCAA
CATGCAGCAG CTACTCAGAG CCATCTTATT GAAACACTGA GTGGAATTCA AACGGTTAAA
GCTCAACATT TTGAACTCAA TTCACGCTGG CGCTGGCAAG AACGCTATTC AAGTCAAATC
GCCGAACAAT TTAAAAGTGT GGTCCTAGGA AGCAGCGCCA GTGAAATAGG CAACTTTTTA
AACCAACTTA GTTCACTACT AATTATTTGG GTAGGCGTTT ACCAAGTGAT TAACGGTCAA
TTAAGTCTTG GTCAGCTGAT TGCTTTTAGA ATTATTGCTG GTTATGTAAC TGGCCCAATC
CTCCGCCTTT CAAGTCTTTG GCAGGGTTTC CAGCAAGTGG CTATCTCAAT GGAGCGCCTA
GCAGATATTG TTGACCAAGT ACCGGAAACA GGTGAAGAGG ACGCAGGACA GATCGCCTTG
CCTCCAATTA AAGGAAAAGT TAAGTTCGAT TCCTTAGACT TTAGATTCGG CAAAAGTGGA
CCGAACCAAA TCAACGGATT GGATCTTGAG GTTTCAGCAG GTAGTTTCGT AGGGATTGTT
GGTCAAAGCG GTAGCGGAAA GAGCACACTC ATGAAACTTT TACCCAGGCT TTATGAACCA
GATGTGGGTA GAATCCTGAT CGATGGATAT GACATATCTA AAGTTAGTTT AAATAGTGTA
AGGCAACAGA TCGGAATCGT TCCTCAAGAA TGCTTACTCT TTGAAGGAAC AGTGAGAGAT
AACATCACCA TGAATCACCC TGAAGCAGAT ACTGAATCTG TGATCAGGGT TTCTCGTGCA
TCAGCTGCAC ACGAGTTCAT TATGGAATTA TCTGATGGCT ATAACACTCG CATAGGTGAA
CGGGGCGCTG GGCTAAGTGG TGGTCAGAAA CAAAGGATAG CTATAGCTAG AACTTTATTA
CAGAATCCAA ATATGTTAGT GTTAGATGAA GCAACAAGCG CATTGGACTA TGATACGGAA
GCGATAGTTT GTAATTACCT ACAAAAGGCA TTGAAAGAGA AGACCGTCTT CTTCATCACT
CACAGACTAA GTACAGTTAG AAATGCCGAT TGGATTGTGT TGATGCATCA GGGCACTATC
TCAGAACAAG GAACGCATCA CGATCTAATG TCGATGGGTG GACGCTATGC CACACTATAT
TCACATCAAG GAGATTCATA A
 
Protein sequence
MNAPAMRSET IKILRELPTF AAARLKSIEL LADAAEKVTL NQGQTLLRAG EVESHCFLLL 
DGSLRLLAQT PFYNDLFTVG KLEKGELIGF IDLLRQGSCE AAIGRRPCSL LSFPGSLILK
LLQDDSGLRN GLEKLQSPCE GACVLQTVIK QLNPPPLDGQ AWIMDQLKAS NTTLKGHNLL
STIIVGYEEC VGQQISAEKH EILVNKSILP LRFWHWTPAK PEGKLTNIQE QQNSGEVEAE
NSISTRKWEA NKSLDLTELG LGLREAHSDS DLQGFKLLRG QGQVGANLAT LRMVARAYDT
PCPVDVIERV LEGAVDRAGS IPIQVMGQLA ESMGLQTQVG SINFAQLHKL ELPVLVKNKR
HYALLTEVRE KTLLLADPVK GLIKLPFEEG KEQWGDQVEV VLLKRLNDTP FRQFGWNWFT
PVVRRFRWPL IQVVLASLFI QLFQLANPLL LQQIIDKVIN QSNLSALQVL GAAMVASALF
QGLLTAVRTW LLIDTTDRMD LVLGTQVIDK LLRLPLRFFE KRPVGELSQR LGELGNLRGF
LTGTAITSLL DLLFATIYIL IMLIYSPLLT AVALGTIPIY IMMVLFIVPI YRRLIRRQAQ
HAAATQSHLI ETLSGIQTVK AQHFELNSRW RWQERYSSQI AEQFKSVVLG SSASEIGNFL
NQLSSLLIIW VGVYQVINGQ LSLGQLIAFR IIAGYVTGPI LRLSSLWQGF QQVAISMERL
ADIVDQVPET GEEDAGQIAL PPIKGKVKFD SLDFRFGKSG PNQINGLDLE VSAGSFVGIV
GQSGSGKSTL MKLLPRLYEP DVGRILIDGY DISKVSLNSV RQQIGIVPQE CLLFEGTVRD
NITMNHPEAD TESVIRVSRA SAAHEFIMEL SDGYNTRIGE RGAGLSGGQK QRIAIARTLL
QNPNMLVLDE ATSALDYDTE AIVCNYLQKA LKEKTVFFIT HRLSTVRNAD WIVLMHQGTI
SEQGTHHDLM SMGGRYATLY SHQGDS