Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01291 |
Symbol | |
ID | 4776298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 138714 |
End bp | 141674 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640085628 |
Product | hypothetical protein |
Protein accession | YP_001016149 |
Protein GI | 124021842 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | [TIGR01846] type I secretion system ABC transporter, HlyB family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.838373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTC CCGCCATGCG CTCAGAAACG ATTAAAATAT TGAGAGAGTT GCCAACATTT GCGGCGGCTA GACTCAAAAG CATTGAATTA TTAGCAGACG CAGCAGAAAA AGTCACCCTG AACCAAGGTC AAACTCTATT GCGTGCAGGT GAAGTTGAAA GCCATTGTTT CCTGCTGCTT GATGGTTCAC TCCGTTTGCT TGCGCAGACT CCCTTCTACA ACGATCTCTT CACCGTTGGC AAGCTTGAAA AAGGAGAACT CATAGGTTTT ATTGATCTCC TGCGACAAGG ATCCTGCGAA GCAGCAATTG GCCGGAGGCC TTGCTCATTA CTCAGCTTCC CTGGGTCTCT GATTCTGAAA CTGCTGCAAG ACGATTCAGG GTTGCGAAAC GGACTGGAAA AACTGCAGAG TCCGTGTGAA GGCGCTTGTG TCCTTCAAAC AGTAATCAAG CAATTAAATC CCCCACCATT GGATGGTCAA GCCTGGATAA TGGATCAACT CAAGGCATCT AACACAACGC TGAAGGGTCA TAACCTGCTA AGTACAATAA TAGTAGGTTA TGAAGAATGC GTTGGGCAAC AAATCAGTGC AGAAAAGCAT GAGATACTTG TCAACAAAAG CATCTTGCCG CTGCGGTTCT GGCATTGGAC GCCCGCCAAA CCTGAGGGGA AGTTGACGAA TATTCAAGAA CAGCAAAATA GTGGTGAAGT AGAAGCAGAA AACTCAATTT CTACTCGTAA ATGGGAAGCA AACAAAAGCT TAGATCTCAC TGAGTTAGGC CTAGGTCTAC GTGAAGCACA CTCCGATTCT GATCTACAAG GATTCAAACT TTTGCGCGGC CAAGGTCAGG TTGGCGCGAA CTTGGCAACC CTGAGGATGG TGGCTCGTGC TTACGACACG CCCTGTCCAG TCGATGTAAT AGAGAGGGTG CTCGAAGGTG CTGTGGACCG CGCAGGGTCA ATACCGATCC AGGTAATGGG TCAGCTCGCT GAAAGCATGG GTCTTCAAAC CCAAGTTGGC TCAATAAATT TTGCACAATT ACATAAACTA GAATTACCTG TACTTGTCAA AAATAAAAGA CATTATGCTC TGCTAACCGA AGTACGAGAG AAAACGTTAC TGCTTGCAGA TCCAGTGAAG GGATTGATTA AGTTGCCCTT TGAAGAAGGA AAGGAGCAAT GGGGCGATCA AGTAGAGGTA GTGCTTCTTA AACGTCTAAA TGACACTCCT TTTCGTCAGT TTGGATGGAA CTGGTTCACA CCTGTTGTGA GACGTTTCCG CTGGCCTCTC ATTCAGGTCG TGCTGGCTTC GCTGTTTATT CAACTGTTCC AGCTAGCCAA TCCGCTACTG CTGCAGCAAA TCATTGACAA AGTTATCAAC CAAAGCAACC TATCTGCCTT ACAAGTCCTA GGAGCAGCGA TGGTTGCGTC AGCACTCTTC CAAGGTTTGC TTACAGCTGT ACGGACCTGG CTGCTCATTG ATACGACAGA CCGCATGGAC CTTGTCTTAG GAACTCAAGT CATCGACAAA CTGCTAAGAC TGCCATTGCG GTTTTTCGAA AAAAGGCCAG TAGGCGAGCT CTCACAAAGG TTAGGCGAAC TAGGCAACCT TAGAGGCTTC CTAACAGGTA CAGCTATCAC CAGTTTACTA GATCTTCTAT TTGCAACGAT CTATATCCTG ATCATGCTGA TATATAGCCC ACTACTTACG GCTGTGGCAC TTGGCACGAT TCCGATATAC ATCATGATGG TCTTATTTAT TGTACCCATT TATCGTAGAT TGATACGACG TCAGGCTCAA CATGCAGCAG CTACTCAGAG CCATCTTATT GAAACACTGA GTGGAATTCA AACGGTTAAA GCTCAACATT TTGAACTCAA TTCACGCTGG CGCTGGCAAG AACGCTATTC AAGTCAAATC GCCGAACAAT TTAAAAGTGT GGTCCTAGGA AGCAGCGCCA GTGAAATAGG CAACTTTTTA AACCAACTTA GTTCACTACT AATTATTTGG GTAGGCGTTT ACCAAGTGAT TAACGGTCAA TTAAGTCTTG GTCAGCTGAT TGCTTTTAGA ATTATTGCTG GTTATGTAAC TGGCCCAATC CTCCGCCTTT CAAGTCTTTG GCAGGGTTTC CAGCAAGTGG CTATCTCAAT GGAGCGCCTA GCAGATATTG TTGACCAAGT ACCGGAAACA GGTGAAGAGG ACGCAGGACA GATCGCCTTG CCTCCAATTA AAGGAAAAGT TAAGTTCGAT TCCTTAGACT TTAGATTCGG CAAAAGTGGA CCGAACCAAA TCAACGGATT GGATCTTGAG GTTTCAGCAG GTAGTTTCGT AGGGATTGTT GGTCAAAGCG GTAGCGGAAA GAGCACACTC ATGAAACTTT TACCCAGGCT TTATGAACCA GATGTGGGTA GAATCCTGAT CGATGGATAT GACATATCTA AAGTTAGTTT AAATAGTGTA AGGCAACAGA TCGGAATCGT TCCTCAAGAA TGCTTACTCT TTGAAGGAAC AGTGAGAGAT AACATCACCA TGAATCACCC TGAAGCAGAT ACTGAATCTG TGATCAGGGT TTCTCGTGCA TCAGCTGCAC ACGAGTTCAT TATGGAATTA TCTGATGGCT ATAACACTCG CATAGGTGAA CGGGGCGCTG GGCTAAGTGG TGGTCAGAAA CAAAGGATAG CTATAGCTAG AACTTTATTA CAGAATCCAA ATATGTTAGT GTTAGATGAA GCAACAAGCG CATTGGACTA TGATACGGAA GCGATAGTTT GTAATTACCT ACAAAAGGCA TTGAAAGAGA AGACCGTCTT CTTCATCACT CACAGACTAA GTACAGTTAG AAATGCCGAT TGGATTGTGT TGATGCATCA GGGCACTATC TCAGAACAAG GAACGCATCA CGATCTAATG TCGATGGGTG GACGCTATGC CACACTATAT TCACATCAAG GAGATTCATA A
|
Protein sequence | MNAPAMRSET IKILRELPTF AAARLKSIEL LADAAEKVTL NQGQTLLRAG EVESHCFLLL DGSLRLLAQT PFYNDLFTVG KLEKGELIGF IDLLRQGSCE AAIGRRPCSL LSFPGSLILK LLQDDSGLRN GLEKLQSPCE GACVLQTVIK QLNPPPLDGQ AWIMDQLKAS NTTLKGHNLL STIIVGYEEC VGQQISAEKH EILVNKSILP LRFWHWTPAK PEGKLTNIQE QQNSGEVEAE NSISTRKWEA NKSLDLTELG LGLREAHSDS DLQGFKLLRG QGQVGANLAT LRMVARAYDT PCPVDVIERV LEGAVDRAGS IPIQVMGQLA ESMGLQTQVG SINFAQLHKL ELPVLVKNKR HYALLTEVRE KTLLLADPVK GLIKLPFEEG KEQWGDQVEV VLLKRLNDTP FRQFGWNWFT PVVRRFRWPL IQVVLASLFI QLFQLANPLL LQQIIDKVIN QSNLSALQVL GAAMVASALF QGLLTAVRTW LLIDTTDRMD LVLGTQVIDK LLRLPLRFFE KRPVGELSQR LGELGNLRGF LTGTAITSLL DLLFATIYIL IMLIYSPLLT AVALGTIPIY IMMVLFIVPI YRRLIRRQAQ HAAATQSHLI ETLSGIQTVK AQHFELNSRW RWQERYSSQI AEQFKSVVLG SSASEIGNFL NQLSSLLIIW VGVYQVINGQ LSLGQLIAFR IIAGYVTGPI LRLSSLWQGF QQVAISMERL ADIVDQVPET GEEDAGQIAL PPIKGKVKFD SLDFRFGKSG PNQINGLDLE VSAGSFVGIV GQSGSGKSTL MKLLPRLYEP DVGRILIDGY DISKVSLNSV RQQIGIVPQE CLLFEGTVRD NITMNHPEAD TESVIRVSRA SAAHEFIMEL SDGYNTRIGE RGAGLSGGQK QRIAIARTLL QNPNMLVLDE ATSALDYDTE AIVCNYLQKA LKEKTVFFIT HRLSTVRNAD WIVLMHQGTI SEQGTHHDLM SMGGRYATLY SHQGDS
|
| |