Gene Information |
Locus tag | A9601_14311 |
Symbol | |
ID | 4718152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1203163 |
End bp | 1206069 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 640079152 |
Product | hypothetical protein |
Protein accession | YP_001009821 |
Protein GI | 123968963 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | [TIGR01846] type I secretion system ABC transporter, HlyB family |
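The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information table for locus A9601_14311.
START_BP = 1203163
END_BP = 1206069
GENE_LENGTH_BP = 2907
PROTEIN_LENGTH_AA = 968


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the CDS ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assertion failed for another record, the likely causes would be a different coordinate convention (for example 0-based or half-open ranges) or a partial CDS lacking a stop codon.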
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATGG ATAAAGATAG TTACTTTAAA AATTTAGATC TTGATAAGGA GATAATTAAA ATAATTGAAA AAGATATTTT TTTAGAGAAT TATTCAGTAG GTGAAGAAAT ATTCAATCCT GAAATCACAA TTAATAAAGT TTCAATTATT TTATCTGGAA GTATTAGACA AATAAAAAGA GATTCTACAA ATAATACCAA TATTTATAAA TATGTTAAAA ATGATTTTTT ATTTATTCCT GAATTGATTT ATAAACTTAA AAATTCTTTT TACTATATAG CAGCAAATGA CTTGCAGTTA ATATCTATTG AAAAGGAAAA ATTTTTAAAT TTATTAAAGG AAAATAATGA ATTTCGTAAA TGGATAAATA ATCAAATCTT CAAAAATGAG AAAATTTCAA TTTTAAATAA ATTATTAAAG GAGGAATTTA ATAACAATTT TGATAAAGAA CAACTTCTTA ATAATCTTTC AGAAAATATA GACCTTGTTA ATGAGAAAAT TTTGAAAAAT ATTCAGGATA AAAAGATTGA TTCAAAAAAT TTTGAAATTA TTTCAATTTC TAAATCAATT CACTTTGATT ACCTTGAAAA AATAAATTTC GAGAAGATAT TAGGTTTAGA TTTTTCAGAA CTAGAAAGAT TAGTAATTAT TAATAATAAA TTCAAGAAAT TTAAAATTCA TAAAAATAAA ATTCCCAAAG TAGATAAAAT TTCTCAAATG GTTGAAGAAT CTATAGATGA ATCAAAAAAA GAATTACATT ATGCAGATAT AAATATTAAA AAAAATGTTT GCAATAGGAA AGATAATGTT ATTGAATGTT TTAGGATTTT AAGTAAGTTA ATTGATATTA ATTATCGAGT TGATCCAATA AGCAATTATT TAGATTTTCT TGATAAGAAT AAAAAGAAAT ATACTTTTAG GAATTACGCA GAAATTGCTT ATGGATTAGG TTTAGAGGTA TCTTGTGGAG AACTTAGCAT ATCGCAAGTA CTTAAGGTTA AAACGCCTTC ATTAATAATT TATAAAAATG ATTTAGCTTT AGTAGTTAAT GCAGACAGAG AAAAACTGAC TCTAATTTAC CCTGCAGATG GATTGATTAC TTTGTATAAA AATGATTTAG AGAAAATATA TGAGGGAAAT ATTAACATCA TAAATATTTC AAAAAATCGT TTAACTCAAG AAAACAAATT CTCAATAAGT TGGTTTATTC CAATTTTAAA AGAATATAAA AATACTTTAT TCCAAATTTT AATTTCAGGA TTAGTTGTAC AAATATTTAT ATTGTCGAAT CCATTATTAA TTCAAGTAAT AATTGATAAG GTTATATCGC AAAGAAGTCT AGATACACTA CAAGTATTAG GATTCGCACT ATTAGTAATT ACAGTTATTG AAGCAGTATT ATCAAGTATA AAATCTTTTA TCCTTTCAGA AACTACTAAT AGAATAGATC AAAAATTAGG AATTAAAATA ATTGATCATT TATTTAGATT ACCTCTTGAA TATTATGACA AAAGATCTAT AGGAGAATTA TCTAATAGGG TAGGTGAACT TGAGAAAATT AGGAACTTTT TGACTAGTCA AGGTATTAAT ACTTTTTTAG ATGCGTCATT TTCTTTATTT TATATTTTCG TACTATTTTT ATATAGCGGT AAGCTTACAT TAATAGCTTT AAGTGTTATC CCAATTCAGA TTTTAATTAC ATACTATGGA TCGCCACTTT TTAAAAAACA ATATCGGAAA GCAGCTATTA ATAATGCAAA TACTCAAAGT TATTTAGTAG AAGTGCTATC TGGTATCCAA 
ACGGTAAAAA CACAAAATGC AGAAACCTCA AGTCGCTGGA GATGGCAGAA TTACTATTCA AAATTTATTA AGAGTACATA CCAAAAAACG ATTACAGCTG TTTCATTAAA TCAACTTACT CAATCTCTGC AAAAAATTTC TCAATTAATA GTTTTATGGT ATGGAGCAAT AATGGTTTTA AACGGTGAAT TTACTCTTGG TCAACTAATT GCATTTAGAA TCATTTCTGG ATATGTAACA CAACCAATTT TAAGGTTGAG CACTATATGG CAACAGTACC AGGAAATAAA AATTAGTTTT GAAAGATTGG GAGATATTGT TAATACTCCA AAAGAAAATG AATCAAAAGA TTTAGGAAAA ATTCAACTGC CAAGTGTTGA GGGGAATATT TTATTTGATA ATGTATCATT TAAATTTATT GGCGACTCCA AAACAACTCT GAATAAAATC AACTGTCAAA TTGATAAAAA TTCTTTTGTT GGAATTGTTG GTAAAAGTGG AAGTGGTAAA AGTACATTTT GTAAATTAAT TTCTAGGCTT TATGTACCTA ATGAGGGGTC TATTTTAATT GATAAATACG ATATCCAAAA GGTAGAAATA AGTTCAATTA GAAGGCAATT AGGGATAGTT AGTCAAGACC CTTTACTTTT CGCTGGAACA ATAAGAGATA ATATATGTTT TGGTGATGAA AGTTTTTCTG ATAAGGAGAT TGTAGAAGCA TCAAAAATAT GTTGCGCCCA TGAATTTATT ATGGAACTTC CATTGGGATA CAATACAAAA ATTTCTGAGA AAGGAAGTTC ATTAAGTGGG GGACAACGTC AGAGAATTGC ATTAGTAAGA GCATTATTAA AAAAACCAAA AATAATTATC TTAGATGAAG CAACAAGTGC TTTAGATATA GAAACTGAAC AACTATTTGT TAAAAATCTA TTAAATAAAT TTAAAAATTC AACAATAATA ATTATTACGC ATAGATTATC TAACGTTATA AATGCAGATA AAATTCTTGT TTTTGAAAAA GGTGACTTAT CTGAACAAGG AGATCATGAA TCACTTCTTA AAAACAAATC AGTATATTAT TCACTCTTAA ATAATGAGGA AAAATAA
|
Protein sequence | MNMDKDSYFK NLDLDKEIIK IIEKDIFLEN YSVGEEIFNP EITINKVSII LSGSIRQIKR DSTNNTNIYK YVKNDFLFIP ELIYKLKNSF YYIAANDLQL ISIEKEKFLN LLKENNEFRK WINNQIFKNE KISILNKLLK EEFNNNFDKE QLLNNLSENI DLVNEKILKN IQDKKIDSKN FEIISISKSI HFDYLEKINF EKILGLDFSE LERLVIINNK FKKFKIHKNK IPKVDKISQM VEESIDESKK ELHYADINIK KNVCNRKDNV IECFRILSKL IDINYRVDPI SNYLDFLDKN KKKYTFRNYA EIAYGLGLEV SCGELSISQV LKVKTPSLII YKNDLALVVN ADREKLTLIY PADGLITLYK NDLEKIYEGN INIINISKNR LTQENKFSIS WFIPILKEYK NTLFQILISG LVVQIFILSN PLLIQVIIDK VISQRSLDTL QVLGFALLVI TVIEAVLSSI KSFILSETTN RIDQKLGIKI IDHLFRLPLE YYDKRSIGEL SNRVGELEKI RNFLTSQGIN TFLDASFSLF YIFVLFLYSG KLTLIALSVI PIQILITYYG SPLFKKQYRK AAINNANTQS YLVEVLSGIQ TVKTQNAETS SRWRWQNYYS KFIKSTYQKT ITAVSLNQLT QSLQKISQLI VLWYGAIMVL NGEFTLGQLI AFRIISGYVT QPILRLSTIW QQYQEIKISF ERLGDIVNTP KENESKDLGK IQLPSVEGNI LFDNVSFKFI GDSKTTLNKI NCQIDKNSFV GIVGKSGSGK STFCKLISRL YVPNEGSILI DKYDIQKVEI SSIRRQLGIV SQDPLLFAGT IRDNICFGDE SFSDKEIVEA SKICCAHEFI MELPLGYNTK ISEKGSSLSG GQRQRIALVR ALLKKPKIII LDEATSALDI ETEQLFVKNL LNKFKNSTII IITHRLSNVI NADKILVFEK GDLSEQGDHE SLLKNKSVYY SLLNNEEK
|
| |