Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_15281 |
Symbol | |
ID | 5730410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1362368 |
End bp | 1364086 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285906 |
Product | hypothetical protein |
Protein accession | YP_001551413 |
Protein GI | 159904069 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.493814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTG AAAAGAACAA AGCATTCACT TTGATTGAGA TGGCAGTTGT AATGGCTGTT ATCTCTATGC TCAGTGCCTT GCTAGGACCT CAGTTCCTGT CACTTGTGCG CCGTGGTAAT TCAGTTGGCG CAGGACAGGC TATGCGTCAG ATCAGAGATG AATGTCAAAG CGATTATATT TTTGGATCGC CTGGTACTTT TACACCTAGA AGAATCCCTC GATATTCATT GATCGGTAAT TGTGAATCTG CGAGTGCAAT CCCTAGAGAT GAAGAAAATA ACCCTCAATA TAGTTATGAC TCAGAAACTG GACTAATAAC TTGTAGCTAT AAGAATGCAG AGTCAACGGG GTTCCCAAGT TGCAAGAAAG TAGCTGTTGC AAAAGAAAAA CCTTCTTTAG GGGAGGTTGC TGTTATTACT GATGTCGCAT TAAATGAAGT GGAAGAGCCT GAAACCAATG GCTTAGATGT CGACCTTCTG AAGCTTTCTG ATGGAAATGG AGATAACACT ATCATTAGCT GTGATGGGAA GTATGTGACC TCTAGAGGAA GAGGGGGTTA CACAGAGAGA AATAATCAAT ATCGACAGAA TATAATTACT GGTGAAAAGG TATTAATCAC TTCAACAAAA GAAGGGATTC CATCAACAGG CGACAGTCAG CATATAAGAG GGATGTCTTG TGATGGTAGG TATGCTCTGT TTACTATTGA CGAGTATGGC TCAGTTGATC TTGACGGGTT GCCAGGAGCA ATAACAGGAG AAGAGTGTGA TCCAAGAGGA CAATGTGAAG GTCCTCAACC GATTTATCGA AAAGACCTTC TTACAGGAGA CGTAGTTCGG GTAGATACAC TTAGTAATGG AACAAAAATC CAGACGAAAT GGGGAATTAG GGATGCTTCA ATGTCTTCGG ATGGTAGATA TGTTATTTTT GAAAGTCCTG ATGTTCGATT TGCAGGATTA CCTGAAGCTG AATGGCAATC TAAAGACTGG TCGAGAACAT TGATGTATAG AAAAGACTTA TTAACAGGAG AACTTTCTAC TGTTACAACA GCACCAGATG GTTCTACTGG AACTGGGTGG GGATCCAATG GTGGATATGG GATGGGGATG AGTGATGATG GGTCAAAAGT AACTTTTATC TATAAAGGAG ATGATCTCGT AGAAGGTGTA AGTGGTACAA ACCTTTATGT CAAAGACTTT AATACTTCCA AAGTTTCTTT GGTTACAGCA GATAGTCAGG GGAATAGATT GTCTGACTTT GGTCAATATG GTTCTGGGTC ACAAATTTCG GCAGATGGAT CAAAAGTTAT CTTTACAAGT AGAGGTTCAA ATGGCGTCGC CCAACTTTAT ATGAAAGATC TGAGCTCTGG GAAGTTAAAG GTAATTAGTC AGAATTCAAA TGGTAATTCT GGCAATGGCT ACTCACACAC AGGTAAGTTC TCTGGTGATG GTAAATATGT TGTTTTCCAG TCATCAGCAA CGAATCTTAC GGATACTCCA ACTACGGGTA AAAGTGACAT TTTTGTCTAT GATACTTCTT CTAATCAAGT AAAGAGGTTA TTTGATGAGT CTTATGAATT TGATGATCAC CTTCGAGATC CAAATATAAC TAAAGAAGGG AAATATATTA CTTTTAGGTC TAGCAGTAAA GGTATTACAA CTGGTGAAAC CCAAGGTCAA GAAGTTTATA TGAAAAAAAA CCCCTACTTT GTTGAATAG
|
Protein sequence | MDIEKNKAFT LIEMAVVMAV ISMLSALLGP QFLSLVRRGN SVGAGQAMRQ IRDECQSDYI FGSPGTFTPR RIPRYSLIGN CESASAIPRD EENNPQYSYD SETGLITCSY KNAESTGFPS CKKVAVAKEK PSLGEVAVIT DVALNEVEEP ETNGLDVDLL KLSDGNGDNT IISCDGKYVT SRGRGGYTER NNQYRQNIIT GEKVLITSTK EGIPSTGDSQ HIRGMSCDGR YALFTIDEYG SVDLDGLPGA ITGEECDPRG QCEGPQPIYR KDLLTGDVVR VDTLSNGTKI QTKWGIRDAS MSSDGRYVIF ESPDVRFAGL PEAEWQSKDW SRTLMYRKDL LTGELSTVTT APDGSTGTGW GSNGGYGMGM SDDGSKVTFI YKGDDLVEGV SGTNLYVKDF NTSKVSLVTA DSQGNRLSDF GQYGSGSQIS ADGSKVIFTS RGSNGVAQLY MKDLSSGKLK VISQNSNGNS GNGYSHTGKF SGDGKYVVFQ SSATNLTDTP TTGKSDIFVY DTSSNQVKRL FDESYEFDDH LRDPNITKEG KYITFRSSSK GITTGETQGQ EVYMKKNPYF VE
|
| |