Gene Information |
Locus tag | PMN2A_1227 |
Symbol | |
ID | 3606620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1719155 |
End bp | 1722268 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637688102 |
Product | hypothetical protein |
Protein accession | YP_292420 |
Protein GI | 72383065 |
COG category | |
COG ID | |
TIGRFAM ID | |
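The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


length = gene_length(1719155, 1722268)
print(length)                  # 3114
print(protein_length(length))  # 1037
```

Both results match the Gene Length (3114 bp) and Protein Length (1037 aa) fields of this record.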
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAACG CGACTAGTAC CCAACTTCAA GAGCTATACG TTGCTTACTT CGGACGCGCT GCGGATCCCA CTGGTCTTGA TTACTGGACG GAAAGAGGGA TCACTACCGC GGCATTTGCA GCAAATATGT ATGCTCAGCC CGAATTCACA AGCGAGTACG GAACTCTCTC CATCGAGTCA CAAGTAAACC AGATTTATAA AAATCTATTC GATAGAGATG CCGATGTAGC TGGCTTGACT TATTGGTCTC AGCAAATTCG TTTAGGTAAT TTGCAATTAG CTGAGATTGC AAATGATCTT ATTTGGGCTG CTCAAAATAA TTCTGGAAGC GCAGATGATA AAGCAGCCTT AAGTAATAGA ACTGAAGCAG CAGTAGCTTA TACAGCCAAA ATTAAAGAGA CTACAGCTGG AATACTTTCT TATCAGCCTC TAAATGATGG TCTAGCAGCT GATTCTACTT TTGCTGCTGG TAACAACATC ATCGCTGCAA GAAATTATCT ATCAACAATA GATAAAGATA CAGCTTCAAC TGCTGCAGGT ATAGCTGCAA GTGTCGCAAC CATTACTTCA AATGGGGTGC CAACAACTGC TACTGCATCT AAGACATTAA CTCTCACAAC TAATCAGGAT TCAGTCACTG GTGGTGCTGG TAACGACAAG ATTAATGGAG TTATTGTTGG CGGTGCTACT GGTACGACCA TCCAAGCTGG TGACGTAATT GATGGTGGAT CAGGTGTAGA TACTTTAAAC ATCGCTGTTT CTGGTGATGC AGCTACAACT CTTAGCGGTG TTCAAGCAAC TGGCATTGAA AAGGTTTTAC TAACTAACTT TGATGGACCT ACTGGTGTTA CAACCGTAGA TACAACTCTG CTCGGTGCAC CTTCAACAGT TGGTTTGAGT TCTTCTGGAT CTGATGGAGA CACTGTTTTC CAAGGAATGA CTGTGTTGAC AAATGCAGAG TTGAGAAATG GTGCGGCTGA CTTAACCCTT ACTCATACTT CAACTGCTGT ATCTGGCTCA AGTGACGCAA TTACTCTTGA TGTGAGTAAT CTTTCTGGCG GTACCTTCAC AGCAGCTTCT ATCGAAACTT TGACAATCAA CTCAACTCTG ACTCAAAGTC AGTTGACTGA TGTCGTAATC GATAACGCTA CAGCATTAAA CGTAACTGGC TCTGCGAATC TTCAAATCGT TGGTGATGTA GATTTTGCAG ACAATGCCAC AACAACAGCT ATTGATGGAA CTGTAGATGC ATCTGCATTC ACCGGTAACC TTTCTATACA GCTCAATACA GGAGATATAG CTTCAGTAAC TGGTGGATCA GGTGATGATT TCGTTGAGTT TTCTACTGGC TTTACAAGTG CTGATGTTGT TGATGGAGGC GCCGGAACAG ATACTTTAAG CTTGGATTTG GGTAATGCCA CCCTTACAAG TACAACATTC GCAAATGTAA GTAACGTTGA GGTTTTAGAA GTCAATCCAA CTAACGATTC GGCTGTAGTT AATGCAGATG GAGTTGGTTC ATTTACAACT CTAAAAGCAG CTGGGCACAC AAAAACAGTT CATGTGACAG GTTCTTTGAA CGCAGCTGGC GATGATTATT CATACTCTCT AAACGGTGTA TCAACATCAA TTGCTTCCGC TACGGGCACC AATACTGACG CTGAAGTTGC TGCTGAACTA GCTGCATCTA TCAACGCCTT GACAGGATTT ACTGCCACAG CTGTCGACGT AGACGGAACA GGTGATGGTG TTTTTATTAC CAGAACATCT GACACCGGAG ACACAATCAA TTTCGACAGT 
ATTACTGGTA CCGGTAATAT TGCTGTTGCA GAAGTTGGAT ACTCAAACGT TTCATTCACA AATTTAACTG ATCAAACTGT AGAAATTAGT TCAGGGGCTC AAGTAACTGC TTCTTTGAAA GATCCATCAG GAACTGCAGA TTCGCTAACC GTAAACCTAA CTTCACCATC TGGTGACAAG GCTTTAACAC AAACAATTGA GCAAATTTCA GCTGGTGACA TTGAGACACT TACTATTAAT ACCTCAGGCT TATCAGCTAA CACAGTTGAT TATGTAGTAT CTACTCTTAC TGACGGTGGT ACAAACGCCC TGACTACTTT GAAGATCACT GGTGACTCTT CTCTTGATAT TGATGGCACA ATAACTGCTT CAAAACTTGT CACTGTTGAT GCTTCTACTT TTACTGGAGA CTTGCAACTC GATGGAGTAG CTGCTAATCA AACGATTACT ACTGGTTCAG GTGCTGATTC ATTAGTCTTC GGTTCTAATC TAAACAATGC AGACACCGTT GATGGAGGAG CAGGCACAGA TACACTTTCT GCAACTGTAA CGGGTTTAAC TGCTACGACC GGTGCTCTAA ATGTCTCTAA TGTAGAAACT CTGAACCTAA CTAATGGCGG AGTATTTGTT TTTGACGCTT CCAAAGTTAC TGGAGCCAGT GAAATTGCTG TTACAACAAA CACAACTTCA ACAACAATCA CTAATCTGGC TGCTGGAGTA AAAGTTGGAG CAGGTCTCAA CAATACCGAT GGTGATGTAG ATGGTTTATT CGATATCAGT CTTGCTGATT CCTCTGGAAC TTCAGATTCA CTTACGTTCA ACCTGAATGA CACAGATGGA ACTACTCCAA ATACAAACAC TATTGAAGTA AAAGCAACTG GAATTGAGAC AGTTACATTT GACGTCACTG ACGACACTGA CACCTCAAAT GCCAACACAA GTCTTGATGT TGACTCACTT AATGCAGCAA AGATTGTTGT AGTCGGTTCA GCTGCTGATG CTGGACAGAC AATGACGCTT AATACTCTTG ATACTGATAC AACAGCTGTT GATGCGACTG GTTATTTTGG AATACTAACT GCTACTGCAG GTACTGCTAT AGCTACTACC TTTGACCTTA AAGGTGACAG AGCTCATAAC ATTACTGGTT CATCTAAAAA TGACACCTTC ACTATCGGAG AAACCACTAA TGCCGATATC ACTGTCAACG GTAATGGAGG AACTGATGTT CTTAACCTTA CCCTTGGTGA TGGAGATGCG ATCACTGATA ATGTCTCAGA TGTTGACACT ATTAACCTGA TTATCTCAGG TTAG
|
Protein sequence | MTNATSTQLQ ELYVAYFGRA ADPTGLDYWT ERGITTAAFA ANMYAQPEFT SEYGTLSIES QVNQIYKNLF DRDADVAGLT YWSQQIRLGN LQLAEIANDL IWAAQNNSGS ADDKAALSNR TEAAVAYTAK IKETTAGILS YQPLNDGLAA DSTFAAGNNI IAARNYLSTI DKDTASTAAG IAASVATITS NGVPTTATAS KTLTLTTNQD SVTGGAGNDK INGVIVGGAT GTTIQAGDVI DGGSGVDTLN IAVSGDAATT LSGVQATGIE KVLLTNFDGP TGVTTVDTTL LGAPSTVGLS SSGSDGDTVF QGMTVLTNAE LRNGAADLTL THTSTAVSGS SDAITLDVSN LSGGTFTAAS IETLTINSTL TQSQLTDVVI DNATALNVTG SANLQIVGDV DFADNATTTA IDGTVDASAF TGNLSIQLNT GDIASVTGGS GDDFVEFSTG FTSADVVDGG AGTDTLSLDL GNATLTSTTF ANVSNVEVLE VNPTNDSAVV NADGVGSFTT LKAAGHTKTV HVTGSLNAAG DDYSYSLNGV STSIASATGT NTDAEVAAEL AASINALTGF TATAVDVDGT GDGVFITRTS DTGDTINFDS ITGTGNIAVA EVGYSNVSFT NLTDQTVEIS SGAQVTASLK DPSGTADSLT VNLTSPSGDK ALTQTIEQIS AGDIETLTIN TSGLSANTVD YVVSTLTDGG TNALTTLKIT GDSSLDIDGT ITASKLVTVD ASTFTGDLQL DGVAANQTIT TGSGADSLVF GSNLNNADTV DGGAGTDTLS ATVTGLTATT GALNVSNVET LNLTNGGVFV FDASKVTGAS EIAVTTNTTS TTITNLAAGV KVGAGLNNTD GDVDGLFDIS LADSSGTSDS LTFNLNDTDG TTPNTNTIEV KATGIETVTF DVTDDTDTSN ANTSLDVDSL NAAKIVVVGS AADAGQTMTL NTLDTDTTAV DATGYFGILT ATAGTAIATT FDLKGDRAHN ITGSSKNDTF TIGETTNADI TVNGNGGTDV LNLTLGDGDA ITDNVSDVDT INLIISG
|
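The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It builds a codon table from the standard NCBI layout (bases ordered T, C, A, G), which translation table 11 shares with the standard code for internal codons; table 11 differs only in its set of permitted start codons, which does not matter here because the gene begins with ATG. The helper `translate` and the two excerpt variables are illustrative names, and only the first 30 and last 24 bases of the gene sequence are used rather than the full record.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGACCAACG CGACTAGTAC CCAACTTCAA"  # start of the gene sequence
last_24 = "ATTAACCTGA TTATCTCAGG TTAG"        # end of the gene sequence

print(translate(first_30))  # MTNATSTQLQ
print(translate(last_24))   # INLIISG
```

The two outputs agree with the first ten and last seven residues of the protein sequence listed above.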