Gene PMN2A_1227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1227 
Symbol 
ID3606620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1719155 
End bp1722268 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content42% 
IMG OID637688102 
Producthypothetical protein 
Protein accessionYP_292420 
Protein GI72383065 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAACG CGACTAGTAC CCAACTTCAA GAGCTATACG TTGCTTACTT CGGACGCGCT 
GCGGATCCCA CTGGTCTTGA TTACTGGACG GAAAGAGGGA TCACTACCGC GGCATTTGCA
GCAAATATGT ATGCTCAGCC CGAATTCACA AGCGAGTACG GAACTCTCTC CATCGAGTCA
CAAGTAAACC AGATTTATAA AAATCTATTC GATAGAGATG CCGATGTAGC TGGCTTGACT
TATTGGTCTC AGCAAATTCG TTTAGGTAAT TTGCAATTAG CTGAGATTGC AAATGATCTT
ATTTGGGCTG CTCAAAATAA TTCTGGAAGC GCAGATGATA AAGCAGCCTT AAGTAATAGA
ACTGAAGCAG CAGTAGCTTA TACAGCCAAA ATTAAAGAGA CTACAGCTGG AATACTTTCT
TATCAGCCTC TAAATGATGG TCTAGCAGCT GATTCTACTT TTGCTGCTGG TAACAACATC
ATCGCTGCAA GAAATTATCT ATCAACAATA GATAAAGATA CAGCTTCAAC TGCTGCAGGT
ATAGCTGCAA GTGTCGCAAC CATTACTTCA AATGGGGTGC CAACAACTGC TACTGCATCT
AAGACATTAA CTCTCACAAC TAATCAGGAT TCAGTCACTG GTGGTGCTGG TAACGACAAG
ATTAATGGAG TTATTGTTGG CGGTGCTACT GGTACGACCA TCCAAGCTGG TGACGTAATT
GATGGTGGAT CAGGTGTAGA TACTTTAAAC ATCGCTGTTT CTGGTGATGC AGCTACAACT
CTTAGCGGTG TTCAAGCAAC TGGCATTGAA AAGGTTTTAC TAACTAACTT TGATGGACCT
ACTGGTGTTA CAACCGTAGA TACAACTCTG CTCGGTGCAC CTTCAACAGT TGGTTTGAGT
TCTTCTGGAT CTGATGGAGA CACTGTTTTC CAAGGAATGA CTGTGTTGAC AAATGCAGAG
TTGAGAAATG GTGCGGCTGA CTTAACCCTT ACTCATACTT CAACTGCTGT ATCTGGCTCA
AGTGACGCAA TTACTCTTGA TGTGAGTAAT CTTTCTGGCG GTACCTTCAC AGCAGCTTCT
ATCGAAACTT TGACAATCAA CTCAACTCTG ACTCAAAGTC AGTTGACTGA TGTCGTAATC
GATAACGCTA CAGCATTAAA CGTAACTGGC TCTGCGAATC TTCAAATCGT TGGTGATGTA
GATTTTGCAG ACAATGCCAC AACAACAGCT ATTGATGGAA CTGTAGATGC ATCTGCATTC
ACCGGTAACC TTTCTATACA GCTCAATACA GGAGATATAG CTTCAGTAAC TGGTGGATCA
GGTGATGATT TCGTTGAGTT TTCTACTGGC TTTACAAGTG CTGATGTTGT TGATGGAGGC
GCCGGAACAG ATACTTTAAG CTTGGATTTG GGTAATGCCA CCCTTACAAG TACAACATTC
GCAAATGTAA GTAACGTTGA GGTTTTAGAA GTCAATCCAA CTAACGATTC GGCTGTAGTT
AATGCAGATG GAGTTGGTTC ATTTACAACT CTAAAAGCAG CTGGGCACAC AAAAACAGTT
CATGTGACAG GTTCTTTGAA CGCAGCTGGC GATGATTATT CATACTCTCT AAACGGTGTA
TCAACATCAA TTGCTTCCGC TACGGGCACC AATACTGACG CTGAAGTTGC TGCTGAACTA
GCTGCATCTA TCAACGCCTT GACAGGATTT ACTGCCACAG CTGTCGACGT AGACGGAACA
GGTGATGGTG TTTTTATTAC CAGAACATCT GACACCGGAG ACACAATCAA TTTCGACAGT
ATTACTGGTA CCGGTAATAT TGCTGTTGCA GAAGTTGGAT ACTCAAACGT TTCATTCACA
AATTTAACTG ATCAAACTGT AGAAATTAGT TCAGGGGCTC AAGTAACTGC TTCTTTGAAA
GATCCATCAG GAACTGCAGA TTCGCTAACC GTAAACCTAA CTTCACCATC TGGTGACAAG
GCTTTAACAC AAACAATTGA GCAAATTTCA GCTGGTGACA TTGAGACACT TACTATTAAT
ACCTCAGGCT TATCAGCTAA CACAGTTGAT TATGTAGTAT CTACTCTTAC TGACGGTGGT
ACAAACGCCC TGACTACTTT GAAGATCACT GGTGACTCTT CTCTTGATAT TGATGGCACA
ATAACTGCTT CAAAACTTGT CACTGTTGAT GCTTCTACTT TTACTGGAGA CTTGCAACTC
GATGGAGTAG CTGCTAATCA AACGATTACT ACTGGTTCAG GTGCTGATTC ATTAGTCTTC
GGTTCTAATC TAAACAATGC AGACACCGTT GATGGAGGAG CAGGCACAGA TACACTTTCT
GCAACTGTAA CGGGTTTAAC TGCTACGACC GGTGCTCTAA ATGTCTCTAA TGTAGAAACT
CTGAACCTAA CTAATGGCGG AGTATTTGTT TTTGACGCTT CCAAAGTTAC TGGAGCCAGT
GAAATTGCTG TTACAACAAA CACAACTTCA ACAACAATCA CTAATCTGGC TGCTGGAGTA
AAAGTTGGAG CAGGTCTCAA CAATACCGAT GGTGATGTAG ATGGTTTATT CGATATCAGT
CTTGCTGATT CCTCTGGAAC TTCAGATTCA CTTACGTTCA ACCTGAATGA CACAGATGGA
ACTACTCCAA ATACAAACAC TATTGAAGTA AAAGCAACTG GAATTGAGAC AGTTACATTT
GACGTCACTG ACGACACTGA CACCTCAAAT GCCAACACAA GTCTTGATGT TGACTCACTT
AATGCAGCAA AGATTGTTGT AGTCGGTTCA GCTGCTGATG CTGGACAGAC AATGACGCTT
AATACTCTTG ATACTGATAC AACAGCTGTT GATGCGACTG GTTATTTTGG AATACTAACT
GCTACTGCAG GTACTGCTAT AGCTACTACC TTTGACCTTA AAGGTGACAG AGCTCATAAC
ATTACTGGTT CATCTAAAAA TGACACCTTC ACTATCGGAG AAACCACTAA TGCCGATATC
ACTGTCAACG GTAATGGAGG AACTGATGTT CTTAACCTTA CCCTTGGTGA TGGAGATGCG
ATCACTGATA ATGTCTCAGA TGTTGACACT ATTAACCTGA TTATCTCAGG TTAG
 
Protein sequence
MTNATSTQLQ ELYVAYFGRA ADPTGLDYWT ERGITTAAFA ANMYAQPEFT SEYGTLSIES 
QVNQIYKNLF DRDADVAGLT YWSQQIRLGN LQLAEIANDL IWAAQNNSGS ADDKAALSNR
TEAAVAYTAK IKETTAGILS YQPLNDGLAA DSTFAAGNNI IAARNYLSTI DKDTASTAAG
IAASVATITS NGVPTTATAS KTLTLTTNQD SVTGGAGNDK INGVIVGGAT GTTIQAGDVI
DGGSGVDTLN IAVSGDAATT LSGVQATGIE KVLLTNFDGP TGVTTVDTTL LGAPSTVGLS
SSGSDGDTVF QGMTVLTNAE LRNGAADLTL THTSTAVSGS SDAITLDVSN LSGGTFTAAS
IETLTINSTL TQSQLTDVVI DNATALNVTG SANLQIVGDV DFADNATTTA IDGTVDASAF
TGNLSIQLNT GDIASVTGGS GDDFVEFSTG FTSADVVDGG AGTDTLSLDL GNATLTSTTF
ANVSNVEVLE VNPTNDSAVV NADGVGSFTT LKAAGHTKTV HVTGSLNAAG DDYSYSLNGV
STSIASATGT NTDAEVAAEL AASINALTGF TATAVDVDGT GDGVFITRTS DTGDTINFDS
ITGTGNIAVA EVGYSNVSFT NLTDQTVEIS SGAQVTASLK DPSGTADSLT VNLTSPSGDK
ALTQTIEQIS AGDIETLTIN TSGLSANTVD YVVSTLTDGG TNALTTLKIT GDSSLDIDGT
ITASKLVTVD ASTFTGDLQL DGVAANQTIT TGSGADSLVF GSNLNNADTV DGGAGTDTLS
ATVTGLTATT GALNVSNVET LNLTNGGVFV FDASKVTGAS EIAVTTNTTS TTITNLAAGV
KVGAGLNNTD GDVDGLFDIS LADSSGTSDS LTFNLNDTDG TTPNTNTIEV KATGIETVTF
DVTDDTDTSN ANTSLDVDSL NAAKIVVVGS AADAGQTMTL NTLDTDTTAV DATGYFGILT
ATAGTAIATT FDLKGDRAHN ITGSSKNDTF TIGETTNADI TVNGNGGTDV LNLTLGDGDA
ITDNVSDVDT INLIISG