Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21051 |
Symbol | |
ID | 4781124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1750448 |
End bp | 1756984 |
Gene Length | 6537 bp |
Protein Length | 2178 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640085401 |
Product | hypothetical protein |
Protein accession | YP_001015925 |
Protein GI | 124026810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.466354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAACG CTACTGCTAC ACAACTGCAA CAGCTTTACG TTGCGTATTT CGGGCGCGCT GCCGATCCCA CTGGACTTGA TTACTGGGTG GGGCAGGGCA CTACTACTAA GTCTTTTGCG GCAAGCATGT ATGCCCAAGA TGAATTTGAG TCTGTAAACG GTTCATTATC CGTTGAACTT CAAGTCAACC AGATATATCA AAATCTTTTC GGACGCGACG GAGATACAGC TGGACTAACT TATTGGGCAA ATCAAATAAG AACTGGTTCA TTAGAACTTG CTTCAATAGC TAATGACCTT ATATATGCAG TTAATAATGG ATCGAGTGCA ACTGATCTAA CAGCCCTTAC AAATAAAACA AACGCAGCAA CTAGTTATAC AGCTGATATA CGTGAGAGCT CTTCAGCTTT TCTTGCTTAC CAGCCAAAAT CCTCTTCACC ATGGGTTACT GGTACTAATT TTGAAACAGC AAAAACTTTC TTTAAGACAG TTACTGCTAC TAATGCCCCA AGTGCGGCAG AGGTTCAAGC AAGTGTAGAC GTTATTGCTT CTAACAGCTC TAACGTAGCC ACATCAGATA CGGCATTCAC TCTTACAACT GGTGTTGATG AACCTACAAC TACATATACA AAGTTTGATG CGTCTTTAAC TAGTGGTGGT ACTCAGACAC TTGGTTCTCT AGACAAAATT AGTGGAACAA CAGGATCTTC TGACGTTTTA AATGCAACCA TCAAGAGTAG CGTTACTCCA GCTTCAATAA CTGGAATCGA GACTATAAAT GTTGTTGCTG ATGCAGCTGC AACACTTGGA TTGGTAAACG CGTCTGGATT TAATACAGTT AATGCTGGTG GAGCTGCTGG TGCACTCACT ATTTCAGGAT TACCAACAAC AACTACAGTA ACTATTACTG ACTCAGCAGT TGACCATACA CTTTCATTTA AAGAAGTTAG TGGTGAATCT GATTCAGCAA CAATTTCTTT TGCTCAAGTT ACAGGTGGTG CAACTACTGA AATTAATATT GCCGGAATTG AAACCATCAC AGCTACATCA GCTGGTGGAT CAGCAAGTAA CAGTTTTGAG CTAGAGGCTG CAGCGTTAAC AACCTTAACT CTTGATGGAA GCGGAGATTT AACTCTTTCT TCTCTAAATA CAGGTGGAAC AACAAGTCTA AAATCAATCG ACGCTAGTGC CATGACAGGT GCACTAGATG TGACGACAGG AGCTTTGTCT CTTGGTACTT CAGCGCTTAC TATTTCAGGT GGTTCTGGTA ACGATTCAAT AGATTCATCT GCTCATACAG GAACGGACGT TTCTATTTCA GGTGGTGCTG GTAATGACAC AGTTACTCAC GTATTATCTA TAGCTGACAC GATTGCTGGT GGTGCTGGAA CAGATAACTT AATCACTACA GCTGTAATTG CTTCTGCAGG AAATGTTTCA GATTTTGAGA CATTCACATT GGATACTTCA GCCGGCAATC TTGATCAAGA TTTTGATCAT TTGAGTGGTA ATACATTTAC CAAGTTAGTA GCTGATGGCG ATGGTACAAC TACTGATAAT CCAACATTCA CGGATGCACC TTCAACAATT ACAGACCTAG AACTAACTGC TGATGCTTCT GGAACTGTTG TTGTCGATCG TAAGACTGAC ACATCTGCTG ATGCTATTAC TATCACCGCT AAAGGCGATA CAACAACCGC AGTTACGGTT AACGAAGAAG AGACAATAAC TTTCGCATCT GACACTGCAG CATCAGTTAT TACAACTTTG AATGCTGCTG ACGCTACTAA GTTATCTGTT ACTGGTAGTG CAGATGCAAC GATTTCAACA ATCAGTGGTA ACGATAATCT TGCAACTATT GATGCTTCAG CTTCAACCGG TGCAGTAAGT ATTGGTGATA CTGCAAACGG CTCAGTAGTA CCAATGACTA TCACTGCTGG TTCTGGTGGA TACACTGGTA GTGGTGGTAC GAAAGCTGAT ACCTTTACTG GTGGCGCCGG TGATGACTCT ATCGATGGTG ATGCTGGAGA CGACAGTTTA GTTGGTGCTG CAGGAGCTGA TGATTTGGAT GGAGGAGCAG GTAACGACGT TATTTCTGGT GGTGCTGGTA ATGACACAAT TACTGCAGGT TCAGGTAATG ACAACCTTGA TGGTGGGGCA GATGCTGACA CATTTACCTT AAGTACGAAT TACACCACAG ATGACACAAT TGTTGGTGGA GCTGGAGAAG ACTCAGTTTC AATGACAATT GCAGGTGGAA CTACTACTTA CACAGCTGCA TCTATCTCAG GAGTTGAAGT TTTTACGCTT ACTGGTTCTA CAGCTGCTAA TACTTATGAC TTCAAAAACC TTACAGGTAT AACTGAAGTA ACAATCGCGG ATGGAGCTGC TTTTAATACA ACTGTTCAAG GTCTAAATTC TGGAGTAATA GTCAACATTC TTGAACCTAC TGACATAACT ACTATTGACA CAGCAAATGC TGCTTCAGTT TCTGTTGATG TTGAAGTTAG TAAAACAGGT GGTTCAGTTG CAATTACTGA TACAGCTGCG GTAACGATTA CCGGTAAGGT TGCTACAGGT GCGATGCAGG CTTTAGCTCT CGACGCAGTT GATACAACAT CACTTTCGTT AAAAACCATA CTGACCCATG ATTTAAATAC TGGAGCTATT ACTAACACTG ATAAAGTAAC TACCTTTAAC TTAACCACTG CGGCACACAC TGGAACGATC GATACTACGA CCTTTGCTGA TGCGACAAGC CTAGAGACAA TTACTGCTAC TGGTGTTGGT GGAGATATAA CCACCACCAC AATTGGTAAT GGTGGAGCTG CTAATGATCT ATCAACTATC AATGCAACAG CAACAGTTGG ATCTGTCATT ACTTTCGGTG CAATAACTGC TGATACAACT GACTCGGTTA CAGACAATGC AATGGTTATC ACAACCTCTG CAACTGGAGT CGATACAGGA AACGGAAATT CAGCTGTTGT ATTTGGTGCT GTTAACAACC AATACGGCAC AATTACTCTT ACGAACTCTG GTGATACAAC TGACGGAACA ACTACTGGTG ATTTAACTGC TGTTGATGTA ACCGTTACTT CATCTGGTGG TGACATCACA ATCGGTGATG TTTCTGCCTC TGAAGATTTC AGTCTTACAG TTACTGGTGG AACAGCAACA GTTACTGACT TCGACGCTGA GACTTTCTCG TTGATCGATG CGAGTGCCTC AACTGGCAAG TTAACTGTTA CTGGTACTTC ATCCGATGGG AATATTACTG TTCTCGGTGG TTCAGCTGGT GATGATATTA CCCTTGCTAC TGCTATTGCA ACCGATAAAG TTCATTCAAT TTCTGGTGGT GGCGGTGCTG ACACAATTCT TGGATCAGCT GGTGCTGAAA CAATCATTGG TGGTGCAGGT AACGACTCAT TAGATGGTGC TGCTGGAAAT GACAGTATTT CTGGTGGAGC AGGAAATGAT ACTTTCAGTT TTGCTGCAAC TGGTCAGCTA AATAATGGTG ATACCATCGA CGGTGGTGAT GGTACTGACA ATCTTGATGT CACAACTGCA GTTGATTTAG ATAGTTCTTC AACTGCATCT ACCTCAGTAA CTCTCACGTC TATAGAAACC GTAGATCTAA CTTTAGGTGC AGACAATTTA AGTTTTGATG CCTCTGGCTG GGCCTCTTTG ACAGGTTTGG ATATCGCAGC TAGTGCTGAC ACTTACACCC CTACGGTTAA TAACCTCCGA ACTGGAGTGA CAGTTACTCT TGGGAATACA TTCGTTGAAG AAGCGACTCT TGATACAGTT GCAGGAGCTG ATCTAACGAT TGATGCTGAG ATCACAGATC TTGTAAGTGT TACCATTACT GATGCAGAAA ACGTCACTAT TAATGGTGAG GGTGCAGCTG CTGATTTAGT TTCTCTTGTT CTTGATGCTA CTGATACTAA GACACTTACT CTTACAGCAG ACGACGGCAC AGCATTAGAT ACTGGTTCTA TTACTGGTAC CGATGAAATC ACAACTATTA CCGCTACAAC AACAGTTGCT AGTGGAACGA TCACTCAGGG TGGTGGAACT ATCGCTGACG TTGACGGTCT AACAACACTA AACCTAAGTG CTACCAACGC AAGCCTAACT CTTGGTGCGA TGGGTACTGA TGCAACAGCA AATAATGCTG AGCTTTTAAG CACTATTACA ACATCAGCAA CTGGAACAGG TGTTGTACTA ACAACTGGAG CTCTTTACGC CGACAGCACA GTTGATAGCA CCACTGACTT GGCAATGACC ATCAACTCAA CAACTAACGT TGGTGCTCAG ACAGTATTAG GTGCCATTGA CAATACATAT GGTTCTATCA CTGGTACATT TGTCAGTCAC TCATCAGATG ACACCGATGT AGGCAACTTA ACTGCCGTTG ACATGACTCT TACAGTCTCT GGTGGTGGAG ATACTGATTT CCAAGACTTA ATTGCTTCAG GTACTGTTGC TGTCACAGCT TCTGGTTCAG GTAATCTGAC TATCGATGAT GCAAACATTG ATGGAACAAG TGCTTCTTTA AGCGTTGATG CCTCCGCTAT GTCTGGAACT GTTAGTGCCG TTGCTACTAA CACAAGCGTT GCTACTACCT TGACAGGTGG TTCTGGCGCA GACACTCTGA CAGGTGGTTC AGTCGCTGAC ACAATCACTG GTGGAGCAGG TGGAGACACC ATTACTACAA ATGCTGGTGA TGACACAGTT GCTGGTGGCG GCGGTAATGA CACAATTACT GGTGGTGCAG GAGACAATGT AATCACCGGT GGTGCTGGTA ATGACAGCAT CACTGCTGGT GCAGGTTTCG ACAATATCGA CTCTGGAGCC GGTGACGACA CTATTGTCTT TGCTGCAAAT ATGAGTCTCT CGGATACTGT TGCTGGAGGA GATGGCAATG ACACAATTAC TGCAACACTA AGCAGTGGTT CAGCTTCCCA AAAGGTTGGT ACTCTCTCAG GTGTTGAGAA TTTCACCTTA GATTTTGCCC AAACAGCTGG TAGTTTTGAT TCAACTAATG CAACTACAGC TGCCTATACC GTTACAAATG CTGCTGACGC TAAGCTCATT GATATGGATA ACCTGGCAAC AGGAAGCACC ATCAAGATAA CAGCTGCTTT TGATGGCTTA GCTCTTGATT ATGCAGACGA TGCCACTGCT GCTATTACAT TTGACGACAA TGGTGCTGCA GCTAATTTTG ATAACGACAC TACATTTGCC GTCACTGATG CCCAAACAGT CAACATAACT ATTTCGGGTG GAGAAACATA CACACAGACA GGAGCAACGA CTTTAGATGC TACTGATACT GATTATGTAA CTATCACCGG TGATGCTTCC AGTACGATCC TTACTCATAC TGGTGCTATT TCAGCAGACA ATGCAGTAAC TTTCTCTCTT GTCTCAACGG ATGGAGCTGC ACTTGATTTC AACTCAACTG GATTAGCTAC AGCTGATCTC CTTACGACTT TCTCTGCTTC TACTACAGAC GGTGTTGATG CTGCTGATAT CACCTTCGAA AGTTTCGTTG TTGGTAACGC TGCTGCAGCT GCCAAGCTTA CAAGCATTGA TTTGGACGCA TCTCATGCTG GTGATATTAC TGCTGGTGCA TTTGATGCAG CAGGAGCAAC AATCACAAGT ATCACTATGG ACTCGGCTAC TTCTGGGTCT GTCATAGACA TCGCTGGTCA AATTACTGCT GACTCTGTTA CTAACATCAC AACTTCAGCT GTATCTGGAG GTTCTGTTGA TTTCTCCGGT GCACTAGTTA TAAGCACAGT TGGAACTTTG AAGCATACAG GTGCTGGTAA TTTCACTATG GACTCAACCT CTACTGCGAT TACTACTCTT GAGCGTTTAG ATCTAACAGG TGCAACTGGT ACAAATACAG TTGATATCTC TGGCAATACC AACGCAACGA CAATTAATCT AGGTACAGGT ACGAATACAA TTCTTCTTAC CGGTGCCGCA GATGACGTGA ATCTTTCATC TACAGCCGCA ACTGACACAT TGACTTATGG TGCGAGCACT TCTGTTCCAT CAATTTCTAA CTTTGCTTTC GGCTCTGGAG CTGACATTAT CAACATCGAC CTTTCTGAAA TTGAGGCAGC TTTAACTATC GACCTTCATG ACGGTAATGC TGCTGATGTT AATGCTGCTG AAACTTTCTC AATCAAAGAG ATTTCTGCTG ATACTACACT TGCCGCTGGT GATGATGTTC TTGTCTTAGT TGGTTCAACT TTCGCTACTA CCGACCTAGT TGAAACAGCG ATTGAAACTG GTGATTTTGA ACTTACTTAT GGAACTGCTC TAGCAGATAA CGACGGTACA TTACTCTTCT ACTCTGATGG AACTGATATG TACTTAGCTG TAAGTAAAGC GGGTGATACT CCAACAGCAA ACTTAGCTAC CGGCGAAACA ACTGTTTCTA ATTTGATGAA GTTTGAAGGA GTAACTAGTA TTTCTGCTGG TGATATTACT GCTGGTGATA TTGTTCTTGT TGCATAA
|
Protein sequence | MANATATQLQ QLYVAYFGRA ADPTGLDYWV GQGTTTKSFA ASMYAQDEFE SVNGSLSVEL QVNQIYQNLF GRDGDTAGLT YWANQIRTGS LELASIANDL IYAVNNGSSA TDLTALTNKT NAATSYTADI RESSSAFLAY QPKSSSPWVT GTNFETAKTF FKTVTATNAP SAAEVQASVD VIASNSSNVA TSDTAFTLTT GVDEPTTTYT KFDASLTSGG TQTLGSLDKI SGTTGSSDVL NATIKSSVTP ASITGIETIN VVADAAATLG LVNASGFNTV NAGGAAGALT ISGLPTTTTV TITDSAVDHT LSFKEVSGES DSATISFAQV TGGATTEINI AGIETITATS AGGSASNSFE LEAAALTTLT LDGSGDLTLS SLNTGGTTSL KSIDASAMTG ALDVTTGALS LGTSALTISG GSGNDSIDSS AHTGTDVSIS GGAGNDTVTH VLSIADTIAG GAGTDNLITT AVIASAGNVS DFETFTLDTS AGNLDQDFDH LSGNTFTKLV ADGDGTTTDN PTFTDAPSTI TDLELTADAS GTVVVDRKTD TSADAITITA KGDTTTAVTV NEEETITFAS DTAASVITTL NAADATKLSV TGSADATIST ISGNDNLATI DASASTGAVS IGDTANGSVV PMTITAGSGG YTGSGGTKAD TFTGGAGDDS IDGDAGDDSL VGAAGADDLD GGAGNDVISG GAGNDTITAG SGNDNLDGGA DADTFTLSTN YTTDDTIVGG AGEDSVSMTI AGGTTTYTAA SISGVEVFTL TGSTAANTYD FKNLTGITEV TIADGAAFNT TVQGLNSGVI VNILEPTDIT TIDTANAASV SVDVEVSKTG GSVAITDTAA VTITGKVATG AMQALALDAV DTTSLSLKTI LTHDLNTGAI TNTDKVTTFN LTTAAHTGTI DTTTFADATS LETITATGVG GDITTTTIGN GGAANDLSTI NATATVGSVI TFGAITADTT DSVTDNAMVI TTSATGVDTG NGNSAVVFGA VNNQYGTITL TNSGDTTDGT TTGDLTAVDV TVTSSGGDIT IGDVSASEDF SLTVTGGTAT VTDFDAETFS LIDASASTGK LTVTGTSSDG NITVLGGSAG DDITLATAIA TDKVHSISGG GGADTILGSA GAETIIGGAG NDSLDGAAGN DSISGGAGND TFSFAATGQL NNGDTIDGGD GTDNLDVTTA VDLDSSSTAS TSVTLTSIET VDLTLGADNL SFDASGWASL TGLDIAASAD TYTPTVNNLR TGVTVTLGNT FVEEATLDTV AGADLTIDAE ITDLVSVTIT DAENVTINGE GAAADLVSLV LDATDTKTLT LTADDGTALD TGSITGTDEI TTITATTTVA SGTITQGGGT IADVDGLTTL NLSATNASLT LGAMGTDATA NNAELLSTIT TSATGTGVVL TTGALYADST VDSTTDLAMT INSTTNVGAQ TVLGAIDNTY GSITGTFVSH SSDDTDVGNL TAVDMTLTVS GGGDTDFQDL IASGTVAVTA SGSGNLTIDD ANIDGTSASL SVDASAMSGT VSAVATNTSV ATTLTGGSGA DTLTGGSVAD TITGGAGGDT ITTNAGDDTV AGGGGNDTIT GGAGDNVITG GAGNDSITAG AGFDNIDSGA GDDTIVFAAN MSLSDTVAGG DGNDTITATL SSGSASQKVG TLSGVENFTL DFAQTAGSFD STNATTAAYT VTNAADAKLI DMDNLATGST IKITAAFDGL ALDYADDATA AITFDDNGAA ANFDNDTTFA VTDAQTVNIT ISGGETYTQT GATTLDATDT DYVTITGDAS STILTHTGAI SADNAVTFSL VSTDGAALDF NSTGLATADL LTTFSASTTD GVDAADITFE SFVVGNAAAA AKLTSIDLDA SHAGDITAGA FDAAGATITS ITMDSATSGS VIDIAGQITA DSVTNITTSA VSGGSVDFSG ALVISTVGTL KHTGAGNFTM DSTSTAITTL ERLDLTGATG TNTVDISGNT NATTINLGTG TNTILLTGAA DDVNLSSTAA TDTLTYGAST SVPSISNFAF GSGADIINID LSEIEAALTI DLHDGNAADV NAAETFSIKE ISADTTLAAG DDVLVLVGST FATTDLVETA IETGDFELTY GTALADNDGT LLFYSDGTDM YLAVSKAGDT PTANLATGET TVSNLMKFEG VTSISAGDIT AGDIVLVA
|
| |