Gene NATL1_21051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21051 
Symbol 
ID4781124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1750448 
End bp1756984 
Gene Length6537 bp 
Protein Length2178 aa 
Translation table11 
GC content43% 
IMG OID640085401 
Producthypothetical protein 
Protein accessionYP_001015925 
Protein GI124026810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.466354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACG CTACTGCTAC ACAACTGCAA CAGCTTTACG TTGCGTATTT CGGGCGCGCT 
GCCGATCCCA CTGGACTTGA TTACTGGGTG GGGCAGGGCA CTACTACTAA GTCTTTTGCG
GCAAGCATGT ATGCCCAAGA TGAATTTGAG TCTGTAAACG GTTCATTATC CGTTGAACTT
CAAGTCAACC AGATATATCA AAATCTTTTC GGACGCGACG GAGATACAGC TGGACTAACT
TATTGGGCAA ATCAAATAAG AACTGGTTCA TTAGAACTTG CTTCAATAGC TAATGACCTT
ATATATGCAG TTAATAATGG ATCGAGTGCA ACTGATCTAA CAGCCCTTAC AAATAAAACA
AACGCAGCAA CTAGTTATAC AGCTGATATA CGTGAGAGCT CTTCAGCTTT TCTTGCTTAC
CAGCCAAAAT CCTCTTCACC ATGGGTTACT GGTACTAATT TTGAAACAGC AAAAACTTTC
TTTAAGACAG TTACTGCTAC TAATGCCCCA AGTGCGGCAG AGGTTCAAGC AAGTGTAGAC
GTTATTGCTT CTAACAGCTC TAACGTAGCC ACATCAGATA CGGCATTCAC TCTTACAACT
GGTGTTGATG AACCTACAAC TACATATACA AAGTTTGATG CGTCTTTAAC TAGTGGTGGT
ACTCAGACAC TTGGTTCTCT AGACAAAATT AGTGGAACAA CAGGATCTTC TGACGTTTTA
AATGCAACCA TCAAGAGTAG CGTTACTCCA GCTTCAATAA CTGGAATCGA GACTATAAAT
GTTGTTGCTG ATGCAGCTGC AACACTTGGA TTGGTAAACG CGTCTGGATT TAATACAGTT
AATGCTGGTG GAGCTGCTGG TGCACTCACT ATTTCAGGAT TACCAACAAC AACTACAGTA
ACTATTACTG ACTCAGCAGT TGACCATACA CTTTCATTTA AAGAAGTTAG TGGTGAATCT
GATTCAGCAA CAATTTCTTT TGCTCAAGTT ACAGGTGGTG CAACTACTGA AATTAATATT
GCCGGAATTG AAACCATCAC AGCTACATCA GCTGGTGGAT CAGCAAGTAA CAGTTTTGAG
CTAGAGGCTG CAGCGTTAAC AACCTTAACT CTTGATGGAA GCGGAGATTT AACTCTTTCT
TCTCTAAATA CAGGTGGAAC AACAAGTCTA AAATCAATCG ACGCTAGTGC CATGACAGGT
GCACTAGATG TGACGACAGG AGCTTTGTCT CTTGGTACTT CAGCGCTTAC TATTTCAGGT
GGTTCTGGTA ACGATTCAAT AGATTCATCT GCTCATACAG GAACGGACGT TTCTATTTCA
GGTGGTGCTG GTAATGACAC AGTTACTCAC GTATTATCTA TAGCTGACAC GATTGCTGGT
GGTGCTGGAA CAGATAACTT AATCACTACA GCTGTAATTG CTTCTGCAGG AAATGTTTCA
GATTTTGAGA CATTCACATT GGATACTTCA GCCGGCAATC TTGATCAAGA TTTTGATCAT
TTGAGTGGTA ATACATTTAC CAAGTTAGTA GCTGATGGCG ATGGTACAAC TACTGATAAT
CCAACATTCA CGGATGCACC TTCAACAATT ACAGACCTAG AACTAACTGC TGATGCTTCT
GGAACTGTTG TTGTCGATCG TAAGACTGAC ACATCTGCTG ATGCTATTAC TATCACCGCT
AAAGGCGATA CAACAACCGC AGTTACGGTT AACGAAGAAG AGACAATAAC TTTCGCATCT
GACACTGCAG CATCAGTTAT TACAACTTTG AATGCTGCTG ACGCTACTAA GTTATCTGTT
ACTGGTAGTG CAGATGCAAC GATTTCAACA ATCAGTGGTA ACGATAATCT TGCAACTATT
GATGCTTCAG CTTCAACCGG TGCAGTAAGT ATTGGTGATA CTGCAAACGG CTCAGTAGTA
CCAATGACTA TCACTGCTGG TTCTGGTGGA TACACTGGTA GTGGTGGTAC GAAAGCTGAT
ACCTTTACTG GTGGCGCCGG TGATGACTCT ATCGATGGTG ATGCTGGAGA CGACAGTTTA
GTTGGTGCTG CAGGAGCTGA TGATTTGGAT GGAGGAGCAG GTAACGACGT TATTTCTGGT
GGTGCTGGTA ATGACACAAT TACTGCAGGT TCAGGTAATG ACAACCTTGA TGGTGGGGCA
GATGCTGACA CATTTACCTT AAGTACGAAT TACACCACAG ATGACACAAT TGTTGGTGGA
GCTGGAGAAG ACTCAGTTTC AATGACAATT GCAGGTGGAA CTACTACTTA CACAGCTGCA
TCTATCTCAG GAGTTGAAGT TTTTACGCTT ACTGGTTCTA CAGCTGCTAA TACTTATGAC
TTCAAAAACC TTACAGGTAT AACTGAAGTA ACAATCGCGG ATGGAGCTGC TTTTAATACA
ACTGTTCAAG GTCTAAATTC TGGAGTAATA GTCAACATTC TTGAACCTAC TGACATAACT
ACTATTGACA CAGCAAATGC TGCTTCAGTT TCTGTTGATG TTGAAGTTAG TAAAACAGGT
GGTTCAGTTG CAATTACTGA TACAGCTGCG GTAACGATTA CCGGTAAGGT TGCTACAGGT
GCGATGCAGG CTTTAGCTCT CGACGCAGTT GATACAACAT CACTTTCGTT AAAAACCATA
CTGACCCATG ATTTAAATAC TGGAGCTATT ACTAACACTG ATAAAGTAAC TACCTTTAAC
TTAACCACTG CGGCACACAC TGGAACGATC GATACTACGA CCTTTGCTGA TGCGACAAGC
CTAGAGACAA TTACTGCTAC TGGTGTTGGT GGAGATATAA CCACCACCAC AATTGGTAAT
GGTGGAGCTG CTAATGATCT ATCAACTATC AATGCAACAG CAACAGTTGG ATCTGTCATT
ACTTTCGGTG CAATAACTGC TGATACAACT GACTCGGTTA CAGACAATGC AATGGTTATC
ACAACCTCTG CAACTGGAGT CGATACAGGA AACGGAAATT CAGCTGTTGT ATTTGGTGCT
GTTAACAACC AATACGGCAC AATTACTCTT ACGAACTCTG GTGATACAAC TGACGGAACA
ACTACTGGTG ATTTAACTGC TGTTGATGTA ACCGTTACTT CATCTGGTGG TGACATCACA
ATCGGTGATG TTTCTGCCTC TGAAGATTTC AGTCTTACAG TTACTGGTGG AACAGCAACA
GTTACTGACT TCGACGCTGA GACTTTCTCG TTGATCGATG CGAGTGCCTC AACTGGCAAG
TTAACTGTTA CTGGTACTTC ATCCGATGGG AATATTACTG TTCTCGGTGG TTCAGCTGGT
GATGATATTA CCCTTGCTAC TGCTATTGCA ACCGATAAAG TTCATTCAAT TTCTGGTGGT
GGCGGTGCTG ACACAATTCT TGGATCAGCT GGTGCTGAAA CAATCATTGG TGGTGCAGGT
AACGACTCAT TAGATGGTGC TGCTGGAAAT GACAGTATTT CTGGTGGAGC AGGAAATGAT
ACTTTCAGTT TTGCTGCAAC TGGTCAGCTA AATAATGGTG ATACCATCGA CGGTGGTGAT
GGTACTGACA ATCTTGATGT CACAACTGCA GTTGATTTAG ATAGTTCTTC AACTGCATCT
ACCTCAGTAA CTCTCACGTC TATAGAAACC GTAGATCTAA CTTTAGGTGC AGACAATTTA
AGTTTTGATG CCTCTGGCTG GGCCTCTTTG ACAGGTTTGG ATATCGCAGC TAGTGCTGAC
ACTTACACCC CTACGGTTAA TAACCTCCGA ACTGGAGTGA CAGTTACTCT TGGGAATACA
TTCGTTGAAG AAGCGACTCT TGATACAGTT GCAGGAGCTG ATCTAACGAT TGATGCTGAG
ATCACAGATC TTGTAAGTGT TACCATTACT GATGCAGAAA ACGTCACTAT TAATGGTGAG
GGTGCAGCTG CTGATTTAGT TTCTCTTGTT CTTGATGCTA CTGATACTAA GACACTTACT
CTTACAGCAG ACGACGGCAC AGCATTAGAT ACTGGTTCTA TTACTGGTAC CGATGAAATC
ACAACTATTA CCGCTACAAC AACAGTTGCT AGTGGAACGA TCACTCAGGG TGGTGGAACT
ATCGCTGACG TTGACGGTCT AACAACACTA AACCTAAGTG CTACCAACGC AAGCCTAACT
CTTGGTGCGA TGGGTACTGA TGCAACAGCA AATAATGCTG AGCTTTTAAG CACTATTACA
ACATCAGCAA CTGGAACAGG TGTTGTACTA ACAACTGGAG CTCTTTACGC CGACAGCACA
GTTGATAGCA CCACTGACTT GGCAATGACC ATCAACTCAA CAACTAACGT TGGTGCTCAG
ACAGTATTAG GTGCCATTGA CAATACATAT GGTTCTATCA CTGGTACATT TGTCAGTCAC
TCATCAGATG ACACCGATGT AGGCAACTTA ACTGCCGTTG ACATGACTCT TACAGTCTCT
GGTGGTGGAG ATACTGATTT CCAAGACTTA ATTGCTTCAG GTACTGTTGC TGTCACAGCT
TCTGGTTCAG GTAATCTGAC TATCGATGAT GCAAACATTG ATGGAACAAG TGCTTCTTTA
AGCGTTGATG CCTCCGCTAT GTCTGGAACT GTTAGTGCCG TTGCTACTAA CACAAGCGTT
GCTACTACCT TGACAGGTGG TTCTGGCGCA GACACTCTGA CAGGTGGTTC AGTCGCTGAC
ACAATCACTG GTGGAGCAGG TGGAGACACC ATTACTACAA ATGCTGGTGA TGACACAGTT
GCTGGTGGCG GCGGTAATGA CACAATTACT GGTGGTGCAG GAGACAATGT AATCACCGGT
GGTGCTGGTA ATGACAGCAT CACTGCTGGT GCAGGTTTCG ACAATATCGA CTCTGGAGCC
GGTGACGACA CTATTGTCTT TGCTGCAAAT ATGAGTCTCT CGGATACTGT TGCTGGAGGA
GATGGCAATG ACACAATTAC TGCAACACTA AGCAGTGGTT CAGCTTCCCA AAAGGTTGGT
ACTCTCTCAG GTGTTGAGAA TTTCACCTTA GATTTTGCCC AAACAGCTGG TAGTTTTGAT
TCAACTAATG CAACTACAGC TGCCTATACC GTTACAAATG CTGCTGACGC TAAGCTCATT
GATATGGATA ACCTGGCAAC AGGAAGCACC ATCAAGATAA CAGCTGCTTT TGATGGCTTA
GCTCTTGATT ATGCAGACGA TGCCACTGCT GCTATTACAT TTGACGACAA TGGTGCTGCA
GCTAATTTTG ATAACGACAC TACATTTGCC GTCACTGATG CCCAAACAGT CAACATAACT
ATTTCGGGTG GAGAAACATA CACACAGACA GGAGCAACGA CTTTAGATGC TACTGATACT
GATTATGTAA CTATCACCGG TGATGCTTCC AGTACGATCC TTACTCATAC TGGTGCTATT
TCAGCAGACA ATGCAGTAAC TTTCTCTCTT GTCTCAACGG ATGGAGCTGC ACTTGATTTC
AACTCAACTG GATTAGCTAC AGCTGATCTC CTTACGACTT TCTCTGCTTC TACTACAGAC
GGTGTTGATG CTGCTGATAT CACCTTCGAA AGTTTCGTTG TTGGTAACGC TGCTGCAGCT
GCCAAGCTTA CAAGCATTGA TTTGGACGCA TCTCATGCTG GTGATATTAC TGCTGGTGCA
TTTGATGCAG CAGGAGCAAC AATCACAAGT ATCACTATGG ACTCGGCTAC TTCTGGGTCT
GTCATAGACA TCGCTGGTCA AATTACTGCT GACTCTGTTA CTAACATCAC AACTTCAGCT
GTATCTGGAG GTTCTGTTGA TTTCTCCGGT GCACTAGTTA TAAGCACAGT TGGAACTTTG
AAGCATACAG GTGCTGGTAA TTTCACTATG GACTCAACCT CTACTGCGAT TACTACTCTT
GAGCGTTTAG ATCTAACAGG TGCAACTGGT ACAAATACAG TTGATATCTC TGGCAATACC
AACGCAACGA CAATTAATCT AGGTACAGGT ACGAATACAA TTCTTCTTAC CGGTGCCGCA
GATGACGTGA ATCTTTCATC TACAGCCGCA ACTGACACAT TGACTTATGG TGCGAGCACT
TCTGTTCCAT CAATTTCTAA CTTTGCTTTC GGCTCTGGAG CTGACATTAT CAACATCGAC
CTTTCTGAAA TTGAGGCAGC TTTAACTATC GACCTTCATG ACGGTAATGC TGCTGATGTT
AATGCTGCTG AAACTTTCTC AATCAAAGAG ATTTCTGCTG ATACTACACT TGCCGCTGGT
GATGATGTTC TTGTCTTAGT TGGTTCAACT TTCGCTACTA CCGACCTAGT TGAAACAGCG
ATTGAAACTG GTGATTTTGA ACTTACTTAT GGAACTGCTC TAGCAGATAA CGACGGTACA
TTACTCTTCT ACTCTGATGG AACTGATATG TACTTAGCTG TAAGTAAAGC GGGTGATACT
CCAACAGCAA ACTTAGCTAC CGGCGAAACA ACTGTTTCTA ATTTGATGAA GTTTGAAGGA
GTAACTAGTA TTTCTGCTGG TGATATTACT GCTGGTGATA TTGTTCTTGT TGCATAA
 
Protein sequence
MANATATQLQ QLYVAYFGRA ADPTGLDYWV GQGTTTKSFA ASMYAQDEFE SVNGSLSVEL 
QVNQIYQNLF GRDGDTAGLT YWANQIRTGS LELASIANDL IYAVNNGSSA TDLTALTNKT
NAATSYTADI RESSSAFLAY QPKSSSPWVT GTNFETAKTF FKTVTATNAP SAAEVQASVD
VIASNSSNVA TSDTAFTLTT GVDEPTTTYT KFDASLTSGG TQTLGSLDKI SGTTGSSDVL
NATIKSSVTP ASITGIETIN VVADAAATLG LVNASGFNTV NAGGAAGALT ISGLPTTTTV
TITDSAVDHT LSFKEVSGES DSATISFAQV TGGATTEINI AGIETITATS AGGSASNSFE
LEAAALTTLT LDGSGDLTLS SLNTGGTTSL KSIDASAMTG ALDVTTGALS LGTSALTISG
GSGNDSIDSS AHTGTDVSIS GGAGNDTVTH VLSIADTIAG GAGTDNLITT AVIASAGNVS
DFETFTLDTS AGNLDQDFDH LSGNTFTKLV ADGDGTTTDN PTFTDAPSTI TDLELTADAS
GTVVVDRKTD TSADAITITA KGDTTTAVTV NEEETITFAS DTAASVITTL NAADATKLSV
TGSADATIST ISGNDNLATI DASASTGAVS IGDTANGSVV PMTITAGSGG YTGSGGTKAD
TFTGGAGDDS IDGDAGDDSL VGAAGADDLD GGAGNDVISG GAGNDTITAG SGNDNLDGGA
DADTFTLSTN YTTDDTIVGG AGEDSVSMTI AGGTTTYTAA SISGVEVFTL TGSTAANTYD
FKNLTGITEV TIADGAAFNT TVQGLNSGVI VNILEPTDIT TIDTANAASV SVDVEVSKTG
GSVAITDTAA VTITGKVATG AMQALALDAV DTTSLSLKTI LTHDLNTGAI TNTDKVTTFN
LTTAAHTGTI DTTTFADATS LETITATGVG GDITTTTIGN GGAANDLSTI NATATVGSVI
TFGAITADTT DSVTDNAMVI TTSATGVDTG NGNSAVVFGA VNNQYGTITL TNSGDTTDGT
TTGDLTAVDV TVTSSGGDIT IGDVSASEDF SLTVTGGTAT VTDFDAETFS LIDASASTGK
LTVTGTSSDG NITVLGGSAG DDITLATAIA TDKVHSISGG GGADTILGSA GAETIIGGAG
NDSLDGAAGN DSISGGAGND TFSFAATGQL NNGDTIDGGD GTDNLDVTTA VDLDSSSTAS
TSVTLTSIET VDLTLGADNL SFDASGWASL TGLDIAASAD TYTPTVNNLR TGVTVTLGNT
FVEEATLDTV AGADLTIDAE ITDLVSVTIT DAENVTINGE GAAADLVSLV LDATDTKTLT
LTADDGTALD TGSITGTDEI TTITATTTVA SGTITQGGGT IADVDGLTTL NLSATNASLT
LGAMGTDATA NNAELLSTIT TSATGTGVVL TTGALYADST VDSTTDLAMT INSTTNVGAQ
TVLGAIDNTY GSITGTFVSH SSDDTDVGNL TAVDMTLTVS GGGDTDFQDL IASGTVAVTA
SGSGNLTIDD ANIDGTSASL SVDASAMSGT VSAVATNTSV ATTLTGGSGA DTLTGGSVAD
TITGGAGGDT ITTNAGDDTV AGGGGNDTIT GGAGDNVITG GAGNDSITAG AGFDNIDSGA
GDDTIVFAAN MSLSDTVAGG DGNDTITATL SSGSASQKVG TLSGVENFTL DFAQTAGSFD
STNATTAAYT VTNAADAKLI DMDNLATGST IKITAAFDGL ALDYADDATA AITFDDNGAA
ANFDNDTTFA VTDAQTVNIT ISGGETYTQT GATTLDATDT DYVTITGDAS STILTHTGAI
SADNAVTFSL VSTDGAALDF NSTGLATADL LTTFSASTTD GVDAADITFE SFVVGNAAAA
AKLTSIDLDA SHAGDITAGA FDAAGATITS ITMDSATSGS VIDIAGQITA DSVTNITTSA
VSGGSVDFSG ALVISTVGTL KHTGAGNFTM DSTSTAITTL ERLDLTGATG TNTVDISGNT
NATTINLGTG TNTILLTGAA DDVNLSSTAA TDTLTYGAST SVPSISNFAF GSGADIINID
LSEIEAALTI DLHDGNAADV NAAETFSIKE ISADTTLAAG DDVLVLVGST FATTDLVETA
IETGDFELTY GTALADNDGT LLFYSDGTDM YLAVSKAGDT PTANLATGET TVSNLMKFEG
VTSISAGDIT AGDIVLVA