Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_00831 |
Symbol | |
ID | 4779188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 83844 |
End bp | 85352 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640083346 |
Product | hypothetical protein |
Protein accession | YP_001013912 |
Protein GI | 124024796 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.752054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTGGT CTCTTGCAAA TATTGAACAA CTGAGTGTGA GTGGAACAAA CGCTTTAGGT GTAAGTTCTC ATTTGGAAAT AAATCAAGGC GATAGTAATG ACAAATATCA GCTTTATTAC ACAGGTGATG GAGGAGTCAC TGTCAATAGT ATGTCTACTG ATCTTGAGCT TACTAAGCAG GGAGAAATTA AATTAATTCA AGATCTCACA ATCATTACTA CTAATGAAGG TATACGTAGG GCTTATTATG TAGAAGTTGA TCCGAATACC GGCAATCATG AAATTCTTAC TGCATTGCTA TCAGAGGATG GATTAACTCT TTCACAAGCC ACTAGAACAG GAATATTAGA CAACGGAGAC AACGCTTGGG GAGTTCCTGA CTCAGTAGTT CTTCCAGATG GAAGAATCAG AATATACTGG GTTAGTACAG ATAATACAGA TTCTGAATGG ACCGTTCCAC AAGATGATGA AATTATAAAA AATGATGGTT TTAAGACCCC AAATGGAGAA TGGGTCTCGG CAAATGAATA TTTTGAAAAC GACAGAAAAA TACCTGATAG TGCAACTAAA ACACTTGCTA ATGAAGTGAT ACTTAGTGCA ACTTCTGATA CATCAAAGGG TACAGAATTT ACTGTAGATG AGGGCTATAG GACTGAAGGA GGGTATGTCG ACTTTGAGGT TTTAAAAGCT AAAGAAAATG ATTGGTTAGC GATAATGTCT TCTTCGCCAG TAACGATCCC TGATGAACCT CAGGGCATTT ATATAGGAAT TTCAAGTGAT GGCCTTAGTT GGGAAATAGA TGATAATAAT TTAGCTCCTC TTGAAAGGAG TTATCTCGAT CCTACAGGAC TCTTATTATC TAATACACCC AATAAATATC AGATTGTCAT GAGCTCATCG CTATCAATAC TTGGCGATAG AGAATATACA TTAGTAACAG CTGAGCTTAC ATCAGCTACT ACCACATATT TAGGCAATAG CTTTGATTAT AATTTCTTTA ATATTGGTAA TGGAGTATAT GGAATAAGAC CAGATTCTAC TGGAACTATC GATTCATTGA CTGGAATTTC AAATATTCAA TTCGATGATA AAAAACTAAA CATCACATCT GATATTAAAG CGACATTTGA TCAGGTCACA GGTTTAAATA CAGACTCAGG TGAGATGTTC CGTCTCTATA ACGCTGCTTT CGCACGCTTC CCTGATGCTG ATGGTTTGAA GTATTGGATT GAGCAATTTA GTTCTGGGAA AAATACAAGA CGAGTTGTTG CTCAATCTTT TTTAGGTTCT GCAGAGTTCA CTGAGAAATA TGGAAGCAAT GTAAGTGATG AGACATACGT GAATAACCTC TATAAAAATG TCCTTGGAAG AGACGCTGAT GCAGAGGGGC TTAACTACTG GGTAGGCAAT CTCAGTAATG GAATTGAAAC TCGATACGAA GCGCTCCTAG GGTTTTCAGA GTCAGAAGAG AACAAAGCGC TCTTTACAGA AATGACAGGT TTTGGATAA
|
Protein sequence | MTWSLANIEQ LSVSGTNALG VSSHLEINQG DSNDKYQLYY TGDGGVTVNS MSTDLELTKQ GEIKLIQDLT IITTNEGIRR AYYVEVDPNT GNHEILTALL SEDGLTLSQA TRTGILDNGD NAWGVPDSVV LPDGRIRIYW VSTDNTDSEW TVPQDDEIIK NDGFKTPNGE WVSANEYFEN DRKIPDSATK TLANEVILSA TSDTSKGTEF TVDEGYRTEG GYVDFEVLKA KENDWLAIMS SSPVTIPDEP QGIYIGISSD GLSWEIDDNN LAPLERSYLD PTGLLLSNTP NKYQIVMSSS LSILGDREYT LVTAELTSAT TTYLGNSFDY NFFNIGNGVY GIRPDSTGTI DSLTGISNIQ FDDKKLNITS DIKATFDQVT GLNTDSGEMF RLYNAAFARF PDADGLKYWI EQFSSGKNTR RVVAQSFLGS AEFTEKYGSN VSDETYVNNL YKNVLGRDAD AEGLNYWVGN LSNGIETRYE ALLGFSESEE NKALFTEMTG FG
|
| |