Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08691 |
Symbol | |
ID | 4779546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 804488 |
End bp | 806257 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 640084144 |
Product | hypothetical protein |
Protein accession | YP_001014692 |
Protein GI | 124025576 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.109178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000878377 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGAAAAAC TAAATAAATA TAAAAAGACT ATTACATTTG AGAAATTAAT TAAAGAGAAT AAGATTACAC AAAAGATGAT ATCAACACTA GCAAGCAATT ACTCTAAGGA ATTGATTAGT TCTAAAATAC TAAAGAGAAT AATGAGAAAG TCAAAGATAG AAAAGTATAA TTATATTAAA AAAGCATTAT TAAATTATAC TGCAATAATC GTCCATGATT CATTTGGATT ATCCTTAATT TTTGATGCCC TTATAAGAAA TGGGATGCAA AAAGCGATCT TTTATTGCAA TAGAGGTGGT TTTCAAATAA AAGATTTGTA TGATACAAAT TATTTTTTAG ACAACAATTT ATATTACAAA CGACATCTAC AATATTTATG TAACACAAAT GGAATAGCTT TTAAAGGTAT AAAGAAAGTT AGCTTAGGGA ATTTTTCTAT TAAGATAAGA GATTGGACAA TATATGTTTA TCGCTTTATA ATACTATCCT TAAGATGTAT TAAAAATAAT GAAAAGATAA AGAGGCAAAA ACTGAATGCT ATTCATCTAA TAAGGTCTGA AGTAGAGCTT TATTCTTCAG AGCCTATTAT CAAAGAAACA ACGGCTAGAG GAGACAACTA TATTTATATT GTAGATGATT TAATGAAGTT TCCAACATGT ACAAAAGTAA TTAAGAGTAA AGATTATAAC TGGTTATCAA TTCATAGCTT TACCAATTTT AAAGATATTT ATATAACTTT TATTAAAGTG ATATGGATCC TTAAAAACAT AGAGAGTTTT AATAAAAAGA TGATACCAAA TACAAATATG AGTAAATATG GTTTCTTAGG AGAATCAAAT CAAATCAAAT CGATATTATA CAAAATTTTA CTGAACTCAT TACCAGAAAT AATAATTCAT GAAAAGCAAT TAAAAAAAGT ATTAAATTTA TTAAAGCCAA AATATATTGT CTCGTACGAT CAAATAGATA AATATGGGGC AGTCCAAGGA TCTGTAGCAA AAGAAAATTC TATAGGTTCA GTAATGATAC AAACAACAGC GATTGATGAT ATTAAATATC CATATCCCCT TAGCATGGAC AATATGATTG TATCTTCAGA AAAAGTAAAA GATATTTTAT TGTCGTCAGG AGCCAAAAAA AATAAAATAC ATGACTTTGG TCTTCCAAGT CTGTATGGAA TCAAGAGTAA AGGTGATAAG AAAATAGAGG AATTATTAAA TAAGAGAGAT AATCAATTAA TTATTTTAAT AGCAACACAG CCGTTTGTCT CTGATATAAA TTACAACGAT TTGTTAGTTA ACAATGTTAT CAATACATTG GCAAAAAGCA CCTATAATAT AAAAATAGTG ATAAAGCCAC ATCCTCGAGA GGCAAAGCAA AAAAATTACA TTGAAAAGCA ATCAATTCCG AAACTACATA TCGTAACTAA TTATGATAAA TTTGAAAATT TACTTAAAAA AGCAGACATA GTTATATCAA GGACTTCAAC TGTTATTCAG ACATCAATTA TTGGTGGTGT ACCTCCAATT TCCTATTTAG AGATGTATCC TTCTGAAATA ATCAATAGGC TAGACTATCT TGAATCTAAA GCTACATATA AATGCTTAAC AAAAGAACAA TTAAGTTATA TATTAAGTGA ATATATCTCA AAAGAAAGAA GAATAGATAA ATTAAAAGAA TTCAAGAAAA ATCGGAATAG ATATATAAAT AAACAGTTTA AAGGTAATAA TAGTATCGAT AAAACAATGA ATCTTTTAGA AAATATATAA
|
Protein sequence | MEKLNKYKKT ITFEKLIKEN KITQKMISTL ASNYSKELIS SKILKRIMRK SKIEKYNYIK KALLNYTAII VHDSFGLSLI FDALIRNGMQ KAIFYCNRGG FQIKDLYDTN YFLDNNLYYK RHLQYLCNTN GIAFKGIKKV SLGNFSIKIR DWTIYVYRFI ILSLRCIKNN EKIKRQKLNA IHLIRSEVEL YSSEPIIKET TARGDNYIYI VDDLMKFPTC TKVIKSKDYN WLSIHSFTNF KDIYITFIKV IWILKNIESF NKKMIPNTNM SKYGFLGESN QIKSILYKIL LNSLPEIIIH EKQLKKVLNL LKPKYIVSYD QIDKYGAVQG SVAKENSIGS VMIQTTAIDD IKYPYPLSMD NMIVSSEKVK DILLSSGAKK NKIHDFGLPS LYGIKSKGDK KIEELLNKRD NQLIILIATQ PFVSDINYND LLVNNVINTL AKSTYNIKIV IKPHPREAKQ KNYIEKQSIP KLHIVTNYDK FENLLKKADI VISRTSTVIQ TSIIGGVPPI SYLEMYPSEI INRLDYLESK ATYKCLTKEQ LSYILSEYIS KERRIDKLKE FKKNRNRYIN KQFKGNNSID KTMNLLENI
|
| |