Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08631 |
Symbol | |
ID | 4780845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 793950 |
End bp | 795752 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640084138 |
Product | hypothetical protein |
Protein accession | YP_001014686 |
Protein GI | 124025570 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.143582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000750178 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAAAAGAC TATTAATAAC ATTATATGAT ATTGGCCTGA AAAGGCTTTT TCAAAGATTA TTATATGATT TCAAAAAATC TATAAATAAA CTTTTACCAT ATCATTTATT TAATAACTCA AAAAGATTTA ATCCTCATTT TTTAGACTTG CATCAAGAGT TGAATACATT TAAATATTCC CCTTCACATT TTAATAAATA TATACCAAAA AAAATAGAAT TTACTTTTAT AAATAAAAAG AGATTACTAG ATTTCCCTCT GAAATGGAAT TCGTCTGAGT GGGGAAGATT ATGGCAATTT AATTTACACT ATTTTGATTG GTCACGTTCT ATTCTAGAAG AATCTATTAT AAATAATAAA TGGACCGATG AATCAATAAT ATTAGAATAT CTAATAGATA ATTGGATAAA TTCTAATCTA CCAGGAAGAG GTGATGGCTG GAACAGTTAT ACTCTATCTC TAAGAATAAG AAATTGGATA TGGATTTTTA GAACATGCCC TAGTTTTATT AATCAGAAAA GAGTTTATTC TCTTTGGTAT CAAATATGTT GGTTACGAAA TCACCCTGAA GAATGTCATG GAGGCAATCA TTGGATTGAA AATCTTATAA CTTTGGTGAT AGGTAGTCTT CAATTCAATG AAGTAAGATC TAAAGAAATC TACAGATACT CTTTATACAA ATTAAAAAAT GAACTTGAAA ATCAAATTTT ATTAGATGGT GGACATGAAG AAAGAAGTGC TAGTTATCAT TTATTGATCT TGGATAGATT AGTTGAATTA GGTTGTGTTT TAGATAGTGT CAATGGATAT AGGCCTATAT GGCTACTAAA ATCGATTGAA TCTATGAATA AATGGTTAAA AATAACTTCA ATATCTAATA AAAGGCTTCC ACAGTTTAAT GATTACTCAA TTGACAGTAA TTTAGATCTA AATATAGTCA TATCTTTTGC AGATTCTTAT TTAAACAAAA CTAATTATCT AAGCAAAGGT TTTAGATCAA AACTTCTATC CAATTATCCA AAGGATACAC ATATAGAAAC CTCATGTTTA AAACCAAATA CTATTGAGAT ACCCTCAATC GTTGATCTAA AAGAAACTGG TTGGGTATTA TTGCGTCCAG ATAAAAACTG GACACTTGCA TTTAAATCTG GGAAAGCATG CCCAAACCAT CTTCCCGCCC ATGTTCATTC TGATATCTTA AGTTTCGATT TATTCAAAAA TGGAATCCCA ATATTTGTAT CAGCTGGCAC TAGTGAATAT GGAAATTCAA AAAGAAGGTT CTATGAACGT TCGGGTCAAG CACACAATAT CTTACAAATT GGTACTAGGA AGTATGGTAA TTATAATAAA ATAAATTGGA TTGAAGGTAT AGATGTATGG GGATCTTTTA GAGCTGGTAA GAAATCAATG CCAACTTATC GAAAAAGTAA ACAATTAAAA AATGGAGCAC TTTATACATC AGGTATTTAT GACACATATC AAAGATATGA AGCCTTTCAT AAACGCTCTA TACAAATGAG AATTGATAAA TCAAACAACC TTATATTTTT ATTAAAAGAT ATAATAAAAA CGGAAAATCC CATTTTTATA AGACAATGGT GGCATCTAGG TGTTGACGCC GATGAAACTT TGCTAGAAAA GATAGCTACT CAGCTCATTA AAAATAAGAA TTTGAAAGCA GAATATATCA ATACATATTA CTCTTCAGAA TTGGGGAAAA AAGTAAAAAG GAGAAGTTTA TCAATCACAG GACCAATATC AGATAAGCAT ACTGTATTAT CAGTTAAACT AAATATAAAA TAG
|
Protein sequence | MKRLLITLYD IGLKRLFQRL LYDFKKSINK LLPYHLFNNS KRFNPHFLDL HQELNTFKYS PSHFNKYIPK KIEFTFINKK RLLDFPLKWN SSEWGRLWQF NLHYFDWSRS ILEESIINNK WTDESIILEY LIDNWINSNL PGRGDGWNSY TLSLRIRNWI WIFRTCPSFI NQKRVYSLWY QICWLRNHPE ECHGGNHWIE NLITLVIGSL QFNEVRSKEI YRYSLYKLKN ELENQILLDG GHEERSASYH LLILDRLVEL GCVLDSVNGY RPIWLLKSIE SMNKWLKITS ISNKRLPQFN DYSIDSNLDL NIVISFADSY LNKTNYLSKG FRSKLLSNYP KDTHIETSCL KPNTIEIPSI VDLKETGWVL LRPDKNWTLA FKSGKACPNH LPAHVHSDIL SFDLFKNGIP IFVSAGTSEY GNSKRRFYER SGQAHNILQI GTRKYGNYNK INWIEGIDVW GSFRAGKKSM PTYRKSKQLK NGALYTSGIY DTYQRYEAFH KRSIQMRIDK SNNLIFLLKD IIKTENPIFI RQWWHLGVDA DETLLEKIAT QLIKNKNLKA EYINTYYSSE LGKKVKRRSL SITGPISDKH TVLSVKLNIK
|
| |