Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08351 |
Symbol | |
ID | 4780599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 767008 |
End bp | 768603 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640084110 |
Product | hypothetical protein |
Protein accession | YP_001014658 |
Protein GI | 124025542 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0702953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.19264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGCGC CTTCACTGTT AGGCGAGTCA TTGGCATTAC AGCTAACATC TCAAGATGAT AATTTAGAAA TAATCCTAGA CAAAAAAGAT ATAAATGGGT TACCTAAACT TATTCTATTT TGCCTTGAAG AAGTAGAACT CTCAAACTCA ATCAAACTAG AAATTCATAA GTTAAAAGAA AGATGGGAGC AATCTCCTGT CCTAATTGTA ATACCAAAAA GTATTAAATT ATCTTCTGAT GATTTGATGA CTTTTGGTAG TGAAGGAGTT ATTCAAGATC CTACTGTTGA ACTTTTAAGA GATACAATCA ATATTTTGAT TGGCGGCGGC AGAGTATTTA AAATTAATAA TGAAACAAAT TACAATGCTG ACTCAATTCA TAATTCATAT GGCCTAGGAC ATTGGCTATT AACAAGTGGT TTATCACAAA TAAATAAAGA TCTATACACT ATAGATCAAA TAATAGCCAA GAAATCAACA AATACATTTT ATCTTTTCAT ATTAATAGGT AGAAGAAGAG AACTATTAAC AGCGAAGAGA TTAATTATCT GGTTATGGGG GCCGCTAGAG GTATTAATAG AGTCTCCAAT TAAAAGTAAT AACAATAAAA ATATAATCAA CAAATACAAT ACCGACATAA CAATAAAAAA TACCTCAACT AATGAATTAT GGAATGTTAT TTATAAACGG GTAAAGGAGA GACTTCAAGA TGACCTTACA AATTCTACGG GCGAGCTAGC AGCTCTTTAT TCATTAAATA AAAGCAAACG ATATAATCTT TTAAAAACTC TTCTGAAAGA ATTTTCAACT ATTATAATCA AACTTGATTC TAAAGATAAT AGAGAAAAAG GATTAGAGGA GATTTTACAA TCAATTACTC CTGAATTACG AGCTAATACA TTGCGCAATT TCATAGATTC ATATGATCGT TTAAAAAAGA ATGGTGTTGA CGTTTTTATT TCAGACTTTC TAGTACATAA TGCAGATCTC GGAATACTTG ATGATGAACT ACCATCAATA GCATTAATAA TAGATCCAAT ATTAAATAAT AAGCCGCTTC TTATGGATGG AGACTATTTA TCAATAGAAG ACCCTCGATC TATTATTCAA TTAGAAACAT TTATTCTAAA TTGGATATTC AGGTCAGCTG AAATAGTTAG TGAAGAGATT ATATCTTCAT GTTCTGAATG GCCAGAATTA CGTAAATACT TTCTAAATAA AGAATTAGTT TCAACAAGGG AACTTGAACG CAAACGAAAT CATATCAATA CAAATAATCA ACTTCAAAAT CTATTTAAAA AGCCGGTTAG ATTATATGAA AGTAAAAGAT TATATTATAC AGTCAAAAAC AATAACATTG AAAAAATTAT CACTCTTGAA CCTAGAGATG ATGAATTAAA GAAACTAGAC TGGCCCCAAA GGCAAATAGC ATTTATAATA GAATTAAGAG ATGCCTTGGC ACCACAAGTA CAGGCAATAA TTCAATACTT AGGTGATTTA ATAGTTCTAA TCCTCACTAA AGTCGTGGGA AGATCTATAG GATTAATTGG TAGAGGTATC GCTCAAGGTA TGGGAAGAAA CTTATCCAAA GGATAA
|
Protein sequence | MIAPSLLGES LALQLTSQDD NLEIILDKKD INGLPKLILF CLEEVELSNS IKLEIHKLKE RWEQSPVLIV IPKSIKLSSD DLMTFGSEGV IQDPTVELLR DTINILIGGG RVFKINNETN YNADSIHNSY GLGHWLLTSG LSQINKDLYT IDQIIAKKST NTFYLFILIG RRRELLTAKR LIIWLWGPLE VLIESPIKSN NNKNIINKYN TDITIKNTST NELWNVIYKR VKERLQDDLT NSTGELAALY SLNKSKRYNL LKTLLKEFST IIIKLDSKDN REKGLEEILQ SITPELRANT LRNFIDSYDR LKKNGVDVFI SDFLVHNADL GILDDELPSI ALIIDPILNN KPLLMDGDYL SIEDPRSIIQ LETFILNWIF RSAEIVSEEI ISSCSEWPEL RKYFLNKELV STRELERKRN HINTNNQLQN LFKKPVRLYE SKRLYYTVKN NNIEKIITLE PRDDELKKLD WPQRQIAFII ELRDALAPQV QAIIQYLGDL IVLILTKVVG RSIGLIGRGI AQGMGRNLSK G
|
| |