Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_05211 |
Symbol | |
ID | 4780761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 473614 |
End bp | 475317 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640083796 |
Product | secreted protein MPB70 precursor |
Protein accession | YP_001014348 |
Protein GI | 124025232 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.851281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.199605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAG TTCCAATTCA AATAATGGAC TTAACAACCC TTAAAGCGGT TGTATTTGAA CTTAGCCAAG ACATAATTCC AAGTCGATTT GAGACTGCAC AGCAAATTGA TTCACATACC ATTCAATTAG GACTTAGAAC CCTAGAAAAA CTAACATGGA TCGAAATAAG TTGGCTTCCT GAGTCCCCAA TAATTGTATC AATACCGCCT CCAAAAAGAT ATGGCGAGAA AAGTACATTA GCTAAACAAA TAAAACATTT ATTAGTCAAT TTAGCTTTAG TAGATATAAC GCAAATTGGT TTTGAAAGAA TAGTAAGGTT TAAATTTTCT AGTAGACCGG GTGAAGAAAT AGAGAAAGAA TTAATAGTCG AACTGATGGG TAGACACAGC AACATTCTAC TTTTAGACAG ATCAGGTAAA GTAATTACTC TTGGGAAGCA AATCAAAGAG AGTCAATCAA GATTAAGGCC AATAGGAACA GGAGATATCT ATACATCTCC TCCACCTCTT AAAGGCTTAG TTCCTGTATT ATCTGAAGCA TTCAATTCAT GGAAAGGGAA TATATGTTTA GTACCATCAA CATTTAAGAA TAGCCTTAAA GATACTTATC AAGGAATAAG TCCTGCTCTA ACATTACAAA TTGTGAGCAG TGATTATAAC GAATCACTTA AAATAATAAA TAAACCAGTA ACAAGTATTG AATTAGAAAC TTGGGAAGCA ATATATAAAA GATGGAAAGA GTGGTTATTA GATCTTGAGA ATAGTAATTA CACTATTAAC TTCGAAGGCC CAACAGATTA TATTGTCTGG GGAAAAAAAG AATCAACAGC TAAAAACAAA AAAATAGGAC TTAGCCTTAG CATGTATTAT TCGAATAAAC TCGTAGAAAG AAAAATAAAT TCTATTAGAG AAAAATTGAA ACAAGATCTG GCAAATTGTA AAGGCCATGA AAAAAGAAAG CTAGATGTTC AGGAATTACT AATTAAAAAT ATTTCTGAAT ATATAACCAT GCAAAATAAG GCTAAAAGCT TACTTACTTT ACCTTCTCCT ACAAAGAAGC AAATAATTGA AGCTCAAAGT CTTTTTAAAG AAGCTAAAAG AAAAAAAAGA TCTCGAGAAT CAATTGTCAA TAGAATCAAT TTTCACAAGA AAAAATTATC AGAAATTCAA TGCTGTGAAT CATTTCTTGA TTCATTTATA TATGAAGAAA ATGAGGATAA TAAAAACAAG CTAGAATCAA TTATTGAGCT TAAAGAAGAA GTAGAAGAAT ATATTTGCAT CAAAAAAAAT AATTCTAAAT TTAAATTAAA GAGAAAAAAA GAAAATTCAT TAAATATAAA AGAGATTCAG AGTCCTAGCG GTCTAAAAAT TCAAATAGGA GGTAATAACA GGCAAAACGA ATTAATCAGT TTAAAGAAAG GGAAAAAAGG AGATCTTTGG TTTCATGCTC AAGAAATACC AGGAAGCCAT GTTGTACTGA AATCATCTGA TGGACTTGTG GATGAGACAG ATATTCAATT AGCTGCTGAT CTGGCTTCTT TTTTTAGCCG TGCGAGAGGC AACAAACTTA CACCAGTAAA TATGGTTCCA ATTGAGAATC TAAAAAGGCT ATCAGGATCA CTACCCGGGA CAGTTAGTCA TAGGGGTGGG AAGGTCCTGT GGGGAAAAGC TGAGAGAGCT GCGAAATATT TCTACCAGAA GTGA
|
Protein sequence | MNKVPIQIMD LTTLKAVVFE LSQDIIPSRF ETAQQIDSHT IQLGLRTLEK LTWIEISWLP ESPIIVSIPP PKRYGEKSTL AKQIKHLLVN LALVDITQIG FERIVRFKFS SRPGEEIEKE LIVELMGRHS NILLLDRSGK VITLGKQIKE SQSRLRPIGT GDIYTSPPPL KGLVPVLSEA FNSWKGNICL VPSTFKNSLK DTYQGISPAL TLQIVSSDYN ESLKIINKPV TSIELETWEA IYKRWKEWLL DLENSNYTIN FEGPTDYIVW GKKESTAKNK KIGLSLSMYY SNKLVERKIN SIREKLKQDL ANCKGHEKRK LDVQELLIKN ISEYITMQNK AKSLLTLPSP TKKQIIEAQS LFKEAKRKKR SRESIVNRIN FHKKKLSEIQ CCESFLDSFI YEENEDNKNK LESIIELKEE VEEYICIKKN NSKFKLKRKK ENSLNIKEIQ SPSGLKIQIG GNNRQNELIS LKKGKKGDLW FHAQEIPGSH VVLKSSDGLV DETDIQLAAD LASFFSRARG NKLTPVNMVP IENLKRLSGS LPGTVSHRGG KVLWGKAERA AKYFYQK
|
| |