Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21981 |
Symbol | |
ID | 4779290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1860996 |
End bp | 1862855 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640085496 |
Product | hypothetical protein |
Protein accession | YP_001016018 |
Protein GI | 124026903 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.544475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAG AATTAAGCGA TTTTATTGAA GCCTCAGGTC TACTTTCGTA TGACCCGGTG GCTATTGACT CAATCTACAA AAAAAATCCA ATTCGTCTAC TTCGAAGACT CTGGCAGACA CTTTTACCAA TTGGATTATT TTTACTAGGT GTTGGATGGG AAAAGTTGAT TGGTTCTCTA GACAAAGATG AAAGAAAAAC TTTTAGGGCG AAAGAATTTA CAAACCTACT GGTAGATCTA GGACCAGCAT TCATTAAAGC TGGACAAGCA CTCTCAACTA GGCCAGACAT TGTTCCGCCA ACTGTTTTAG AAGAATTAGC ACAACTTCAA GATCAACTAC CTGGCTTTGA AAGTAAGCTC GCAATGGCTT GTATTGAACA AGATTTAGAT AATAAAATAG AGAACATATT TGAAGAGATA GATAAAGAAC CGATTTCAGC TGCATCTTTA GGTCAAGTAC ATAAAGCAAA ACTCAAAAGT GGAGAACAAG TTGCTGTAAA AATTCAAAGA CCAGGATTAA GAGAACAAAT AACTTTAGAT TTATACATAG TAAGAAATAT AGCAATCTGG TTTAAAAATA ATATAGGGAT TATTAGAAGT GATCTAGTTG CATTAATAGA CGAACTAGGT AAGAGAATAT TTGAAGAAAT GGACTACATT AATGAAGCAA ATAATGCAGA AAAATTTAAA GAACTTCATA GCGGAAATGA TAAAATTGCA GTACCAAAAA TTTACAGAAA AGCAACAAGT AGAAGAGTGC TTACAATGGA ATGGATCGAT GGGACAAAAT TAACAAATAT AGAAGCTGTA AAAAATTTAG GAATTAATCC AAATGAGATG GTGGAAATTG GAGTTAGTTG TAGTCTTCAG CAACTTATTG AGCACGGTTT TTTTCACGCC GATCCACACC CAGGGAACAT CTTGGCAATG AAAGATGGAC GATTATGTTA TTTAGATTTT GGAATGATGA GTGATATTAC GCAACAATCT AGGGTTGGAT TAATTAGAGC AGTTGTTCAT CTTGTCAACA GGAGATTTGA TAAACTTTCT AATGATTTTG TTCAACTAGG ATTTCTTTCT GAAGATGTTG ATTTAACTCC AATTGCGCCA GCATTTGAAA GTGTATTTAC CACCGCTTTG GAAATAGGCG TAAACAAAAT GGACTTTAAA GCCGTAACAG ATGATATGTC TGGAATTATG TATAAATTTC CTTTTAAATT ACCACCTTAT TATGCACTAA TAATCAGATC TTTAATTACT TTAGAAGGCA TAGCACTAAG TGTTGACCCT AATTTTAAAA TACTAGGAGC TGCATATCCT TATTTCGCAA GAAGACTAAT GGAAGATGAA AATCCAGAAT TAAGAAATAG TTTAAAAGAA ATGCTTTTTG ATGAAAATAC TCTAAAATTA GAGAGATTAG ACGATCTTCT AAAAAGCGCT ACAAAGGAAA AACAACTTGA TGGCGAAAAA ATATTAGATC AAACAATTGA TTTTTTATTT TCTAAAAAGG GTCTTGTTCT TAGAAATGAA TTAGTAAATA TCCTAGCTTC AAAAATAGAC TCTGTTGGTT GGAAAACTAT AATTAGATTA AATGAAAAGT TACCGTCGAA AATTCGTTCA AAAACAATAG AAAAATCATA TAAAATTAAT AATAAACAAC TTTTAAATAT ATCTTCAATA AAGAAAATAT TTAAAGTGTC AAAAATGAAA TCTGGTTTTA AAAGAAAAAT ATTTTTTAAA AAATTACCAA GAATTCTAAT AACTAAAGAT ACGTATCGGA TGGGCTTAGG GTTAATGCAA AAAACCTCAG AAAAAGGGAT AATTAGGTTA GTCAAAGTTG CCGCTGGCGT AAGGCAATAA
|
Protein sequence | MAEELSDFIE ASGLLSYDPV AIDSIYKKNP IRLLRRLWQT LLPIGLFLLG VGWEKLIGSL DKDERKTFRA KEFTNLLVDL GPAFIKAGQA LSTRPDIVPP TVLEELAQLQ DQLPGFESKL AMACIEQDLD NKIENIFEEI DKEPISAASL GQVHKAKLKS GEQVAVKIQR PGLREQITLD LYIVRNIAIW FKNNIGIIRS DLVALIDELG KRIFEEMDYI NEANNAEKFK ELHSGNDKIA VPKIYRKATS RRVLTMEWID GTKLTNIEAV KNLGINPNEM VEIGVSCSLQ QLIEHGFFHA DPHPGNILAM KDGRLCYLDF GMMSDITQQS RVGLIRAVVH LVNRRFDKLS NDFVQLGFLS EDVDLTPIAP AFESVFTTAL EIGVNKMDFK AVTDDMSGIM YKFPFKLPPY YALIIRSLIT LEGIALSVDP NFKILGAAYP YFARRLMEDE NPELRNSLKE MLFDENTLKL ERLDDLLKSA TKEKQLDGEK ILDQTIDFLF SKKGLVLRNE LVNILASKID SVGWKTIIRL NEKLPSKIRS KTIEKSYKIN NKQLLNISSI KKIFKVSKMK SGFKRKIFFK KLPRILITKD TYRMGLGLMQ KTSEKGIIRL VKVAAGVRQ
|
| |