Gene Information |
Locus tag | NATL1_07771 |
Symbol | |
ID | 4780841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 710036 |
End bp | 711589 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640084052 |
Product | GTPase SAR1 and related small G proteins |
Protein accession | YP_001014600 |
Protein GI | 124025484 |
COG category | [R] General function prediction only |
COG ID | [COG0486] Predicted GTPase |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
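The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 710036, End bp 711589.
gene_len = gene_length_bp(710036, 711589)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # prints "1554 517"
```

Both results match the Gene Length and Protein Length rows of the record.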
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.1398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0362439 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCC ATAACTCTAG GAAAATAATA TTTTATGTAT TGATAATTTT CATTAGCTTG ATAATTATAG GTCTGGTCGG AGCAATAATT AGATTGATAA ATATACCTGC CATATTAATT ACAGTATTAA TAATATGTGG TTTAAGTTAT ACAAAAAAAA TAGACTGGTT ACAAAATAGT CTAAGATCAA TATTTAAAAT AAAGGATGAG AAGAAATCAT TAGATTTATC GCTAATTAGC AAAAAAGAAG CCGCAGATAA ATCATTAAAA AGTATTGATC ATTTAATCAC ATTAATCAAT GACAAAGTCA AGGCCAAGGC CTTAAAGGAT GAAAAGGATA GGGTTTCATT AGAGTTAGAT AGAGGAGATA TTATTTTAGT AGTTTTTGGA ATTGGCTCAA GCGGTAAAAC TTCATTAATA AGAGCCCTAT TAAAAAAAAT AGTAGGTAAA GTTAGTCCTG AAATGGGATC AACGAGAGGG AAAGAAACCT TCCGACTGAA ACTTAAAGGA CTTACAAGAG GAATAAGAAT AATTGACACT CCTGGCATAC TGGAATCCGG GAGAGGGGGT AGAGAGAGGG AAAAAAGTGC GTTAATGGAA GCACGTAAAT CTGATTTAAT GTTAGTAGTA ATTGAAGGTG ATTTACGTTC TGAAGAAACA AGAACAATTA GGAGTTTGTC AAAATTAGGA AAAAGACTTT TACTTGTGCT AAATAAAATA GATTTAAGAG GAGAAAGTGA AGAAAAAAGA TTAATTGAGA TACTAAATTC TAGATGTAAT GATTTTATTG GTCCAAATGA CATTATTTGT ACATCAGCAT CACCTCAGAC AATTGCAGTC ACTGGCAGAA AGCCTTATCA ACCAGCCCCT GAAATCAATA GTTTAATTAG AAGATTAGCA AATATACTTC ATGAAGAAGG TGAAGAATTA ATTGCGGATA ATATTTTACT TCAATGCAGC AATATTGGAA AAGAAGGGAA AAATTTATTA ATCAAACAAA GAACTCAATC TGCTAAAAAA TGTATAGATA AGTATGGGTG GCTCAGCAGC GGTGCATTAA TACTAACTCC AGTTCCTGTC TTAGACATGA TCGCTGCAGC GGCTGTAAAT GCACAAATGG TAATAGAAAT AGCTAAAATA CATGGAGTTA AACTTACAAA TGAAAGGGCA AAGAATTTAG CGCTTTCGGT AGGAAAAATA CTTGCAACTA TGGGTATAGT TAAAGGTGGA GTTTCTCTAA TAAGTTCAAC ATTAAGTTTA TCACTACCAA CATTAGTTAT TAGCAAAGTA ATTCAAGGTA TTAGTGTATC TTGGCTTACT AGGATTGCTG GAGCAAGTTT CATTACTTAT TTCCAACAAG ATCAAGACTG GGGAGATGGA GGAATACAAG AAGTTGTTGA ATATCACTAC AACTTAAACA AAAGGGAGGA ATATTTTAAA AGTTTTATTC GGAGAGCTTA TGAGAGAGTT ATTGATCCGC TAGTTGAAAA GAATTTGAAA AAGCTACCAC CGAGATCAAG GCCTCCGAAG GAGGGGGACT CATCGGTCCT CTAA
|
Protein sequence | MKIHNSRKII FYVLIIFISL IIIGLVGAII RLINIPAILI TVLIICGLSY TKKIDWLQNS LRSIFKIKDE KKSLDLSLIS KKEAADKSLK SIDHLITLIN DKVKAKALKD EKDRVSLELD RGDIILVVFG IGSSGKTSLI RALLKKIVGK VSPEMGSTRG KETFRLKLKG LTRGIRIIDT PGILESGRGG REREKSALME ARKSDLMLVV IEGDLRSEET RTIRSLSKLG KRLLLVLNKI DLRGESEEKR LIEILNSRCN DFIGPNDIIC TSASPQTIAV TGRKPYQPAP EINSLIRRLA NILHEEGEEL IADNILLQCS NIGKEGKNLL IKQRTQSAKK CIDKYGWLSS GALILTPVPV LDMIAAAAVN AQMVIEIAKI HGVKLTNERA KNLALSVGKI LATMGIVKGG VSLISSTLSL SLPTLVISKV IQGISVSWLT RIAGASFITY FQQDQDWGDG GIQEVVEYHY NLNKREEYFK SFIRRAYERV IDPLVEKNLK KLPPRSRPPK EGDSSVL
|
| |
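The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a minimal sketch like the one below. It is not the database's own pipeline. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares; table 11 differs in the start codons it permits, which this sketch does not model. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
first_blocks = "ATGAAAATCC ATAACTCTAG GAAAATAATA"
print(translate(first_blocks))  # prints "MKIHNSRKII"
print(round(gc_percent(first_blocks), 1))
```

Passing the full Gene sequence value to `translate` and `gc_percent` gives a way to compare the result against the Protein sequence and GC content rows.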