Gene Information |
Locus tag | NATL1_18431 |
Symbol | |
ID | 4780604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1505182 |
End bp | 1506504 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640085132 |
Product | GTPase SAR1 and related small G proteins |
Protein accession | YP_001015663 |
Protein GI | 124026548 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
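The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the reported Gene Length agrees with that reading) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (NATL1_18431 on NC_008819).
length = gene_length_bp(1505182, 1506504)
print(length)                     # 1323
print(protein_length_aa(length))  # 440
```

Both results match the Gene Length (1323 bp) and Protein Length (440 aa) rows.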
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.491382 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTACC CAAGAGATAT AGCTACAAAA TGCAAGTTTT TACTTGGTCA ATGGAAAGAG AACTTAAATC TCACTAATTA CGAAAGAACA AAATTTGAAG ACACTTTAAA TCAACTTGAT TTTCAAATAA ATAAATTAGA GAAAAAAGAA CTACAGATAT CAGTGCATGG CAGAGTAGGA GTTGGTAAAT CAAGCTTATT AAATGCATTA ATTGAAAAGC AAATATTTCC AACTGATATA ATTAACGGTA ATACAAAAAC CAGTAAATCT TATAAATGGG ACGAAAGGTT TCAAGGATTA AATAAGGTTG ATCTAATCGA CTCTCCTGGC ATAGATGAAA TAAATAATTC TAATAAAGAA GAAATAAATT TTAATACTGT CCTAGACACA GATTTAATTC TTTATGTAAT TGATAGTGAT ATAACGAGAG TCGACATGAA CTCCATTGAA GATCTATTAA GGCATAACAA ACCAATACTA ATAGTCTTAA ATCGTTGTGA TCAATGGAAT AGAAGAGAAA CAAAACTAAT ACTCTCAAGT GTTCATAGGA AATTATCATT TTGTAAACAA AAGGTTAAAA TTGCTCTAGT ATCTTCATCT CCAAGGAAAG CAAAAATAAA ACCAGACGGA ACTATTAGGA GTGAGAAAAC AATCCCTAAA GTTGGTATTC TCAAGAATGA ACTTAAAGAT ATTATCGACA AAAGTGGTGA ATTTTTTCTT TGTATAAATA CTTTAAGAAT TGCAGACCGA CTCTACAACT TACTCAAAGA GAATCGACTA CTGAAAAAGA AAAAAGAAGC ACAAAATTTA ATCGGCAGAT ATGCAACTTT AAAAGCCTCA GGGGTAGCAC TTAATCCCTT CTTAATGATT GATCTTATTA CCGGTCTAGC TTTTGATAGT TCTCTTATTA TTCAACTAAG TAAATTATAT GGGTTAGAAG TAGGTGGCCC CACCGCAAGG CAATTAGTAA AAAAGCTTAG TTTCCAAAAT TCATTACTAG GGGGTGCGCA GATAGGAATA CAAATTACCT TAAATATTCT CAAGCAAATA ATGATATTTG CAGCACCTCT TACTGGAGGA TTAAGCCTTG CGCCCACTGC TCCTATAGCC ATTGCTCAAG CTGCTCTTGC TATTCATGCG ACAAAACTTA TAGGTCGCCT CGCAGCTTAT AAATTTCTAA TTGGGACAAG TAGGAACGAT GGCAGGCCTC GATTAATGTT GAACTATCTT CTCAAAAACA ACTCAGACTT TAGAATAATG ATTGGTGACT TTAAATTTCT TACATCAAGT ACGGAAAAAA ATAAAAATTA TTTGTTGCCA TGA
|
Protein sequence | MIYPRDIATK CKFLLGQWKE NLNLTNYERT KFEDTLNQLD FQINKLEKKE LQISVHGRVG VGKSSLLNAL IEKQIFPTDI INGNTKTSKS YKWDERFQGL NKVDLIDSPG IDEINNSNKE EINFNTVLDT DLILYVIDSD ITRVDMNSIE DLLRHNKPIL IVLNRCDQWN RRETKLILSS VHRKLSFCKQ KVKIALVSSS PRKAKIKPDG TIRSEKTIPK VGILKNELKD IIDKSGEFFL CINTLRIADR LYNLLKENRL LKKKKEAQNL IGRYATLKAS GVALNPFLMI DLITGLAFDS SLIIQLSKLY GLEVGGPTAR QLVKKLSFQN SLLGGAQIGI QITLNILKQI MIFAAPLTGG LSLAPTAPIA IAQAALAIHA TKLIGRLAAY KFLIGTSRND GRPRLMLNYL LKNNSDFRIM IGDFKFLTSS TEKNKNYLLP
|
| |
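As a rough illustration of how the Sequence fields relate to the GC content and Translation table rows, the sketch below strips the 10-character grouping from a sequence, computes percent GC, and translates codons until the first stop. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, not in these assignments). The helper names are hypothetical, and only the first 30 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGATTTACC CAAGAGATAT AGCTACAAAA"
print(translate(first_30))  # MIYPRDIATK
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `gc_percent` and `translate` would be the way to compare against the reported GC content and the full 440-residue protein.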