Gene Information |
Locus tag | NATL1_02361 |
Symbol | |
ID | 4779559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 218139 |
End bp | 219725 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640083501 |
Product | hypothetical protein |
Protein accession | YP_001014065 |
Protein GI | 124024949 |
COG category | |
COG ID | |
TIGRFAM ID | |
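
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are inclusive 1-based coordinates and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(218139, 219725)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```

Both results agree with the Gene Length and Protein Length fields of this record.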
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAATCAC TTCAATCTTA TTTTTTAATA TCTGCGATAG TAATTTTATC AATTTTGACT GGGATATTTA TCTGGCGCAA TAAGCATCTT ACGCAAATCC CTAAGTTCAA TGAAGAATCA TTCAATGCTC CGGTTTCCTC AAAATATATA CCAAAGAATA CTGATCTCGT ATTCCACTGG AAACTGAATC CAGGCTTACT TCCAAAGTAC ATCGAAAATT ATCAAGATAA AGTTAGTAAA CACGCCATAA ACAAAAAAGT AAGTTTTATT AGAGATTCCT CTTTTCAATT AATTGGCTTT AATTTTGCAA AAGACATCTC AAAATGGGTA GGAGATTATG GGAGCTTTGC AGTATTTGAT TCAAACAAAA AAACTATAAA TGATTGGTTG ATGGTCTTAG CAATAAAAGA AGATGTAAAT ATTAAACAAG AATTAGAATC TATTTTAGGA TCAAAAGTTG TTGATGAGAG TACTACTCAA AGCAATAAAA TCAGCACCTC AAAAACAGAA ATAATTTCAA AACAAATTAA TTCAAATAAC TCAATCTACT TTGCAAATGA TGAAGATAAT CTTTTAATAT CATCCAATCC TAATATCATA CAATCTTCAA TAGAAGAATT AGATAGCAAT ATAATAAATA CAAAAAAAAT GTATAAGAAT ATTCAATTAA AGGATAATCT TAAAGACGGA TTATTATTAT TAGAAATGTC TCCAAAAAAG ATTTTAAATC TTATTGGTCA AGAAGAAGAT TTATTGAATA TAAATAAGGT AGATAATTTA CTATCTTCTG TAAATGTAGA TAAAAATAAA TTAAACTTAG AAGGAATATT AGCTTACAAT GTTAAAACTA AAATGCCAGT TAAAGATATT AATTCTAATT TAATTGATAT AAAAAAGGAA TCTGAATTGC CTGAGAATTA TATATTAGTT GACAATCCCA AGCAGTATTT CCAGAAAGAT TCTGTCCATC CATATCAAAA GCTAATAGCC TCTATTATCA AAGAATCAAC AACCTCAGAT TATTCTGAGC TCTTAAAAAT AATTCTTGAA AACTCTCAAG GAAATTTGAT TTGGATAAAT GATAAAGACT GGTTGATTTT AACTAGGAAA TCTGATACGA AAAAGACAGA GATAGATGAT ATTCTAAAAA AAGAGAATTT TTTGAATTCA AATCTAGATT TTAAAAGCAG AAAGCTAGAG ATTTGGTCAA AAATAAGTAC AAATGAAAAT AATACATATG AGCTAAAAGA TAACATTGAG GCAATTGTCG AAGAAGATGA CAAGACTTAC ATTTGGAGTC AAAACTTATC TTCTATATCA AATTTTGATA ATACAAACTA CCTAAAAAAT TATTCAGATA ATGAACAGAA TACAAATGAA TTTAATGATT TTGATGATAT CTTGAAAATT CATTTAGGGA AAGAAAAAAC TAAAGCAATT TTAAATAGTT TCTATCCATA TATCTTATTG AAAACTATGT TAGGAAACAC ACTAAATCCT CCTCAGGATA TTGATATAGC CATTGCAGTC CCTACAATTA ATTATCCAGA CTTCATTAAA GTTAAAATCA ACTTAAAAAC AAGTTGA
Protein sequence | MKSLQSYFLI SAIVILSILT GIFIWRNKHL TQIPKFNEES FNAPVSSKYI PKNTDLVFHW KLNPGLLPKY IENYQDKVSK HAINKKVSFI RDSSFQLIGF NFAKDISKWV GDYGSFAVFD SNKKTINDWL MVLAIKEDVN IKQELESILG SKVVDESTTQ SNKISTSKTE IISKQINSNN SIYFANDEDN LLISSNPNII QSSIEELDSN IINTKKMYKN IQLKDNLKDG LLLLEMSPKK ILNLIGQEED LLNINKVDNL LSSVNVDKNK LNLEGILAYN VKTKMPVKDI NSNLIDIKKE SELPENYILV DNPKQYFQKD SVHPYQKLIA SIIKESTTSD YSELLKIILE NSQGNLIWIN DKDWLILTRK SDTKKTEIDD ILKKENFLNS NLDFKSRKLE IWSKISTNEN NTYELKDNIE AIVEEDDKTY IWSQNLSSIS NFDNTNYLKN YSDNEQNTNE FNDFDDILKI HLGKEKTKAI LNSFYPYILL KTMLGNTLNP PQDIDIAIAV PTINYPDFIK VKINLKTS
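
The sequences in this record are printed in 10-character blocks separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch of that step plus a GC fraction calculation; the helper names are hypothetical, and the sample is only the first three blocks of the gene sequence, not the full entry.

```python
def normalize_sequence(blocks: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(blocks.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First three blocks of the gene sequence, used here only as a short sample.
sample = normalize_sequence("ATGAAATCAC TTCAATCTTA TTTTTTAATA")
print(len(sample))
print(gc_fraction(sample))
```

Applied to the full Gene sequence field, the normalized length would be expected to match the Gene Length field, and the GC fraction could be compared against the GC content field.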