Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_17331 |
Symbol | |
ID | 4779097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1418931 |
End bp | 1420040 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085020 |
Product | integral membrane protein |
Protein accession | YP_001015553 |
Protein GI | 124026438 |
COG category | [R] General function prediction only |
COG ID | [COG4956] Integral membrane protein (PIN domain superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.07903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACC TAGTTATTCT GGTTTTATTT CTTATATCAG GTGCAATCAC CGGTTGGATT GGAGTGAATT GGTTACCTGA AGAGACTTTA GATCATATAA CTAATCTAAG AAATTTAAAA ATTGCTTTAT CAGGATTAAC CGCTCTTATA GGCCTGTTAA TAGCCTTCCT TTTTCAGCAA TTTAGAAATA AATTAACTAA AAGAATAAGA ACTATGCCAA CAGATCTACT AGTTAGTAGA TCTGTTGGCA TAGTTCTCGG ACTTGTCATA GCAACACTTT TATTAGTTCC TGTACTACTT TTACCTTTAC CCTCTGAATT GTTTTTTGTA AAACCTATTT TTGCTGTACT TAGCAACATT TTTTTTGGAG TACTTGGATA TAACCTTGCG GATGTTCATG GACGAACCGT TTTACGTCTA TTCAACCCAA ATAGCACTGA ATCTTTGTTA GTTGCAGATG GGGTTTTAAC TCCAGCTAGT GCAAAAGTTT TAGATACCAG TGTAATAATT GATGGTCGCA TTCAGGCCCT CCTTAGATTT GGGCTTATAG AAGGGCAAAT AATCGTAGCT CAATCAGTTA TGGATGAGCT TCAAAAACTA TCTGACTCAA GTAACAACGA AAAAAGAGGA AAAGGAAGAA GAGGACTAAA ATTACTTAAC CAATTAAGAG AAAGTTATGG AAGAAGATTA GTTATAAACA GCACTAGATA TGAAGGGGAA GGAACTGATG AAATTCTTTT AAAATTAACC TCAGACATCT CAGGAATATT AATCACTGTC GATTACAACT TGTCTCAAGT AGCTCTAGTC CAAGAAATAA AAGTTCTTAA CTTAAGTGAT CTAGTACTTG CAGTTAGGCC TGAAGTTCAA CCTGGTGAGA AGTTAAATTT AAAAGTAGTT AGAGAAGGTA AGGAAAACTC ACAAGGCATT GCTTATCTTG AAGATGGCAC AATGGTTGTT ATTGAGGAGG GTCTCCAATG GATCGGTAAA AGGATAGAAG TAGTTGTGAC TGGAGCATTA CAAACACCGA CGGGCCGAAT GGTTTTTAGT AAAGCTTCAA ATGATCAGCC CCCGAACAAA TTTGAAAAAA CAAAAGCATC TCAAGGCTAG
|
Protein sequence | MADLVILVLF LISGAITGWI GVNWLPEETL DHITNLRNLK IALSGLTALI GLLIAFLFQQ FRNKLTKRIR TMPTDLLVSR SVGIVLGLVI ATLLLVPVLL LPLPSELFFV KPIFAVLSNI FFGVLGYNLA DVHGRTVLRL FNPNSTESLL VADGVLTPAS AKVLDTSVII DGRIQALLRF GLIEGQIIVA QSVMDELQKL SDSSNNEKRG KGRRGLKLLN QLRESYGRRL VINSTRYEGE GTDEILLKLT SDISGILITV DYNLSQVALV QEIKVLNLSD LVLAVRPEVQ PGEKLNLKVV REGKENSQGI AYLEDGTMVV IEEGLQWIGK RIEVVVTGAL QTPTGRMVFS KASNDQPPNK FEKTKASQG
|
| |