Gene Information |
Locus tag | NATL1_15721 |
Symbol | |
ID | 4780138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1278488 |
End bp | 1279570 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640084854 |
Product | hypothetical protein |
Protein accession | YP_001015394 |
Protein GI | 124026278 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01151] photosystem II, D1 subunit (also called Q(B)) |
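The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (consistent with "Is gene spliced | No"). The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted as an amino acid.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record above.
gene_bp, protein_aa = cds_lengths(1278488, 1279570)
print(gene_bp, protein_aa)  # 1083 360
```

Under these assumptions the result agrees with the Gene Length (1083 bp) and Protein Length (360 aa) fields.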
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.131507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA TTCAGCAGCA GCGTTCTTCG TTGCTCAAAG GTTGGCCACA ATTCTGCGAG TGGGTTACTT CCACCAACAA CCGTATCTAT GTCGGTTGGT TCGGTGTATT GATGATCCCT TGCCTTCTTG CGGCAACAAC TTGTTTCATC GTTGCATTTA TCGCTGCTCC TCCAGTTGAT ATCGACGGTA TCCGTGAGCC AGTAGCTGGT TCATTCATGT ATGGAAACAA CATCATTTCT GGTGCTGTTG TTCCTTCAAG TAACGCTATC GGCCTTCACT TCTACCCAAT CTGGGAAGCA GCAACTCTTG ATGAGTGGCT ATATAACGGT GGCCCTTACC AGCTTGTAAT CTTCCACTTC CTTATCGGTA TCTCTGCATA CATGGGACGT CAGTGGGAGC TTTCATACCG TTTAGGTATG CGCCCATGGA TCTGTGTTGC TTACTCAGCT CCTGTATCAG CAGCTTTCGC TGTATTCCTT GTTTACCCAT TCGGTCAGGG TTCATTCTCT GATGGTATGC CTCTAGGAAT TTCTGGAACA TTCAACTTCA TGTTCGTTTT CCAGGCTGAG CACAACATCT TGATGCACCC ATTCCATATG GCTGGTGTAG CAGGTATGTT CGGTGGTGCT TTGTTCTCTG CAATGCACGG TTCTTTGGTT ACTTCATCAC TTATCCGTGA GACCACAGGA CTTGATTCAC AGAACTATGG TTACAAGTTT GGACAAGAAG AAGAGACATA CAACATCGTT GCAGCTCATG GCTACTTCGG TCGTTTGATC TTCCAATATG CAAGCTTCAA CAACAGCCGT AGTCTTCACT TCTTCTTGGC TTCATGGCCA GTGATTTGTG TTTGGTTGAC ATCTATGGGC ATCTGCACCA TGGCGTTCAA CTTGAACGGT TTCAACTTCA ACCAGTCTGT AGTTGATACT TCAGGCAAGG TTGTACCAAC CTGGGGTGAC GTACTTAACC GTGCAAACCT TGGTATGGAA GTAATGCACG AGCGTAATGC TCACAACTTC CCACTTGACC TAGCAGCTGC TGAGTCTACT TCTGTAGCTC TTGTTGCACC TGCAATCGGT TAA
|
Protein sequence | MTTIQQQRSS LLKGWPQFCE WVTSTNNRIY VGWFGVLMIP CLLAATTCFI VAFIAAPPVD IDGIREPVAG SFMYGNNIIS GAVVPSSNAI GLHFYPIWEA ATLDEWLYNG GPYQLVIFHF LIGISAYMGR QWELSYRLGM RPWICVAYSA PVSAAFAVFL VYPFGQGSFS DGMPLGISGT FNFMFVFQAE HNILMHPFHM AGVAGMFGGA LFSAMHGSLV TSSLIRETTG LDSQNYGYKF GQEEETYNIV AAHGYFGRLI FQYASFNNSR SLHFFLASWP VICVWLTSMG ICTMAFNLNG FNFNQSVVDT SGKVVPTWGD VLNRANLGME VMHERNAHNF PLDLAAAEST SVALVAPAIG
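The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the gene sequence are used as a demonstration. The record reports a GC content of 47% for the full gene, a value this sketch does not attempt to confirm.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence above.
head = clean_sequence("ATGACCACCA TTCAGCAGCA GCGTTCTTCG")
print(len(head), round(gc_percent(head), 1))
```

Passing the complete gene sequence field to `clean_sequence` in the same way yields a string whose length can be compared with the Gene Length field.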