Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20371 |
Symbol | tas |
ID | 4779856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1680063 |
End bp | 1681034 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640085331 |
Product | hypothetical protein |
Protein accession | YP_001015857 |
Protein GI | 124026742 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAAGA AAAAAATTGG GATTGGTTTT GGAACATGGG CGTGGGGAAA TAAGCTTGTT TGGGACTACA AAGCTGAAAC AGATGATATT TTACTTAAAA AAACTTTTTT TGATGCAATA AATGGGGGAT TAGATCTTGT TGACAGTGCA GATTCATATG GCACGGGAAG TTTATTTGGA CAAAGCGAAA AGCTAATAGG CGATTTCCTT GAAGACTTGC CCAAGAGAAA GCTTAAAAAA ATTACTATCG CAACAAAGCT TGCACCCTTT CCATGGAGAA TTGGTCGCAA TGGTCTAAAC AATGCATTCC AAGAAAGTAA TCAGCGGCTA AAAGGAAATA TGACAAGAGT ACAACTTCAT TGGAGTACTT ATCGCTATGC ACCTTGGCAA GAAGAACAAT TACTAAATGG ACTAGGAGAT TTATACGAAG AAGGTTTAAT CAAAGAAATT GGGCTATCTA ATACTGGTCC AAAAAGACTA AATTTTTTAT TCCAAAAACT GAAAAAAAGA GGAATTAAAA TTAAGAGTAT ACAAATGCAA CTTTCGTTAT TAACAAAACC ATCTTTAGAA GATGAAAATA TAAAAATTAT ATGTGATGAG AATGAAATTG AATATTTAGC TTATAGTCCA TTAGGACTGG GAATCCTTAC AGTTCCACCT AATCGATCTC CTAAGCCTAC AACATTCTTA CGAAAAAGTT TATACCAAAG GATACTTCCA AAAACCATTG AATTGAGAAC ATTGCTAACT AATATCGGTA AAAAATATTC AGCTTCCCAA GCACAAGTTG CTTTGAATTG GGTTAGGTCT CATGGAGCTA AACCAATAGT CGGTATTCGT AATCCAAACC AAGCTAAAGA TGCAATTTCA GCACTTAATT GGTCTTTAAC TAAAAGTGAA AAACAAAAGC TCGATTTTTG CAGGAATGCA TGTCTAGCAA ATATGCCACT AAATCCTTTT ACTAGTCCAT AA
|
Protein sequence | MTKKKIGIGF GTWAWGNKLV WDYKAETDDI LLKKTFFDAI NGGLDLVDSA DSYGTGSLFG QSEKLIGDFL EDLPKRKLKK ITIATKLAPF PWRIGRNGLN NAFQESNQRL KGNMTRVQLH WSTYRYAPWQ EEQLLNGLGD LYEEGLIKEI GLSNTGPKRL NFLFQKLKKR GIKIKSIQMQ LSLLTKPSLE DENIKIICDE NEIEYLAYSP LGLGILTVPP NRSPKPTTFL RKSLYQRILP KTIELRTLLT NIGKKYSASQ AQVALNWVRS HGAKPIVGIR NPNQAKDAIS ALNWSLTKSE KQKLDFCRNA CLANMPLNPF TSP
|
| |