Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01341 |
Symbol | |
ID | 4779138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 130693 |
End bp | 131943 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640083398 |
Product | putative cysteine desulfurase or selenocysteine lyase |
Protein accession | YP_001013963 |
Protein GI | 124024847 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.284649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAATT TAGTAAAAAA TATTGCTGAA AAAAGTAGAA ATGATTTCCC ATTATTTAAT AGAGATATTA ATAAAAATTT AATATATTTA GATCATGCAG CGACTAGTCA AAAGCCTAAA CAAGTTATAG ATTCTCTAAA AAAATACTAT AGCTTTCAAA ATGCCAACGT TCATAGAGGT GCCCATCAGC TAAGTGCAAT CGCAACAGAA AAATTTGAAA ATTCTAGAAA GTTAACAGCA AATTTTATAA ATAGTAAGAA TGAAAAAGAG ATTATTTTTA CTAGAAATGC TACTGAAGCT ATAAACCTCG TAGCTTATAC ATGGGGCAAT TATGAACTTC AGGAAAACGA CGAAATCTTA ATAAGTTTAA TGGAGCATCA CAGTAATATA GTCCCCTGGC AACTAATAGC CAAAGCAAAA AAGTGCAAGC TAATTTATAT CAATATTGAT AAAAATGGAG AATTAGATTT TGATGATTTT AGAAAAAAAT TGAGTGATAA AACTAAAATA GTCAGCCTTG TTCACGTAAG TAATACACTC GGTTGTTGTA ATCCTATCGA GGAAATTTCA TCCCTTGCAC ACCAAAAAGG TAGCTTAGTT CTTTTAGATG CTTGCCAAAG TCTTGCTCAT AAGCAGGTAG ATATTAAAAA ACTTGGTATT GATTTTCTGG CAGGATCTTC TCATAAACTT TGCGGGCCTA CTGGAATAGG TTTTTTATGG GGTAGAGAAG AAATTTTAAA AAAAATTCCT CCTTTCCTTG GTGGAGGAGA GATGATTAAC GAAGTTTTTA AGGACAACAG CACGTGGGCA GAGTTACCGC ATAAATTCGA AGCAGGTACT CCAGCTATTG GGGAAGCCAT TGGTATGGGA ACTGCACTTA AGTATTTACA GTCAATTGGA TTAAACGAAA TCCATAATTA CGAAAAAGAA TTAACAAAAT ATCTTTTCGA AAAATTAGAG GAAATAGATG ATTTAAAAAT TCTTGGTCCT AGCCCTTTCA TTCAGCCTGA TAGAGGGCCT TTAGCAACCT TTTATATTAA AGGTGTTCAC TCCAATGATG TTGCTGAGTT ACTTGATAAC AGCAATATTT ACATAAGGAG TGGTCATCAT TGCTGCCAAC CACTTCATCG CTTCTATGGC ATAAAAAGCA CAGCTAGAGC AAGCTTGAGC TTTACATCTA CTCCATCTGA AATTGATTAT CTTGCTGAAG AACTAAAATC AGTAATTTCT TTTTTAAAGA AAAATTCTTA A
|
Protein sequence | MNNLVKNIAE KSRNDFPLFN RDINKNLIYL DHAATSQKPK QVIDSLKKYY SFQNANVHRG AHQLSAIATE KFENSRKLTA NFINSKNEKE IIFTRNATEA INLVAYTWGN YELQENDEIL ISLMEHHSNI VPWQLIAKAK KCKLIYINID KNGELDFDDF RKKLSDKTKI VSLVHVSNTL GCCNPIEEIS SLAHQKGSLV LLDACQSLAH KQVDIKKLGI DFLAGSSHKL CGPTGIGFLW GREEILKKIP PFLGGGEMIN EVFKDNSTWA ELPHKFEAGT PAIGEAIGMG TALKYLQSIG LNEIHNYEKE LTKYLFEKLE EIDDLKILGP SPFIQPDRGP LATFYIKGVH SNDVAELLDN SNIYIRSGHH CCQPLHRFYG IKSTARASLS FTSTPSEIDY LAEELKSVIS FLKKNS
|
| |