Gene NATL1_01341 details


Gene Information

Locus tag: NATL1_01341
Symbol:
ID: 4779138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 130693
End bp: 131943
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 33%
IMG OID: 640083398
Product: putative cysteine desulfurase or selenocysteine lyase
Protein accession: YP_001013963
Protein GI: 124024847
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
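
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming 1-based inclusive coordinates (the listed values are consistent with this) and a single terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(130693, 131943)
print(length, protein_length(length))  # 1251 416
```

The result agrees with the record: 1251 bp is 417 codons, of which 416 encode residues and one is the stop codon.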


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.284649
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAATT TAGTAAAAAA TATTGCTGAA AAAAGTAGAA ATGATTTCCC ATTATTTAAT 
AGAGATATTA ATAAAAATTT AATATATTTA GATCATGCAG CGACTAGTCA AAAGCCTAAA
CAAGTTATAG ATTCTCTAAA AAAATACTAT AGCTTTCAAA ATGCCAACGT TCATAGAGGT
GCCCATCAGC TAAGTGCAAT CGCAACAGAA AAATTTGAAA ATTCTAGAAA GTTAACAGCA
AATTTTATAA ATAGTAAGAA TGAAAAAGAG ATTATTTTTA CTAGAAATGC TACTGAAGCT
ATAAACCTCG TAGCTTATAC ATGGGGCAAT TATGAACTTC AGGAAAACGA CGAAATCTTA
ATAAGTTTAA TGGAGCATCA CAGTAATATA GTCCCCTGGC AACTAATAGC CAAAGCAAAA
AAGTGCAAGC TAATTTATAT CAATATTGAT AAAAATGGAG AATTAGATTT TGATGATTTT
AGAAAAAAAT TGAGTGATAA AACTAAAATA GTCAGCCTTG TTCACGTAAG TAATACACTC
GGTTGTTGTA ATCCTATCGA GGAAATTTCA TCCCTTGCAC ACCAAAAAGG TAGCTTAGTT
CTTTTAGATG CTTGCCAAAG TCTTGCTCAT AAGCAGGTAG ATATTAAAAA ACTTGGTATT
GATTTTCTGG CAGGATCTTC TCATAAACTT TGCGGGCCTA CTGGAATAGG TTTTTTATGG
GGTAGAGAAG AAATTTTAAA AAAAATTCCT CCTTTCCTTG GTGGAGGAGA GATGATTAAC
GAAGTTTTTA AGGACAACAG CACGTGGGCA GAGTTACCGC ATAAATTCGA AGCAGGTACT
CCAGCTATTG GGGAAGCCAT TGGTATGGGA ACTGCACTTA AGTATTTACA GTCAATTGGA
TTAAACGAAA TCCATAATTA CGAAAAAGAA TTAACAAAAT ATCTTTTCGA AAAATTAGAG
GAAATAGATG ATTTAAAAAT TCTTGGTCCT AGCCCTTTCA TTCAGCCTGA TAGAGGGCCT
TTAGCAACCT TTTATATTAA AGGTGTTCAC TCCAATGATG TTGCTGAGTT ACTTGATAAC
AGCAATATTT ACATAAGGAG TGGTCATCAT TGCTGCCAAC CACTTCATCG CTTCTATGGC
ATAAAAAGCA CAGCTAGAGC AAGCTTGAGC TTTACATCTA CTCCATCTGA AATTGATTAT
CTTGCTGAAG AACTAAAATC AGTAATTTCT TTTTTAAAGA AAAATTCTTA A
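
The GC content field in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch of one way to do that, assuming GC content means the share of G and C among all non-whitespace characters. The function name `gc_percent` is hypothetical, and the rounding used by the database is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above: 4 of 20 bases are G or C.
print(round(gc_percent("GTGAACAATT TAGTAAAAAA"), 1))  # 20.0
```

Passing the full 1251 bp sequence instead of the two-block excerpt gives the value to compare against the GC content field.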
 
Protein sequence
MNNLVKNIAE KSRNDFPLFN RDINKNLIYL DHAATSQKPK QVIDSLKKYY SFQNANVHRG 
AHQLSAIATE KFENSRKLTA NFINSKNEKE IIFTRNATEA INLVAYTWGN YELQENDEIL
ISLMEHHSNI VPWQLIAKAK KCKLIYINID KNGELDFDDF RKKLSDKTKI VSLVHVSNTL
GCCNPIEEIS SLAHQKGSLV LLDACQSLAH KQVDIKKLGI DFLAGSSHKL CGPTGIGFLW
GREEILKKIP PFLGGGEMIN EVFKDNSTWA ELPHKFEAGT PAIGEAIGMG TALKYLQSIG
LNEIHNYEKE LTKYLFEKLE EIDDLKILGP SPFIQPDRGP LATFYIKGVH SNDVAELLDN
SNIYIRSGHH CCQPLHRFYG IKSTARASLS FTSTPSEIDY LAEELKSVIS FLKKNS
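
The protein sequence corresponds to the gene sequence read under translation table 11. The sketch below illustrates this on the first 30 bases, assuming the standard codon assignments (which table 11 shares) and that an alternative initiation codon such as GTG is read as methionine at the first position. `translate_cds` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence
print(translate_cds("GTGAACAATT TAGTAAAAAA TATTGCTGAA"))  # MNNLVKNIAE
```

The output matches the first block of the protein sequence, including the leading M encoded by the GTG start codon.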