Gene NATL1_04651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04651 
Symbol 
ID4780078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp426553 
End bp427755 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content37% 
IMG OID640083742 
Productputative L-cysteine/cystine lyase 
Protein accessionYP_001014294 
Protein GI124025178 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCAA TATTGTTCAA AGACAATATG CCCGCGCTGC AAAATAAAGC TTATTTCAAT 
TATGGAGGTC AAGGTCCACT CCCAACTCAA TCATTAAATG CAATAACTTC TAGTTGGCAA
ACAATTCAAA AATTAGGTCC ATTCACAAAT AATGTTTGGC CATACATTAC GAAAGAAGTA
ATAACCACAA AAAATCTTAT AGCGGAGATT TGCTCTATAC ATCCCAAGAG AATTGCATTT
ACTGAAAATG TCACCTCCGG ATGTGTCTTG CCTTTACTAG GGCTACCATT TTCTGATGGT
GATAATTTAT TACTTAGCGA TTGTGAGCAT CCTGGGATAG TCGCAGCATG CAAAGAGTTA
GCTCGAAAAA AGAATCTAAC AATAGCAATA CTTCCTGTAT CAAAGTTATG CAATGGTAAC
GACAAGAAAG ATGAGACGTA TAACACAGTA CTTAAATTGA TAGATGAATA TCTTCAAAAA
AATACGAAGC TTGTTGTTCT CTCACATCTC CTTTGGAATA CAGGGCAAAT TATGCCAATT
GAACTTATTT CAAAAAGACT TAAAGAGCAT TCAAGCAAAC CTTATTTATT AGTTGATGCG
GCTCAAAGTT TTTGTCATAT TCCAAGCAAA GGTGCTTGTG ATAATGCAGA CATCTATGCG
TTTACAGGAC ATAAATGGGC TTACGGGCCT GAGGGCTTGG GCGCTGTTGT TCTTTCAGCA
AGAGTTCTAG AAGAATCAAG CCCAACATTG ATTGGTTGGA AAAGCTTAAA AGCTGAAGAA
GGAATACATA TAAATAATAA AGCTCCTTTT CATTCTGATG CCAGACGTTT TGAGATAGCG
ACTTCTTGTA TCCCTCTTTT AGCTGGTCTC AGAAGCTCTC TAGCAATGCT AAAAAATGAA
GGTAATGAGA CTGAGCGCTT TTTTAAGATT AAAAGCCTCA GCTGTATCCT GTGGGAGAAA
CTCAACCAAA TCAAAAACAT TGAGCTTGTT TTAAATAGTC CTCCACCTTC AGGAATTATA
AGTTTTACCA TTAGTGGGAA CCATTCTCCT GAAGAGGTTG TCGATTATCT TGGTAAAGAG
AATTTATGGA TAAGAGTCCT CGAGGATCCT AAATGGCTTC GTGCTTGCGT TCACATAACT
ACAGATTCAA ACGAAATAGA TAACCTTGTT ATTGGTCTAA AAAATTTCAT CTCGACTAAG
TAG
 
Protein sequence
MSPILFKDNM PALQNKAYFN YGGQGPLPTQ SLNAITSSWQ TIQKLGPFTN NVWPYITKEV 
ITTKNLIAEI CSIHPKRIAF TENVTSGCVL PLLGLPFSDG DNLLLSDCEH PGIVAACKEL
ARKKNLTIAI LPVSKLCNGN DKKDETYNTV LKLIDEYLQK NTKLVVLSHL LWNTGQIMPI
ELISKRLKEH SSKPYLLVDA AQSFCHIPSK GACDNADIYA FTGHKWAYGP EGLGAVVLSA
RVLEESSPTL IGWKSLKAEE GIHINNKAPF HSDARRFEIA TSCIPLLAGL RSSLAMLKNE
GNETERFFKI KSLSCILWEK LNQIKNIELV LNSPPPSGII SFTISGNHSP EEVVDYLGKE
NLWIRVLEDP KWLRACVHIT TDSNEIDNLV IGLKNFISTK