Gene P9303_03051 details

Gene Information

Locus tag: P9303_03051
Symbol:
ID: 4778512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 320023
End bp: 321327
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 54%
IMG OID: 640085807
Product: putative cysteine desulfurase or selenocysteine lyase
Protein accession: YP_001016323
Protein GI: 124022016
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
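
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; neither is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(320023, 321327)
print(length)                     # 1305
print(protein_length_aa(length))  # 434
```

Both values agree with the Gene Length and Protein Length fields above.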


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.512887
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTT CCGTTAGTTC AGATCAGAGC ATGCCGTTGG CGATGGCTGC TGTAGAAGCT 
TCAGCCAACC TGGCAGATCT CACGCGAGCG GATTTCCCGC TACTAGGCCA GACCGCCTGC
TTAGGTCAGC CCTTGATTTA CATGGACCAT GCGGCGACAA GTCAGAAGCC ACGGCAGGTG
CTGGATGCCT TACAGCATTA CTACGACCAT GACAACGCCA ATGTGCACCG TGGTGCCCAT
CAGTTGAGTG TTCGGGCTAC CGAGGATTTT GAACGAGCAC GTCAGAAGGT GGCTGATTTC
ATCGCTGCCT CAAGCGCGCG GGAAATCGTT TTCACCAGGA ATGCAAGTGA GGCGATCAAC
CTGGTGGCTC GCAGTTGGGG TGATGCCAAC CTCCATGAAG GTGACGAGGT GCTGCTTACC
TTGATGGAGC ATCACAGCAA CATTGTCCCT TGGCAGATGC TTGCCAAGCG AACAGGTTGC
GTGCTGCGCT TTGTCGACCT CACCGATTGC GGGGAGCTTG ATTTAAATGA TCTTCGGCAA
AAGCTCTCAG AGCGCACTCG TTTGGTCAGT CTGGCTCACT TGAGCAATGT GCTGGGCTGT
TTTAACCCTA TCTCTGAGGT CACTGCAGAG GCTCATCGCT TCGGTGCTCT GGTGTTGCTG
GATGCTTGCC AGAGCTTGCC ACATATGCCT GTTGATGTGT CCCGGCTTGG ATGTGACTTT
CTCGTGGGTT CTTCCCACAA ATTGTGTGGT CCTACCGGGA TGGGTTTTCT TTGGGCTCGA
GAGGAGTTGC TTGATGCCAT GCCGCCTTTC CTCGGCGGTG GCGAAATGAT TCAGGATGTC
TATCTCGACC ACAGCAGCTG GGCTGATCTG CCTTACAAGT TTGAAGCAGG TACCCCTGCT
ATTGGGGAAG CTATTGGTAT GGGCGTTGCT CTCGACTACC TGAACCAGGT TGGCTTAGAT
CGTATTCACG CTTGGGAGCA GCAGCTCACG CTGCAATTGT TTGATCGCCT CCAAGGCATC
GATGGGTTGA CGATTCTGGG CCCAACTCCT CAGCAGGAGC CTGATCGGGC GGTCCTGGCG
GCTTTCACAG TGGATGGCTT GCATCCCAAT GATATTGGTG CCTTGCTTGA TTCAGCAGGG
ATCTGTATTC GTAGTGGCCA CCACTGCACC CAGCCTTTGC ATCGTCACTA TGGGATCCCT
GGATCAGCTC GTGCCAGCTT GAGCTTCACC AATACACCAG AAGAAGTCGA TCGTTTTGCT
GAGGAATTGG TTTCGACGAT CGGCTTCTTA AGAGAGCACA GCTAG
 
Protein sequence
MTSSVSSDQS MPLAMAAVEA SANLADLTRA DFPLLGQTAC LGQPLIYMDH AATSQKPRQV 
LDALQHYYDH DNANVHRGAH QLSVRATEDF ERARQKVADF IAASSAREIV FTRNASEAIN
LVARSWGDAN LHEGDEVLLT LMEHHSNIVP WQMLAKRTGC VLRFVDLTDC GELDLNDLRQ
KLSERTRLVS LAHLSNVLGC FNPISEVTAE AHRFGALVLL DACQSLPHMP VDVSRLGCDF
LVGSSHKLCG PTGMGFLWAR EELLDAMPPF LGGGEMIQDV YLDHSSWADL PYKFEAGTPA
IGEAIGMGVA LDYLNQVGLD RIHAWEQQLT LQLFDRLQGI DGLTILGPTP QQEPDRAVLA
AFTVDGLHPN DIGALLDSAG ICIRSGHHCT QPLHRHYGIP GSARASLSFT NTPEEVDRFA
EELVSTIGFL REHS
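
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle through T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    Alternative start codons are not handled; that is an assumption of this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGACCTCTT CCGTTAGTTC AGATCAGAGC"))  # MTSSVSSDQS
```

Applied to the first 30 bases, this yields the first ten residues of the protein sequence; the final five codons (AGA GAG CAC AGC TAG) yield the closing residues REHS followed by the stop.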