Gene P9303_21101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21101 
Symbol 
ID4777062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1871058 
End bp1872344 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content55% 
IMG OID640087618 
Productputative L-cysteine/cystine lyase 
Protein accessionYP_001018110 
Protein GI124023803 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.601707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTATG TGCTGCTGCA TCGGCTAGTG AAGCGAGCAG CAGATAAAAC GGCAGGCGAT 
CATGAACACA GGCCCCGACA ACCCATGCTT AGAGACCTCT GCCCAGCACT CGCTAACAAG
ACCTACTTCA ACTACGGCGG CCAGGGCCCC TTACCCACGC CTTCCCTTGA AGCGATTACT
GCAAGCTGGC AGAACATTCA AGAACTGGGT CCCTTCACCA ACAGCGTGTG GCCCTATGTG
GCCAAAGAAG TCCAAGCCAC TCGTTCACAC TTAGCCAAGC TCTGCGGCGT TGCCCCCCAT
CGCATCGCGC TCACTGAAAA TGTCACCAGT GGCTGTGTCT TGCCGCTCTG GGGCCTGCCT
TTCTCAGAAG GCGATCGCTT GCTCATTAGC GACTGTGAAC ATCCGGGAAT CGTTGCTGCA
TGCATCGAAC TTGCCCGCCG ACAACACCTG GAAATAGACA CGCTGCCTGT AAAGAACTTG
CGTCATGGTG CTAACGATCA AACGACTAGC GACAGCCTTG TGCTGGAAAG ACTTGAGCAA
CACCTCAAAC CAAGCACAAG GCTGGTAGTG CTCTCCCACC TGCTATGGAA TACAGGCCAG
GTGATGCCGA TCTCGGCTGT TTCAACAGCC CTCAGCCATC ATCCACAACA GCCTTTCTTG
CTCGTGGATG CGGCACAAAG CTTCGCCCAA ATGCCTATAC AGGAAGCCGC TGCCGCTTCA
GACATCTATG CCTTTACGGG GCACAAATGG GCCTGCGGGC CTGAAGGGCT TGGTGGAGTC
GCCCTCTCAG AACGGGTGCT CGCCGAAGCG AATCCCACCC TGATTGGCTG GCGCAGCTTG
CAGAACGAAG GCCATCTTCA AAGCAACCTG GACGAACTCT TCCATCACGA CAGTCGACGC
TTTGAGGTAG CAACCTCCTG CGTGCCGCTG ATGGCGGGCC TGCGCTGTTC GTTGGAGCTG
CTCGAAGCCG CAGGCTCGCA GCAGGAACGA CTGAGCCAGA TTCGCCAAGG CAGCCGACAC
TTATGGAATC AACTACAACA GCTCACAGGC GTCGAAACAC TGCTCAACAG TGCCCCAGCA
GCTGGTCTTG TCAGCTTTGA GTTACCCCAA GGCCCCCCAG CTCCTGATGT GGTTAAACAA
TTAGGAAACG ATCAGCTCTG GATTCGGCAT CTAGAAGATC CAATCTGCCT ACGTGCCTGC
GTGCACATCA CCACTGAAGA GCAAGAACTC AACACACTTA CAACCTCACT CAAGCAGCTA
GCTAGCAAAG GAGAGCCAAG CAATTAA
 
Protein sequence
MTYVLLHRLV KRAADKTAGD HEHRPRQPML RDLCPALANK TYFNYGGQGP LPTPSLEAIT 
ASWQNIQELG PFTNSVWPYV AKEVQATRSH LAKLCGVAPH RIALTENVTS GCVLPLWGLP
FSEGDRLLIS DCEHPGIVAA CIELARRQHL EIDTLPVKNL RHGANDQTTS DSLVLERLEQ
HLKPSTRLVV LSHLLWNTGQ VMPISAVSTA LSHHPQQPFL LVDAAQSFAQ MPIQEAAAAS
DIYAFTGHKW ACGPEGLGGV ALSERVLAEA NPTLIGWRSL QNEGHLQSNL DELFHHDSRR
FEVATSCVPL MAGLRCSLEL LEAAGSQQER LSQIRQGSRH LWNQLQQLTG VETLLNSAPA
AGLVSFELPQ GPPAPDVVKQ LGNDQLWIRH LEDPICLRAC VHITTEEQEL NTLTTSLKQL
ASKGEPSN