Gene A9601_04651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_04651 
Symbol 
ID4717163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp404775 
End bp405950 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content29% 
IMG OID640078177 
Productputative L-cysteine/cystine lyase 
Protein accessionYP_001008860 
Protein GI123968002 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.283231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAATA ATCTAAGAGA TCAAATACCC GCATTAAAAA ATAAGTATTA TTTCAACTAT 
GGCGGTCAAG GACCATTACC AAAATCTTCT CTAGAAGCAA TAGTTAAAAC TTGGGAGATT
ATCCAAGATT TAGGACCATT TACCAATGAT ATGTGGCCTT TTATTTACAA AGAAATATTG
ACCACAAAAA GAATCATTGC GCAAAAATTA GGTGTCAATT CAAAGAATGT AGCTTTTACC
GAAAATATCT CTTCCGGTAT GATTTTGCCC TTTTGGGGAA TAAAAGTAAA AGAGGGAGAA
GAGTTGTTAA TAAGTGACTG TGAACATCCT GGAGTAGTGG CTGCAAGTCG AGAATTTTGC
AGAAGAAATA AATTAATATT CAAAATTTTG CCAATCCAAA AAATTAAAAA TCTAAACGAC
GAAAATATAA TTTTAGAGAT TTTGAAAAAT CTAAATAGTA AGACTAAGAT CCTAATTATT
TCTCATATCT TATGGAACTT TGGATATAAA ATTCCTTTAA AAGAAATTTC TATCGAATTA
AAAAATAATC GAGAAAACTC TTATTTACTT GTTGATGGTG CTCAAACCTT TGGGCATATA
AATATTGAAA AAGAAGTTTT TTATTCTGAT TTATATTCAA TAACTTCTCA CAAATGGGCA
TGTGGACCAG AAGGACTTGG AGCCATTTAT GTCTCAGATA GATTTATTCG TGAAACAGAT
CCAACAATAA TTGGTTGGAA ATCATTAAAA AAAGAACAAG GCATTTATGA GCCTTCAGAT
AATCTTTTTC ATGATGATGC AAGGAAATTT GAAATAGCTA CCTCTTGTAT TCCTTTACTT
GCTGGGCTAC GGAATTCTTT AGATCTTTTG GATAAAGACT GCCATGAAAA AGAAAAAAAC
AAAAATATCA AAAAATTAAG TGGAAAACTT TGGGATGAAT TAAATCAATC AAAGGGTGTT
GAATTAGTTT TAGAAAAAAA ATATTTAAAT GGGATTGTTA GTTTTAATAT CGAAAATATT
AAAGATAAGG ATAAATATGT AAAGAAACTT GGAGAAAAGA AAATTTGGAT TAGAGTTTTA
GAAGATCCAA AATGGTTTAG AGCATGCGTA CATCAAATGA CTACAGAAGC TGAGATTGAT
TTACTTGCTA GAGAAATAAA AAAAATATTG ACTTAA
 
Protein sequence
MRNNLRDQIP ALKNKYYFNY GGQGPLPKSS LEAIVKTWEI IQDLGPFTND MWPFIYKEIL 
TTKRIIAQKL GVNSKNVAFT ENISSGMILP FWGIKVKEGE ELLISDCEHP GVVAASREFC
RRNKLIFKIL PIQKIKNLND ENIILEILKN LNSKTKILII SHILWNFGYK IPLKEISIEL
KNNRENSYLL VDGAQTFGHI NIEKEVFYSD LYSITSHKWA CGPEGLGAIY VSDRFIRETD
PTIIGWKSLK KEQGIYEPSD NLFHDDARKF EIATSCIPLL AGLRNSLDLL DKDCHEKEKN
KNIKKLSGKL WDELNQSKGV ELVLEKKYLN GIVSFNIENI KDKDKYVKKL GEKKIWIRVL
EDPKWFRACV HQMTTEAEID LLAREIKKIL T