Gene A9601_00821 details

Gene Information

Locus tag: A9601_00821
Symbol:
ID: 4716765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 85602
End bp: 86855
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 32%
IMG OID: 640077780
Product: putative cysteine desulfurase or selenocysteine lyase
Protein accession: YP_001008477
Protein GI: 123967619
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
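
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 85602
end_bp = 86855

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1254 417
```

Under these assumptions the computed values agree with the listed Gene Length (1254 bp) and Protein Length (417 aa).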


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACGA TTCAAAATTT TCCTGAAATA ACTAAGAAAG ACTTTCCTCT TTTAAATAAC 
AACATAAAAA ATAATGAGCA AATCATTTAT TTAGACCATG CTGCAACCAC GCAAAAACCA
ATTCAAGTCT TGAAAAAAAT TGATGAATAT TATAGAAATT TTAATGCCAA TGTACATAGA
GGAGCTCATC AATTAAGTGC TAAAGCGACA GAAGAATTTG AAAATGCAAG ATTTTCAATT
AGTAAATACA TCAAAGCAAA TTCGCCAAAA GAAATTATTT TCACAAGAAA TGCTACTGAA
GCAATAAATC TAGCAGCTAG ATCATGGGGC GAATATTCAT TAGGAGAAAA CGATGAAATT
CTTCTATCAA TAATGGAGCA TCATAGCAAT ATTGTTCCAT GGCAAATGGT TGCAGCGAAA
AATAAGTGTA AATTAAAATT TATAGGCATC GATAAAAATG GGCAATTAGA TTTAGACGAT
TTTAAGTCAA AACTAACCTC TAGAACAAAG CTTGTTAGCC TAGTACATGT AAGTAATACT
CTAGGTTGCT GTAATCCAAT CAAAGAGATA ACTAAATTAG CTAAACAAAA AGGTTCTCTA
GTATTAATAG ATGCATGTCA AAGTTTGGCG CATCAAAAAC TAGATGTAAT AGATCTTGAT
ATTGATTTTT TAGCGGGATC AGGACATAAA CTTTGCGGTC CTACAGGAAT TGGTTTCCTC
TGGTCAAGAC AAGAAATTCT TGAAAAAATT CCTCCTTTCT TTGGAGGTGG CGAAATGATT
CAAGATGTCT TTGAAGAGAC AAGTACCTGG GCTGATCTCC CACATAAATT CGAAGCTGGA
ACTCCAGCCA TTGCAGAAGC AATAGGCCTT GCGGAAGCAA TTAATTATAT AAACACTATA
GGATTAAATG AAATTAATGA ATATGAAAAG ACTATTACTA AATATTTATT TGAAAAATTA
AATCAAATAG AAAATATTGA AATTATAGGT CCACCGCCGG AGATAGATCC AGACAGAGCC
TCACTTGCCA CCTTTTATAT AAAAAATATA CATTCAAATG ATATTGCTGA AATTCTTGAT
TCAAAAGGAA TTTGCATCAG AAGTGGTCAC CATTGCTGTC AACCTCTTCA CAGATACATC
GGAATTAAAT CAACAGCTAG AATCAGTATG AATTTCACAA CCAATAAGGA GGAAATTGAT
ATATTTCTTG AAAAATTAAA AGATACTATT GATTTTCTAA AAATCAATTC TTAA
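
The gene sequence is printed in groups of 10 bases, so the grouping has to be stripped before its length or GC content can be compared with the Gene Information fields. A minimal sketch follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first three groups of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an ungrouped sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base groups of the gene sequence, for illustration only.
example = "ATGGAAACGA TTCAAAATTT\nTCCTGAAATA"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of the short example gives values that can be compared with the listed Gene Length and GC content.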
 
Protein sequence
METIQNFPEI TKKDFPLLNN NIKNNEQIIY LDHAATTQKP IQVLKKIDEY YRNFNANVHR 
GAHQLSAKAT EEFENARFSI SKYIKANSPK EIIFTRNATE AINLAARSWG EYSLGENDEI
LLSIMEHHSN IVPWQMVAAK NKCKLKFIGI DKNGQLDLDD FKSKLTSRTK LVSLVHVSNT
LGCCNPIKEI TKLAKQKGSL VLIDACQSLA HQKLDVIDLD IDFLAGSGHK LCGPTGIGFL
WSRQEILEKI PPFFGGGEMI QDVFEETSTW ADLPHKFEAG TPAIAEAIGL AEAINYINTI
GLNEINEYEK TITKYLFEKL NQIENIEIIG PPPEIDPDRA SLATFYIKNI HSNDIAEILD
SKGICIRSGH HCCQPLHRYI GIKSTARISM NFTTNKEEID IFLEKLKDTI DFLKINS
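
The protein sequence corresponds to a translation of the gene sequence using translation table 11. The sketch below is an illustration, not the method used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so alternative start codons are not handled here. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGGAAACGA TTCAAAATTT TCCTGAAATA"))  # METIQNFPEI
```

The output matches the first ten residues of the protein sequence above.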