Gene Information |
Locus tag | A9601_00821 |
Symbol | |
ID | 4716765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 85602 |
End bp | 86855 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640077780 |
Product | putative cysteine desulfurase or selenocysteine lyase |
Protein accession | YP_001008477 |
Protein GI | 123967619 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACGA TTCAAAATTT TCCTGAAATA ACTAAGAAAG ACTTTCCTCT TTTAAATAAC AACATAAAAA ATAATGAGCA AATCATTTAT TTAGACCATG CTGCAACCAC GCAAAAACCA ATTCAAGTCT TGAAAAAAAT TGATGAATAT TATAGAAATT TTAATGCCAA TGTACATAGA GGAGCTCATC AATTAAGTGC TAAAGCGACA GAAGAATTTG AAAATGCAAG ATTTTCAATT AGTAAATACA TCAAAGCAAA TTCGCCAAAA GAAATTATTT TCACAAGAAA TGCTACTGAA GCAATAAATC TAGCAGCTAG ATCATGGGGC GAATATTCAT TAGGAGAAAA CGATGAAATT CTTCTATCAA TAATGGAGCA TCATAGCAAT ATTGTTCCAT GGCAAATGGT TGCAGCGAAA AATAAGTGTA AATTAAAATT TATAGGCATC GATAAAAATG GGCAATTAGA TTTAGACGAT TTTAAGTCAA AACTAACCTC TAGAACAAAG CTTGTTAGCC TAGTACATGT AAGTAATACT CTAGGTTGCT GTAATCCAAT CAAAGAGATA ACTAAATTAG CTAAACAAAA AGGTTCTCTA GTATTAATAG ATGCATGTCA AAGTTTGGCG CATCAAAAAC TAGATGTAAT AGATCTTGAT ATTGATTTTT TAGCGGGATC AGGACATAAA CTTTGCGGTC CTACAGGAAT TGGTTTCCTC TGGTCAAGAC AAGAAATTCT TGAAAAAATT CCTCCTTTCT TTGGAGGTGG CGAAATGATT CAAGATGTCT TTGAAGAGAC AAGTACCTGG GCTGATCTCC CACATAAATT CGAAGCTGGA ACTCCAGCCA TTGCAGAAGC AATAGGCCTT GCGGAAGCAA TTAATTATAT AAACACTATA GGATTAAATG AAATTAATGA ATATGAAAAG ACTATTACTA AATATTTATT TGAAAAATTA AATCAAATAG AAAATATTGA AATTATAGGT CCACCGCCGG AGATAGATCC AGACAGAGCC TCACTTGCCA CCTTTTATAT AAAAAATATA CATTCAAATG ATATTGCTGA AATTCTTGAT TCAAAAGGAA TTTGCATCAG AAGTGGTCAC CATTGCTGTC AACCTCTTCA CAGATACATC GGAATTAAAT CAACAGCTAG AATCAGTATG AATTTCACAA CCAATAAGGA GGAAATTGAT ATATTTCTTG AAAAATTAAA AGATACTATT GATTTTCTAA AAATCAATTC TTAA
|
Protein sequence | METIQNFPEI TKKDFPLLNN NIKNNEQIIY LDHAATTQKP IQVLKKIDEY YRNFNANVHR GAHQLSAKAT EEFENARFSI SKYIKANSPK EIIFTRNATE AINLAARSWG EYSLGENDEI LLSIMEHHSN IVPWQMVAAK NKCKLKFIGI DKNGQLDLDD FKSKLTSRTK LVSLVHVSNT LGCCNPIKEI TKLAKQKGSL VLIDACQSLA HQKLDVIDLD IDFLAGSGHK LCGPTGIGFL WSRQEILEKI PPFFGGGEMI QDVFEETSTW ADLPHKFEAG TPAIAEAIGL AEAINYINTI GLNEINEYEK TITKYLFEKL NQIENIEIIG PPPEIDPDRA SLATFYIKNI HSNDIAEILD SKGICIRSGH HCCQPLHRYI GIKSTARISM NFTTNKEEID IFLEKLKDTI DFLKINS
|
| |
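The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed 1254 bp), and that the CDS is unspliced and ends in a stop codon, as the record's "Is gene spliced | No" entry and the trailing TAA of the gene sequence suggest. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(85602, 86855)
print(length_bp)                  # 1254
print(protein_length(length_bp))  # 417
```

Both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields, which is consistent with the 1-based inclusive reading of the coordinates.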