Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33837 |
Symbol | |
ID | 5000576 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 598311 |
End bp | 599801 |
Gene Length | 1491 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 59% |
IMG OID | 640415997 |
Product | predicted protein |
Protein accession | XP_001416706 |
Protein GI | 145344368 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.40604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCTCGG GCGCGGTGAA GATCGCGCCC GACGCTCTGA ACGACTTCAT CGCCCCCTCG CAGGACTGCG TCGTCGCTCT CGACGGTGGT AAGCTCAAAC TCGACGACGA TGGTGCCGTC GGCGCGTCTT CCGAAGACGC CTTCTCCACC GGCGAAGTCG CCCTGCGTCG ACGTAAACCG CGCGAAGACG ACGCCATGGC CGTGGATGCC GAGCCAACGT CGACTTTCAC ACCGACGATG ACGCAGGGCG ACGCGCTGAA GGTGTCGCTG AGCGATTGCT TGGCGTGCAG CGGTTGCGTG ACGAGCGCGG AGAGCGTGCT GCTGGAACAA CAATCCGTGG ATGAGTTCGC ACAGGCGTGC GCGCGCGCGC GGAGCGACGG AACGAGCGTC GTCGTCGCGA GCGTCAGCCC GCAGTCGTTG ATGAGCCTGA GCGAGGCGTA TGGATTGGGA GTGGAGGAGA CGCGCGCGCG GCTTGGCGGG CTGTTGAAGG CGGGATTCGG CGCGGCGAGG GCGTTTGATA CGTCATTTAG TCGGGATATA GCGCTCGTGG AGACGTTTGC AGAGTTTACG GAGTGGATGC GAGACGGCGC GAGGACGCCG ATGTTGGCGA GTGCGTGTCC GGGGTGGGTG TGTTACGCGG AGAAGACGCA CGGCGAACTC GCGGTGCCGC ACATGGCAAC GACGAAGAGT CCGCAGCAAA TCATGGGAAG GTTTGTGAAG AGCGCGGTCG CGCGCGAACT TGGCGTACCA GCACATAACG TGTACCACGT GAGCGTGATG CCGTGCTACG ACAAAAAGCT CGAGGCGACT CGCGATGATT TCGAGAGCGA CGGTGTCAAG GATGTCGACG TCGTGCTCAC GACGGGCGAG GTGGCTTTGT TATTAGAAAA GGCTGGTTTG TGCCATTTGA GAGACGCGCC GGCAAATGAT TTTGACGCAT TCGTGAGCAC AAACGAACAA GCACCAGAAA GTGTGTGCGC AGCGCCGGCG GTATCGGGAT CTGGGGGATA CGCCGAGTAC GTTTTCCGGC GCGCGGCGGC GGAGTTGTTC AATGCTCCGA TAACTGGAGA GATTGACTGG GTCAAGATGC GCAACGCGGA CATGCGTGAG GCCACACTAA CGATCAATGG TGAAGCTGTT CTACGCGTGG CTGTCGCGTA TGGTTTCAGA AACATTCAAA ATCTTGTTCG AAGCATCAAA TTAAAAAAGA GCAAGCACCA TTTCGTCGAG ATAATGGCGT GTCCTTCGGG ATGCTTGAAT GGCGGCGGTC AAATCCCAGC GCGCGAGGGA ACTGCGAACA AAGAATTGAT CGACAGACTG GATGATACGT ATAGGGAAAA CGCACGCGCA CGACCGATGG CGGATGTGTC GACGCTCTAT CGCGAATGGA TCGGCGGAAA TCCAGGATCG TCAAACGCTC GCGAAGCGCT TCGAACGCAA TATCACATTC GCGCAAAATC CGTCGGAGTC GTCCAACTGA ACAGTTGGTA G
|
Protein sequence | MFSGAVKIAP DALNDFIAPS QDCVVALDGV ALRRRKPRED DAMAVDAEPT STFTPTMTQG DALKVSLSDC LACSGCVTSA ESVLLEQQSV DEFAQACARA RSDGTSVVVA SVSPQSLMSL SEAYGLGVEE TRARLGGLLK AGFGAARAFD TSFSRDIALV ETFAEFTEWM RDGARTPMLA SACPGWVCYA EKTHGELAVP HMATTKSPQQ IMGRFVKSAV ARELGVPAHN VYHVSVMPCY DKKLEATRDD FESDGVKDVD VVLTTGEVAL LLEKAGLCHL RDAPANDFDA FVSTNEQAPE SVCAAPAVSG SGGYAEYVFR RAAAELFNAP ITGEIDWVKM RNADMREATL TINGEAVLRV AVAYGFRNIQ NLVRSIKLKK SKHHFVEIMA CPSGCLNGGG QIPAREGTAN KELIDRLDDT YRENARARPM ADVSTLYREW IGGNPGSSNA REALRTQYHI RAKSVGVVQL NSW
|
| |