Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_13980 |
Symbol | |
ID | 4999608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 983587 |
End bp | 984876 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 63% |
IMG OID | 640415029 |
Product | predicted protein |
Protein accession | XP_001415655 |
Protein GI | 145341104 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.332062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGACGT ACGTCGAGCG CGATCTCACC GCGGAAGAGC GCGTGGAGGT GATGCGTCGA CCGCGGGTGG ACTTTACGAG CATCCTGGAC ACGGTGAAGC CGATCGTGGA GGCGGTGGGA ACGCGCGGGG ACGCGGCGGT GCGCGAGTAC ACGAGCAAGT TCGACGGCGT CGATTTGGAA GCGGTGACGG TGCGCGTGGA TGAATTGCCC GATCCCGTGC TGGATGATGA TGTGAAGAAG GCGTTTGACG TGGCGTACGA CAATATCGCG GCGTTTCACG CGGCGCAGGC GAAGAGCGGG GACGTGGACG TGACGACGAT GCCAGGGGTG CGCTGTCGAC GCGTGTCGAG ACCGATCGGA GCGGTCGGAC TGTACGTCCC GGGCGGCACC GCGGTGTTAC CGAGCACGGC GCTCATGCTC GCGGTGCCGG CGAAGATTGC CGGGTGCGAA CGCGTCGTGC TCGCGACGCC GCCGCGCAAG GATGGTTCCA TCGTTCCGGA GGTGCTGTAC GTGGCGAAAA AGGCGGGGGT GACGCACATT CTCAAGGCTG GCGGCGCGCA AGCGATCGCG GCGATGGGAT TCGGCACGGA AACGTGCCCG AAGGTGGATA AGCTCTTCGG TCCGGGGAAT CAGTTTGTGA CGGCGGCGAA GATGTCGCTG CAAAACTCAG ACGCGATGGT TTCCATCGAC ATGCCCGCGG GCCCGAGTGA GGTGCTCGTC ATCGCGGATA AAGACGCCCC CGCCGCCCAC GTCGCCGCCG ATCTCTTGTC TCAAGCCGAG CACGGGCCGG ACAGCCAGGT GGTTTTGGTG ACGTTCCCGG ACGTGGATTT GAAGGCTATC ACGGACGCCG TGGCGGACCA AGCGAGCAAG CTCCCGCGCG CCGAAATCAC CGCCAAGGCG CTCGGCCACT CCTACGCCGT CGTCGTTGAC GACATGGCGG CCGCGTGCGA TTTTAGCAAT CGATACGCTC CCGAGCACTT GATCGTAAAC GTCGAAAATG CGGAATCATG GTTACCGCAG TTAGACAACG CCGGTTCCAT CTTCCTCGGG CGCTGGACCC CAGAATCCGT GGGCGATTAC GCCAGCGGCA CCAACCACGT TTTACCCACC TATGGCTACT CTCGCATGTA CTCCGGCGTA TCCTTGGACT CTTTCGTTAA GTACATGACG GTTCAAGAGC TCACCAAGGA AGGTTTAGAC GCGCTCGGTC CTCACGTCGC CAGGATGGCG GCGGTCGAAG GTTTGGACGC GCACAGATCC GCCGTGACTT TACGCTTAGG CATCGACTAG
|
Protein sequence | MRTYVERDLT AEERVEVMRR PRVDFTSILD TVKPIVEAVG TRGDAAVREY TSKFDGVDLE AVTVRVDELP DPVLDDDVKK AFDVAYDNIA AFHAAQAKSG DVDVTTMPGV RCRRVSRPIG AVGLYVPGGT AVLPSTALML AVPAKIAGCE RVVLATPPRK DGSIVPEVLY VAKKAGVTHI LKAGGAQAIA AMGFGTETCP KVDKLFGPGN QFVTAAKMSL QNSDAMVSID MPAGPSEVLV IADKDAPAAH VAADLLSQAE HGPDSQVVLV TFPDVDLKAI TDAVADQASK LPRAEITAKA LGHSYAVVVD DMAAACDFSN RYAPEHLIVN VENAESWLPQ LDNAGSIFLG RWTPESVGDY ASGTNHVLPT YGYSRMYSGV SLDSFVKYMT VQELTKEGLD ALGPHVARMA AVEGLDAHRS AVTLRLGID
|
| |