Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40179 |
Symbol | |
ID | 4999402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 459700 |
End bp | 460827 |
Gene Length | 1128 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 54% |
IMG OID | 640414823 |
Product | predicted protein |
Protein accession | XP_001415501 |
Protein GI | 145340791 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.10614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACTTA GCTCGCAAGA TTTCTTAGAT TTGACCCTCG ACGAAACCAT GCGATCTCAG TGCGCTGAAA CTATTCATCG CTATGGTCTC GGATCGTGCT CACCTCGTGG TTTTTATGGC ACATTCCGTC CGCACATGGA TTTGGAAGCC AAGATTGCGA AATTTCTCGG CGTGGGTGAA GCTGTGCTTT ACTCTTTCGG CGTTTGCACT GCGTCCAGTG TCATTCAAGC TTTAGCGTCA AAAAGTGATG TGGCTGTGGT CGATCGAGGC GTTGGACCGA GTATCATCGC TGGTTTACGT TTGGCGAAGC TCGAAATTAG ATGGTACAAT CACGCCGATC CTGCTGATGC GGCGCGTGTT TTTGCGCAAA TTGAAACCGA AGATGGTTCC ACGTCTGCGA GGCTCACCCG ACCTGTCAGA CGTCGATGGT TGATTACTGA AGCATGTTTT GGCTCCACCG GTCGATGTGC GCCTCTCCGT GAACTTGTAG CTTTGAAGGA TCATCACCAT GCACGAATGA TTCTCGATGA GTCGTTCTCC TTTGGCGCCA TGGGTGAAAG TGGTCGTGGT CTGATTGAAC ACGTTGGACT ACCCAGCAGC TCTGTTGATG TCATTTGTGC TTCGTTGGAG AACGCGTGCG CATCTGTGGG TGGTTTTGTG GCCGGAGATA CGGGAGTCGT GGCTTACCAA CGCTTGATGG GAAGTGGTTA CGTCTTCTCA GCGTCCTTAC CGCCCTACCT CGCGACGGCT TCTTTACACG CCATCAGCCG CATCGAAGCT GAACCGGCCA TGGTTGAAAA GCTTCATGAC GCGGCGCGAC GTACTCGCAG CGCACTCGTC AGTGGAGACA TTCCCGGTAT GACTACTGAT GCAGATGCCG ACTCGCCAGT CATCCCCGTC AAGCTCTCGG CCGGCGTTGG GAGCGGGGAC GAGAACATGC TTCTGCATCG CATCGCTGCT CGTATGCGAA GTAAAGGATT TGGTGTGTGC GTGGCTCGAG TCAGTCCTGT CATTTTACCG TCTCACCGCC CCCCGCCGTC CCTTCGTCTA TATGCGCACG CCAGTCACAC GGCGGACAAG ATTGACAAGA TGCTCACAGT GCTTCGAGAT GCTGCGTTGG ATATCCTC
|
Protein sequence | MVLSSQDFLD LTLDETMRSQ CAETIHRYGL GSCSPRGFYG TFRPHMDLEA KIAKFLGVGE AVLYSFGVCT ASSVIQALAS KSDVAVVDRG VGPSIIAGLR LAKLEIRWYN HADPADAARV FAQIETEDGS TSARLTRPVR RRWLITEACF GSTGRCAPLR ELVALKDHHH ARMILDESFS FGAMGESGRG LIEHVGLPSS SVDVICASLE NACASVGGFV AGDTGVVAYQ RLMGSGYVFS ASLPPYLATA SLHAISRIEA EPAMVEKLHD AARRTRSALV SGDIPGMTTD ADADSPVIPV KLSAGVGSGD ENMLLHRIAA RMRSKGFGVC VARVSPVILP SHRPPPSLRL YAHASHTADK IDKMLTVLRD AALDIL
|
| |