Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_46561 |
Symbol | |
ID | 5003823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 483918 |
End bp | 485659 |
Gene Length | 1742 bp |
Protein Length | 500 aa |
Translation table | |
GC content | 51% |
IMG OID | 640419244 |
Product | predicted protein |
Protein accession | XP_001419612 |
Protein GI | 145350438 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02167] bacterial surface protein 26-residue repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0924573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.560204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTGGTGACT GGAACACGAG CCAAGTCATG GACATGCGTT TTATGTTCGC TACTGCTTAT GCTTTCGATA GTCCTATCCG CAACTGGGAC ATCAGCCAAG TGACGAACAC GAGTTACATG TTCTATTCTA TTGCTGCATC TTTCAATGTT TTCAACCAAT CCGTCGAGAA CTGGAACACG AGCCAAGTTA TGGACATGCG TGGGATGTTC GGGAACGCAC GTTACTTCAA CAAGCCTATC GGCGGCTGGA ACACGAGCCA GGTTACGGAC ATGCGTAACA TGTTCGTTAC CGCCTCTTCC TTCAACCAGC CCATCGGTGA CTGGGACACG AGCCAGGTTA CGGACATGCG TTACATGTTC GCTACCGCCT CTTCCTTCAA CCAGCCCATC GGTGACTGGG ACACGAGCCA GGTTACAGAC ATGCGTGGAA TGTTGGGGTC TGTGTCCTTC AATCAGCCCA TCGGGAACTG GAACACGAGC CAGGTTACAG ACATGCGTAC GATGTTCGCT ACCGCCTCTT CCTTCAACGC GCCTATCGGT GACTGGAACA CGAGCCAAGT TACAGACATG GGTAGAATGT TTTACGCAGC CTCTTCTTTT AACCGGTCTA TTGGCGACTG GAACACGAGC CAAGTCACGG ACATGCAGTC GATGTTTTAC GCCTCTTCAT TCGATCAACC CATCGGGAAC TGGGACACAA GCCTAGTCAC ATACATGAAT TCGATGTTCA ACAGCGCAGT TTCCTTCAAC CAGTCTATCG GTGGCTGGAA CACGAGCCGA GTGGTGAGCA TGCAGTCGAT GTTCGCGAAC GCATTTTCCT TCAACCAGCC CATCGGGAAC TGGAACACGG GCCGAGTGGT GAGCATGCAT TCCATGTTTG ACAGCGCAGT TTATTTCAAC CAGCCCATCG GGAACTGGAA CACGGGCCGA GTGGTGAGCA TGAATTCCAT GTTTGACAGC GCAGGCTTTT TCAACCAGCC CATCGGGAAC TGGGACACGA GCCAGGTTAC AGACATGAAT TCCATGTTTG ATAGCGCAGG CTCTTTCAAC CAGCCCATCG GGAACTGGAA CACGAGCCAG GTTACAGACA TGAATTCCAT GTTTGATAGC GCAGGCTCTT TCAACCAGCC CATCGGTGAC TGGGACACGA GCCAGGTTAC AGACATGCAT TGGATGTTCC AGGGTGCCTC TTCCTTCAAC CAGCCTATTG GAGACTGGGA TACGAGCCAA GTGGTGACCT CGTACTCATT TAGCGAAATG TTCAAGGACG CAACGGCGTG GCTATGCTCA CACACACACA ATACGTCGGA TCCGATGCAT TTTCAAAGAT ACTACGATGG TCCACCGTCT TTTTGGTTCG AGTCAGCGGC TGGAATCGGT GCAGTGTGCT TGTCTCCACC ACCGCCAACC ACACTACCGT GTGTATATGT CGCCGCCCCT CCGTTGACCG CAAGGTCGAC TGGAACCGCG AGGTCGAGTG TCGCACTCTC GCTATATACG ACAGTCATTA TCATATACTT CCACGCCTGA AACTGAGACA GAAACTGAGA CATCGCACAT TCACAAATCC TACAAGGATA TGATACCGTA GTCATAGTGG ATATATACAG TGCTGCTCTC GCATTTCCAG TTCCTACCTG ATGAATACTG GATTATGGGG TGATTTTCAA ACAAGTGTAG AGTACTAGCC TCCATTTCTC GCTTGTTCTT GAAATACAAC TTGAATTACA ACTCGAGGCA AA
|
Protein sequence | MDMRFMFATA YAFDSPIRNW DISQVTNTSY MFYSIAASFN VFNQSVENWN TSQVMDMRGM FGNARYFNKP IGGWNTSQVT DMRNMFVTAS SFNQPIGDWD TSQVTDMRYM FATASSFNQP IGDWDTSQVT DMRGMLGSVS FNQPIGNWNT SQVTDMRTMF ATASSFNAPI GDWNTSQVTD MGRMFYAASS FNRSIGDWNT SQVTDMQSMF YASSFDQPIG NWDTSLVTYM NSMFNSAVSF NQSIGGWNTS RVVSMQSMFA NAFSFNQPIG NWNTGRVVSM HSMFDSAVYF NQPIGNWNTG RVVSMNSMFD SAGFFNQPIG NWDTSQVTDM NSMFDSAGSF NQPIGNWNTS QVTDMNSMFD SAGSFNQPIG DWDTSQVTDM HWMFQGASSF NQPIGDWDTS QVVTSYSFSE MFKDATAWLC SHTHNTSDPM HFQRYYDGPP SFWFESAAGI GAVCLSPPPP TTLPCVYVAA PPLTARSTGT ARSSVALSLY TTVIIIYFHA
|
| |