Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_44322 |
Symbol | |
ID | 5004390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 419949 |
End bp | 421616 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 58% |
IMG OID | 640419811 |
Product | predicted protein |
Protein accession | XP_001420333 |
Protein GI | 145351972 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00480829 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0297324 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGCG AGGCGCGGAC GTACGCCACG GCGTGGGCGA ACGAACTGGC GAGCACGCTG TCGGAGGACG CGAGGGCGCT GGAGCGAAGG CTGTTCGGCG ACGAAGGCGG CGCGAGCGAG CGCGGCGAGG ACGAAGACGA AGACGCGAGC GAGGAAGAGA CGGTCGGGAC GAGCGGGAGG AAGGGGAAGG CGCGCGCGGA CGCGAGAGTG CGAAGGAAGA CGCGACGAGA TTGGAGTAAA GTGCCGGACG AGGCGCTGCC AAGAGTCGCC GTCGTGGGAC GGCCAAACGT GGGGAAGAGC GCGTTGTTTA ATAGACTCAC TGGAACGAAA CGGGCGATCG TGTACGACGA GCCGGGGGTG ACGCGAGATC GGATGTATGT GCGCGCGTAT TGGGGCGAAC ACGAGTTCAT GATGGTGGAC ACGGGCGGGT TGGAAAATTT ACCGGCCAAT CCCGAGGGTG GGCCGAAGAC GGACACCGTC GGCGGCGTCG AAATCTTACC TGGGATGATC GAGGCGCAGG CCGCGGAGGC TGTGCGAGAA GCGTCTGTGC TCATTTTTGT CGTCGACGGT CAAGTTGGGC TGACGGCGGC GGATATGGAT ATTTTCGCCT GGTTGCGTCG CACGCACTCG AAAATACCGC TCCATCTGGC GGTGAACAAG TGCGAGAGCA CGACGAAGGG GGAGGACCAA ATCCTCGAGT TTTGGTCTTT AGGCGACGTT ACCCCACTCG CTGTTTCCGC CATCAGCGGC ACCGGGACCG GGGAGCTTTT GGATAACATG TGCGCGACTT TGCCGCCGCC TCCGCAAGTC TCCGAGGATG AAAACGCAGA GGAGGAAGAT ATTCCCGTCA CAGTCGCCAT CATCGGCCGA CCGAACGTGG GGAAGAGTTC GTTGTTGAAT GGTTTGGCCG GTGAAGCGCG ATCGATCGTG AGCGACTTTT CTGGTACCAC ACGAGACAGC ATCGATACCT TGGTCGAGGA CAAGTACACG GGGCGAAAGT TTACACTCAT CGATACGGCG GGCATTCGGC GACGAACTCA AGTGAAATCC GGCACGGATG GAGCCGAAAA ACTCAGCGTC GGTCGCGCTT TGCAAGCGAT GAAGCGTGCA GACGTAGTGG TGCTAGTTAT AGATGGTACT GAAGGACCGA GCCAACAGGA TTTTGTACTC GCCGAGCGCG CCACGCAAGA GGGCTGCGCC ATCGTGTTGT GCATCAACAA GTGGGATTTG GTGGACAAGG ATACGCACAC GATGAACAAG TATACGGACG ATATGCGATT AAAGTTGCGT GTGTTTGAAT ATGCCGAAAT TGTGTACACG TCTGCGCTCA CTGGACAGAG AATCCAAAAG ATTTTAGACG CCGCGCAGGT GGCGAGCGAG AATCACCGCA AACGTCTCAC CACGGCGACG CTCAACTCTG TCGTACAAGA GGCGACTTTG TGGAAGCTCC CGCCGTCGAG AAACAGCAGA AAGGGTAAGA TTTATTACAT CACGCAAGCG TCGATACGTC CACCGACGTT CGTCTTTTTC GTGAACGACC CCAAGCTGTT TCCGGACACG TACCGTCGGT ACATGGAGCG CCAACTGAGA GAGAACATCG GCTTCCCGGG AACTCCGATT CGCTTGCTGT GGAGAGGAAA GGGCGTCGAA AAAGGGCCGA GGAAATAA
|
Protein sequence | MDREARTYAT AWANELASTL SEDARALERR LFGDEGGASE RGEDEDEDAS EEETVGTSGR KGKARADARV RRKTRRDWSK VPDEALPRVA VVGRPNVGKS ALFNRLTGTK RAIVYDEPGV TRDRMYVRAY WGEHEFMMVD TGGLENLPAN PEGGPKTDTV GGVEILPGMI EAQAAEAVRE ASVLIFVVDG QVGLTAADMD IFAWLRRTHS KIPLHLAVNK CESTTKGEDQ ILEFWSLGDV TPLAVSAISG TGTGELLDNM CATLPPPPQV SEDENAEEED IPVTVAIIGR PNVGKSSLLN GLAGEARSIV SDFSGTTRDS IDTLVEDKYT GRKFTLIDTA GIRRRTQVKS GTDGAEKLSV GRALQAMKRA DVVVLVIDGT EGPSQQDFVL AERATQEGCA IVLCINKWDL VDKDTHTMNK YTDDMRLKLR VFEYAEIVYT SALTGQRIQK ILDAAQVASE NHRKRLTTAT LNSVVQEATL WKLPPSRNSR KGKIYYITQA SIRPPTFVFF VNDPKLFPDT YRRYMERQLR ENIGFPGTPI RLLWRGKGVE KGPRK
|
| |