Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18182 |
Symbol | |
ID | 5005412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 582870 |
End bp | 585119 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420833 |
Product | predicted protein |
Protein accession | XP_001421509 |
Protein GI | 145354473 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.28956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00382688 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACGACG ACGCGCCGAT CGCGGACGTC GGGACGACGC GGCGAAACGC GGCGAAAACG ATCGAAGAGG CGATGGCGAG CTCGACGATC GCGCGGTTGG CGTGGAGCGG GTGGACGCAC GTGACGAGAC GACGAAGGGA GACGAACGCG CGCGCGACGG CGCTGTGGCG GGATCGGTCG CGACGAACGA GCGCGCGAAC GCTGATCGCG TGGGCGCGAA TCACGAAGTT TAACGCGAAA CGACGGGAGA TGACGCTGGA GCGATCGAGG GCGCAGCGGG AACACGCGAA AAAGTTGAGG ACGCTGAGGG AGTGGAAGGC GAGCGTGCGA CGGACGAGGG AGGCGAGACG CGACGTGATT CACAAGTATT TGGTGTGGCG GGCGAAGAAG TTCTTGAGGC ATTGGCGACA GGCGACGGAG GAGAGCGTTG AGGCGCGAGG GGGACGGAAG CGCGCGGTGG AACCCGCGGC GGCGGCGGAG CCGAGCGCGG CGCCGACCAA GGCGGCGGCG GTTCGCCGCA GGGAACGACA CGCCACCGTG AGCGTGGGCG ACATGAAGAG CAGATTGTGG ACCAAGGCTG ACCCCGCGGC GGTGCCGATG CCGGTGTCAC ACGTCGTGGA AGAGATTGAA GCCGCGGTGA TGGTCGAACG AACGACGTCG GGGACGTCGA CGACGTCCAC GACAACGACG TCCGCGCCTC AAACGCAGCG CGCGCAAGTT CGACGACGGA AAGTCGTCGT GCGCGAGTCC GGAGAAAATG AAGCGGAGGA AACGGTAGTC GTGACGAAGA AGAGTGCCGA TGCGAAGAAG ATGACGAAGA CGAAGACGAC GAAGCTGGTG GACGCCACGA CGATGACGCC GGTTGCGAGC GCGGTCTCGC AGCCGTCGCC GCGAACGTCA TCGTCATCGT CGCAGCACAT GTCGACCATC ATTATAGTCG TCGTACTGTT GGCGATTTTC GGCGCCATCG CGCTCGCGTC CATCTTGACT GGCGTCTCGC CTTCCTTTGT CGGCGCCGGG GCGACCTCCG TCGTGCGCCA GCATCTCTCC GAGGCGCAAC GAAACGTGAC GATTTTAGCG CGTGCGTCGG CGATGCAGCG AAGAGAATTA GAGGCGTGTC GTACGTTCGG TGGCGGCGTG CCCGACGCGA GTCAAAACGC CGCAATCGCT TCCGCGGAGG CCAAGGTTTC AGATTCCGTC GTCCAAGTTG CCGATTTAGA GGTGAAACTC ACCAAGGCTA AAGCCGACGT CGCCGCGGCG CAGAGCGCAC AAAAGTTAGC CGAAGACCGA TTCAATCGCT TAGGTTCGTC AGCGATGACT GTGGCGCAGT TACAAGAGTC GTTACAAAAG ATGACGACGC GAGCGACGTA TTGCGAGTCC ACGTCGGCGC AAGCGTTGAA GCTCAAAGCC GCCAAAGACA TCGCGACGGC GCGCGCCGAG AGTGCGGAGA AAGACGCCGC GGATACGAAG TCGAAGTTGA CCAAGACTAA GACGCACTTG TTGGCGGCGC AGCGCTCGTT GACGCAAACC GTGGATGTGG CGAATGAGTG CAGGTACCAA ATCGGGAAGC CCGCGTACGT GCCGTCGTAT CACTCCAACT CGTCCCTGCT CCGCGTCATT GCGGACATGT TCCCCGCGTT TTCGTGGTTC TTTTCATCCT CGGCGTTGTT GTTTTACGTT GTGTGCTCGT ATGCGTACTA TTTGCGCGCG ATGGTGGCAT CATTAGTTGG TGAAAGAGAT ACGCTCGTCG GTGAATTGGC GAAAATTCGC ACGCGAGGCG TGGCGCCAAA CCGGCCGCAT CTCGAGCACA CCAGGGTGAT GCACGAGCAT AGCGCGGAAA GTAGCGGTGC TGACGATGAT TCGCCATCAG TCGGTAAGAC GTCGTCGTTC GATCGGAAGC GTTCCCGCAG TGTGAGCGAC ATCGGAAAAG ATGGCGCGGA GACGTCGCAA AAGAAGACCG CGACGGCGAG CGACGTCAAG GATAACGACG AAGAGAACGA GTCCGCAGCG ACGAGAGTGG AGGAAGTCAC AAGCGCTGAA GATTTGAGCA AGGAGCAAAC AGAGCGCGCG CACAAACTCA TCGAAAGGTG GGCTGAGAAG TCAGCCGACG ACGAGATTAC GGGCGAACTC GATCGCATCC GTCGACTCGT CGAATTGGAA GACGAGCGCG AGCGATTAGA GGAGGAGTCG CAGCTCGAGC GGCTTAAGCG CCAGATTGAG TTATCTGCAA ATGATACGCC GTGGCTTTAA
|
Protein sequence | MNDDAPIADV GTTRRNAAKT IEEAMASSTI ARLAWSGWTH VTRRRRETNA RATALWRDRS RRTSARTLIA WARITKFNAK RREMTLERSR AQREHAKKLR TLREWKASVR RTREARRDVI HKYLVWRAKK FLRHWRQATE ESVEARGGRK RAVEPAAAAE PSAAPTKAAA VRRRERHATV SVGDMKSRLW TKADPAAVPM PVSHVVEEIE AAVMVERTTS GTSTTSTTTT SAPQTQRAQV RRRKVVVRES GENEAEETVV VTKKSADAKK MTKTKTTKLV DATTMTPVAS AVSQPSPRTS SSSSQHMSTI IIVVVLLAIF GAIALASILT GVSPSFVGAG ATSVVRQHLS EAQRNVTILA RASAMQRREL EACRTFGGGV PDASQNAAIA SAEAKVSDSV VQVADLEVKL TKAKADVAAA QSAQKLAEDR FNRLGSSAMT VAQLQESLQK MTTRATYCES TSAQALKLKA AKDIATARAE SAEKDAADTK SKLTKTKTHL LAAQRSLTQT VDVANECRYQ IGKPAYVPSY HSNSSLLRVI ADMFPAFSWF FSSSALLFYV VCSYAYYLRA MVASLVGERD TLVGELAKIR TRGVAPNRPH LEHTRVMHEH SAESSGADDD SPSVGKTSSF DRKRSRSVSD IGKDGAETSQ KKTATASDVK DNDEENESAA TRVEEVTSAE DLSKEQTERA HKLIERWAEK SADDEITGEL DRIRRLVELE DERERLEEES QLERLKRQIE LSANDTPWL
|
| |