Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25099 |
Symbol | |
ID | 5004120 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 42351 |
End bp | 45199 |
Gene Length | 2849 bp |
Protein Length | 729 aa |
Translation table | |
GC content | 58% |
IMG OID | 640419541 |
Product | predicted protein |
Protein accession | XP_001420060 |
Protein GI | 145351383 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTCAGCGCG CGTCAGCGCG CGTCAGAGCG CGGCGTCCGA CGCGCTCCGA CGCGAACGCT CGTCGACGTT CTCGGCGCTT CGCGCGCCGC GTCGACGCGC GTCGACGCGC CGCGATCCGC GCGCGCGTCG TCGACGAGCG TCGGTCGTCG GAGACGCGAA TCTCGACATG GCGTGGGAGC GCGCGCGCGA AGACGACGGC CCGGGATCGG CGACGAGATG GTGCGAAAGC GCGGAGGCGA ACGCGCGTCG ACGGACGACG GGAAGGCGCG GCGAGGGGGC GCGGCGAGGG AAGGAGGCGG CGGGACGCGA ACGCGAAGGC GAACGCCGAC GCGGGAAGCG GCGGCGAGGG AACGACGCGG CGAAGCGAGC GATGCGGCGA TATCACGCCG AGCGACCGCA TCGGGCGTTG AACGTGTGCG ATAACGGTGA ACGCGAACGG ATGGAGCCGC ACGAGATGGA GGAGAAGCGA GACTTTGAGG AGAAGAGGGC GAGATTCGCG GCGAAATTCG CGGCGGGCGA GTTGAGTCCA GAGCCGGCGT ACAGGTACGA GGAGCCGGCG CGCGCGCGCG GGATTTGGGA CGGTAAAGGT GCGACGTTTG AGGATGCCAT CGTTAGCGGC GATCACGACC ACCACTCGAG CTCAGGGGCG AACTCGAGCG AGGAGACGGC GGTGCACTTG CGACACGAGA CGACGAGCGG ACGAACGCCA CCCCCGGTGA CGTCCAAGCC CTCGCGAACT GGGCCCGAAT CGATCAAACT GTGCGAGCTG CGCACGCACA AAAGCATGCG GAAAAACGAC ACGTCGACGC TGCCCACGGT GTACAGTTGC ATGAATCTAG GCGTGACGCC AGAGCAACTG TTGCAAGGCG AAGACAAAGC GTGGTGGAGC GAGTTCGGCA AGCAAGGGGC GATGGTGATC AAGATTAGCG ATGGGTGGGA CATCTTAGAC GATACGAGCG CGCTGTGTGA CTTGTCTTGG GTCGGCACCA AATCCATCAA GCTCCGCACC CTTGGACCCG ACTATCAGTA CATTAGACAA AACTTGCCGG ACAGAAAGAC GGGCCTGCTC GAGCCGTTTC GAGCGATGAA TGACGTGGAC TCCAACATGG AAATCGGCGC CTTTGTGGAA GATTTTCAAG AGCGATGTCG TACGCACGCC GATCAGTTTA GACTCATGGA CGTCAACGCT CGTGAATGCT GGTTCTTACA GCAAATTCAT TCTGGTTTTC CACTCCAGTT TCCATACCTT CAAGGCATCG CGATGGAGGC ACTCATGAAG AATGTTCGAG CCGTGGACGC GTCTGAAACG CCCCAAACCC ATCCTTGGGA TCCTCGATTA CTGGGTCAGA GAAAGCCAAG TGTGTTACGA CGGTGTTTGG AGCTGTGTGA AACCCAGAAC GTGGTGAATA AGAGGCTGTC TCTCGACGCC GCCCTGGACT GCGCAACGCC GCAGTCGGCG GAGGAGGCCC GAGCGGAGGA GAAAGGGCGA ACGGGCGGAG TGGTTACGAA AGTAGACCAA TCTGCCGTTC AACGGAGAAT CGAACGGCGA GAAGATAAGA AGAAGGGCAA AAAGCACAGC TCCGCTGGTA CTGTGATAGA AGCCGCCGTC TTAGAAGCCG CCGAGATGGC CACGGAGAAG GCGAGCGAGA CTCGACTCTG GGGCGTCGGC ATCGTGTCTC CTTGGCTTTA CTACATGGGT GTCGGGTCTA TTTTCCCCTT GCATTTCGAG GACTACGCGT TCGCTTCGGC AAATGTCATC TTGGCTCGTC CGGACTCGCA TTCGGCGGTC GTTTGGTATA GTATTCCTCG ATCCGATCTG TACTTGCTGC ACACGTATTT GCAAGAAACG CTCGGCGCGG AATACACGGT TGACATTCTC GAGATGCGCA GGCTTTGGCT CGATCCGGCG CGGATCCAGG ATTGGAATTC AAAGCGCGTG AATGGCGAAG AGAAGATCCA TGTTTATCGA CACGTTCAAA GAGCGGGCGA ATACGTCGTC ACCGACTACG GATCCGTTCA CTGGGGCGTG AATTTGGGCG ACGGCTGGAA AGCTGCGGTC AACTTTGCGT ACATGGACTG GAAACCCGCG GCGGAGGAAG TGAACGAAGT CTACAAGCGA CTCGAAAAAG AAACAGGAAT GTTTCGACAT CATCGCTGCT GCCCCAAGTT TGACGAATAT GTCGATTTCT GGGCCGATGA AAGAATTCGC CCCCCCCAAG GAGACGTGAC ACGATCGCGC CACGACTCAT GAAACGATGT GACATTTTTG TTGTAATGTA CATTATAATT CAACGCTACC GTCAGCTGAG GACTATCGCG AGGACTATCG CGGCGGCGGG ACCTTCGCAA TGGTATCGGG ATGATTTGGA TTGAAAACCG ATACGGGCGT ATAATCGACG TTGTACCCTT CTTTTTTCAG CTTAGCGACG AAGAACTCTG GAAATTCGCT CTCGAGATCA CTTCGCGTGT CTGGGACGTG AACGGGTTTG TCTGACAGGT ATAGCGGCTC TCTGAATGAG ATGTTCAACC AAGGCGTCGG GTCGAGTCTA CCGTCGATGA CGGTGCGAAG GAACGCCTGC CAAGTCCCTC GGCCAAGACT CAAATCCACT TCGTCTCGCA CCATTTGGGC CATCACAATA CGCCTGTGTC CCTTGACAGT GCTCAAGTCT ATCCAATTCG CATCATCCGC TTCTGCGATT ACTTTGATCC AATGTTTGAT CTGCTCGACG CCCTCGGTTC TCGCCAGTCG TTCCTCGATG TCGCGAGAAA CTGGAAGTAC CGCGACATGG CGATCTCGCG CTCGTTTACC GGGAGAGAAC CTCAAATCGA GCGATATGTG CTCTTTTGA
|
Protein sequence | MAWERAREDD GPGSATRWCE SAEANARRRT TGRRGEGARR GKEAAGRERE GERRRGKRRR GNDAAKRAMR RYHAERPHRA LNVCDNGERE RMEPHEMEEK RDFEEKRARF AAKFAAGELS PEPAYRYEEP ARARGIWDGK GATFEDAIVS GDHDHHSSSG ANSSEETAVH LRHETTSGRT PPPVTSKPSR TGPESIKLCE LRTHKSMRKN DTSTLPTVYS CMNLGVTPEQ LLQGEDKAWW SEFGKQGAMV IKISDGWDIL DDTSALCDLS WVGTKSIKLR TLGPDYQYIR QNLPDRKTGL LEPFRAMNDV DSNMEIGAFV EDFQERCRTH ADQFRLMDVN ARECWFLQQI HSGFPLQFPY LQGIAMEALM KNVRAVDASE TPQTHPWDPR LLGQRKPSVL RRCLELCETQ NVVNKRLSLD AALDCATPQS AEEARAEEKG RTGGVVTKVD QSAVQRRIER REDKKKGKKH SSAGTVIEAA VLEAAEMATE KASETRLWGV GIVSPWLYYM GVGSIFPLHF EDYAFASANV ILARPDSHSA VVWYSIPRSD LYLLHTYLQE TLGAEYTVDI LEMRRLWLDP ARIQDWNSKR VNGEEKIHVY RHVQRAGEYV VTDYGSVHWG VNLGDGWKAA VNFAYMDWKP AAEEVNEVYK RLEKETGMFR HHRCCPNAQV YPIRIIRFCD YFDPMFDLLD ALGSRQSFLD VARNWKYRDM AISRSFTGRE PQIERYVLF
|
| |