Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30221 |
Symbol | URE1 |
ID | 5000412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 736717 |
End bp | 739334 |
Gene Length | 2618 bp |
Protein Length | 840 aa |
Translation table | |
GC content | 56% |
IMG OID | 640415833 |
Product | urease |
Protein accession | XP_001416478 |
Protein GI | 145343758 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit [COG0831] Urea amidohydrolase (urease) gamma subunit [COG0832] Urea amidohydrolase (urease) beta subunit |
TIGRFAM ID | [TIGR00192] urease, beta subunit [TIGR00193] urease, gamma subunit [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTCGCGCG CGGAAGCGTC GCCGGGTCGC GGGCGCCAAC GCGGACGAAC ATATCAAAGC TCGCGGCGTC GTCGGCGAAA GAAGGCGCGC GAACGATGCG TCTCACGCCG TGTGAGATCG ATAAACTGAA CATTCTCTTC GCCGCGGGAC GCGTGGCGCA GCGAAGACTC GCGCGTGGAA TGCGTTTGAA CCACCCGGAG GCCGTGGCGT TGATCGCGAT GCAGTGCGTC GAGCTCGTGA GAGATCCAAA GTTCGATAAG GACGGGGAAC GGCGCGCGCT GACGTCGGTG GAGGTGCAGG ATTTGGGAAA GCGCGTGCTC GGACGACGAC ACGTGCTCGA CGGCGTCCCG GAACTCGTGG GCGACGTGCA GATAGAGACG ACGTTCGACG ATGGGACGAA GTTGATCACG ATTCACGATG CGATATGTGC GGATGACGTC GATCTCGAAC TCGCGCTGCT GGGATCGTTT TTACCGAAAC CGAGCGAGAA GGCGTTTCCT CCGGCGACAA AGGAGAAGAC ACCGGCGCCG GGGGAGGTTC GTACGTCGAA GGATTCTTTA GAGCTCCTGG CGGGAAGACC GAAACGAAAA CTGAAGGTAA CGAACACGTG TGATCGACCG ATTCAAGTGG GAAGCCACTT TCACCTCATC GAGGCGAATC GGTTTCTGAC GTTTGATCGC CGGCGAGCGT ACGGAGCGCG TTTGGCCATT CCGAGCGGCA CGGCGGTGCG ATTCGAGCCT GGAGAGTCGA AAACTGTGAA TATGGTGAAT ATCGCTGGTA ACCGGGTGGT CAAAGGTGGG AACAATTTGG TGAATGGACC CGCGACCGCG GATCGGTTGG ACGAAGTCAT GAAGCGGGTG ATCGACGGCG GCTTCGGGCA CGTCGACGCT GACGATTTGG GCGAAGGTGA GCCGTTGATG ATTCCGAGAC ACAAGTACGC GCACATGTAC GGACCCACAA TCGGTGATCG CGTACGACTG GGAGATACGA ACCTTTACAT CACCCCCGAG CGCGACTTGA CGATGAAAGG TGAAGAATCC AAATTTGGCG GCGGTAAGAC GCTTCGCGAG GGAATGTCGC AGCAGGCGGG TGTCGGCGAC GCTGATTCTC TCGACACAAT CATCACGAAC GCACTGATAG TCGATCATAG TGGTATTTAT AAGGCTGATA TCGGCATTAA AGATGGTCAC ATCGTTGGTA TAGGTCACGG TGGTAATCCA GACGTCGCTG ACGTGACCCC TGGGATGGTT GTCGGCGTGA ATACCGAAGC CATTGCCGCG GAAGGATGTA TCGTCACCGC GGGAGCACTG GACACGCACA TTCACTACAT TTGTCCGCAA CTGTGCACTG AGGCTGTCGC TAGCGGCATC ACAACCCTCC TTGGCGGCGG TACAGGACCA GCGTCGGGAA CGTGCGCGAC CACGTGTACG CCCTCTGCTG CGCACATGCA GTTCATGCTT GAGACGACGG ATGCGCTACC ACTCAATTTC GCGTTCACTG GCAAGGGAAA CACGGCGTCG CCCGAAGGCT TGCACGAAAT CATAAAAGCT GGTGCTGTCG GTATGAAACT ACACGAGGAT TGGGGCACAA CACCGGCTGC GATCGACAAC TGTCTCACCA TTGCAGAAGA GTACGACGTC GCCGTCACCA TTCACACGGA TACGCTCAAC GAATCATGCT GCGTCGAAAA ATCCATTGAG GCTTTCAAAG GTCGCACTAT TCATACGTAT CACTCTGAAG GTGCTGGTGG TGGCCATGCG CCGGATATCA TCAAGGTTTG TGGTGAAAAA ATGGTCCTTC CTTCATCCAC GAACCCAACA CGACCATACA CAAAAAATAC CGTGGACGAG CATCTGGATA TGCTCATGGT GTGCCATCAT CTCGATCCCG AGATCCCGGA AGACGTTGCG TTTGCTGAAT CACGCATTCG TGCAGAAACT ATCGCGGCCG AAGATGTTCT TCACGATATG GGTGCGCTTA GTATCATGGC GAGCGACTCG CAAGCCATGG GACGGGTCGG AGAAGTCATT CGCCGCACGT GGCAAACCGC ACACTCTAAT AAAGAACAAC GAGGTTTTCT TGAAGAAGAT GCAAACTCGG GTGCGGATAA CGTTCGGGTG AAACGGTATG TAGCGAAGTA TACCATCAAT CCTGCCATAG CTCATGGCAT GTCGCACAAG GTCGGCAGCC TAGAAGTGGG AAAATTCGCC GACGTCGTCA TTTGGAAGCC AGCTTTCTTC GGCGCCAAGC CAGAAATCGT CGTTAAAGGC GGACAAATCG CGTGGGCGCA GATGGGTGAT CCAAATGCAA GTATTCCGAC TCCTGAGCCT GTCATCATGC GAGGAATGTT CGGTGCTCTC AAACCGGGGA AGACTTGCAT AGCGTTCGTG AGCGCCGCGG CCGCCGCCGC GGATGTTGGC GCCGAGTACG GTCTCAATAA ACGCGTCGAG GCTGTGGTCA AATGCAGAGG CCTTAGCAAA GACGATATGG TTTTGAATAA CGCGTGCCCG AGGATAGAAG TCGATCCAGA AACATATGAG GTTCGAGCAG ACGGTGTCGT GTTGAAAAGT CAGCCTGCGC AAGAGCTGCC CCTCGCGAGA AGATACTTTA TCGTGTGA
|
Protein sequence | MRLTPCEIDK LNILFAAGRV AQRRLARGMR LNHPEAVALI AMQCVELVRD PKFDKDGERR ALTSVEVQDL GKRVLGRRHV LDGVPELVGD VQIETTFDDG TKLITIHDAI CADDVDLELA LLGSFLPKPS EKAFPPATKE KTPAPGEVRT SKDSLELLAG RPKRKLKVTN TCDRPIQVGS HFHLIEANRF LTFDRRRAYG ARLAIPSGTA VRFEPGESKT VNMVNIAGNR VVKGGNNLVN GPATADRLDE VMKRVIDGGF GHVDADDLGE GEPLMIPRHK YAHMYGPTIG DRVRLGDTNL YITPERDLTM KGEESKFGGG KTLREGMSQQ AGVGDADSLD TIITNALIVD HSGIYKADIG IKDGHIVGIG HGGNPDVADV TPGMVVGVNT EAIAAEGCIV TAGALDTHIH YICPQLCTEA VASGITTLLG GGTGPASGTC ATTCTPSAAH MQFMLETTDA LPLNFAFTGK GNTASPEGLH EIIKAGAVGM KLHEDWGTTP AAIDNCLTIA EEYDVAVTIH TDTLNESCCV EKSIEAFKGR TIHTYHSEGA GGGHAPDIIK VCGEKMVLPS STNPTRPYTK NTVDEHLDML MVCHHLDPEI PEDVAFAESR IRAETIAAED VLHDMGALSI MASDSQAMGR VGEVIRRTWQ TAHSNKEQRG FLEEDANSGA DNVRVKRYVA KYTINPAIAH GMSHKVGSLE VGKFADVVIW KPAFFGAKPE IVVKGGQIAW AQMGDPNASI PTPEPVIMRG MFGALKPGKT CIAFVSAAAA AADVGAEYGL NKRVEAVVKC RGLSKDDMVL NNACPRIEVD PETYEVRADG VVLKSQPAQE LPLARRYFIV
|
| |