Gene Information |
Locus tag | OSTLU_46292 |
Symbol | |
ID | 5003495 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 659094 |
End bp | 661830 |
Gene Length | 2737 bp |
Protein Length | 842 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418916 |
Product | F-ATPase family transporter: protons (vacuolar) |
Protein accession | XP_001419233 |
Protein GI | 145349634 |
COG category | [C] Energy production and conversion |
COG ID | [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I |
TIGRFAM ID | |
|
|
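The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, under two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the coding sequence includes one stop codon. The helper names `span_length` and `coding_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 659094
end_bp = 661830
protein_length_aa = 842


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (assumption: one stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)
non_coding = gene_length - cds_length

print(gene_length, cds_length, non_coding)  # 2737 2529 208
```

Under these assumptions the span matches the reported Gene Length of 2737 bp, and roughly 208 bp of the span does not code for the 842 aa protein, which is consistent with "Is gene spliced | Yes". How those bases divide between introns and other untranslated sequence cannot be determined from this record alone.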
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTGT TTCGGAGCGA GCGAATGTCG CTCGCGCGCG TCATCGTGCC CGAGGAAGCC GCGCGGGACA CGATCGAACG CGTGGGCGAG CTCGGGGTGA TGCAGTTTCA AGATCTGAAC TCGGACACGC CGGCGTTTAA GCGCGCGTAC TCGACGCAGA TACGACGCGC GGACGAACTG CTGCGACGGT TGCGGTACTT TCGAGACGAA GCGCGACGAG CGACGATCGC GGTGGCGCGA TCGAGACGGA GAAACGCGAC GGGACGCGGG AGCGGGGCGA CGACGACGAC GACGGACGAA CTGGACCACG TCACGGAGGA GCTCGAACGC GATCTCGCGC AAGCGCTGAA GAATTACGAG AGGCTGATGA GGACGCACAG CGAGTTGATG GAGCTGCAGT TGGTGCTGGA GAAGGCGGGG GGGATATTTG AGGAGAGTCG AGCGGACGGA GGAGGGCGCG GCGGCGGCGA CGTGGATTTT CGCGGGACGC GGTCGTACGA CGACGTCGAG GGGGGGGGCG AAAGCTTGCT GCCTGTCGCG AGTGTCGGCA TGGGCGCGGC GACGGAGCGG CGCGGTGAGA TGGGATCGAG GTCGGGGTCG TATTTAGAGA TGGCGGAATT GGACGCCGCG GGCTCGAGCG GGCGAAGCGG CGACGGCGCG TCCGCGTCGT CCAACAGCGC CGCGGGCGCG TCGGCGGTGC GACTCGGATT CATCACTGGA GTTATCCTCA CGAATAAGGT GATCTCGTTC GAGCGGATTT TGTTTCGCGC GACGCGAGGG AACATGTTTT TGAAGCAGTC GCAAATCCTG GGCACCGTCG TCGACCCCAC GACGGGCGAA AAGTGCGAAA AGACCGTGTG CGTGGTGTTT TTCGCGGGTG AGCGAGCGAG AGAGAAAATC ATCAAAATCT GTGAGGCGTT CAACGTGAAT CGGTATCCGT TCCCTGAGGA TTACACACGT CAGCGACAAA TGTACGCCGA GTGCACGGCG CGTCTCGTCG AGTTGCAGTC AACGCTCGAC GCTTCCACGC AGCATCGCGA CGATGTTTTG CGTAAAGTGG GCGACAGTCT GGAAGATTGG ATTCAAATAG TTCTGCGCGA GAAGGCTATT TATCACACGA TGAGCATGTG CTCGGTCGAC GTCACACGAA AGGTGCTCGT GGCTCAAGCT TGGATTCCTG ACTACGCGCT GTCCTCCGTG CAGACCGCCT TGACGGATGC GAATCATTCC TCGCTCGCTT CCGTTGGGAC GATTTTTCAG CAAATAGAAA CCAAGGAATC CCCGCCGACG CACTTTCAGA CGAACAAAGT CACCTCAGTC TTCCAAGGCA TCGTCGACGC GTACGGCGTC GCGAGCTATC GCGAGGTGAA TCCCACGGTG TTCACCATCG TGACTTTCCC ATTCCTATTC GCCGTCATGT TCGGTGATTT TGGCCATGGA TTTCTCATGC TCTTCGCCGC GCTATATTTG GTGATGAACG AAAAGAAGCT CGCGGCGTCG GGACTGAACG AAATCATCCA GATGGCGTTC GACGGTCGAT ACGCCATCCT ACTCATGTCC ATTTTTAGTA TTTACACTGG TTTACTCTAC AACGAATGTT TCTCTGTACC AATGAATTGG TTCGGTGCGA GTAAGTATGT GTGCGATCCA AATGACCCGA CGGCGTCTAC GACGTGCGAT TCGGCGTATA AGACCGGCCT GGTGAATAAC GGCGACGGCG CGTACGCCTT TGGCGTCGAC CCCATCTGGC ACGGTTCGCG TTCGGAATTG CCGTTTTTGA ACTCGCTGAA GATGAAAATG 
TCTATTTTGA TGGGTGTGAC GCAGATGATG CTCGGAATCT TCATGTCATT CCTAAATCAG GTGTACACAA ACGACAAGCT CTCGATGTAT TGCGAGTTTT TCCCGCAAGT CATCTTCCTC GGCGCGCTCT TCGGATACCT GTCGTTGCTG ATCCTCATCA AGTGGTGCAC GCCTGGTTCG ACTGCCGATT TGTATCACGT TATGATATAC ATGTTCCTTT CCCCGGGCAA CGTCGATTGC GCCGGCGAAG GCGAAAACGG CGGTCCCGGT TGCCCCGAAA ACGTCTTGTT CCCAGGGCAA GCGGGCTTTC AAAATTTTTT GTTGTTTCTC GCCTTTGTGG CGGTGCCGGT GATGCTGTTT CCGAAACCGT ACATTTTGAA GAAGCGACAC GAAGCGTCCC GAGGCGGGGT TCGGCGAGGC GGCGTGCGAT ACGCGCGGCT TGATGCAGAA GATGACGACG ACGAGGCGTT TTTGCAAGCG TCAGATGCGG AGAACAGCTC ACCGTCGGCT GAAGAGGAAG AAGAATTTGA TTTCGGCGAA ATCATGGTGC ATCAAGGGAT CCATACCATT GAATTCGTTC TCGGCGCTGT GTCAAATACC GCGTCTTATC TTCGTCTTTG GGCGCTGTCA CTAGCGCATG CGCAACTCTC AGCCGTGTTT TGGGATCGCG TCTTCATGGG CGCCGTGGCG AGCGGAAACG TCGTCGCCAT CGTGATGGGT TTCGCCGTGT GGGCGTTCGC CACGATTGGG GTGCTCATGC TCATGGAATC TCTGTCCGCG TTTTTACACG CGTTGCGCTT GCACTGGGTC GAGTTCAACA ACAAGTTCTT CAAAGGAGCC GGCTATGCCT TCGTCCCGTT CACCTTCGTC GGTCTCAGCG ACAAGTCCGA CGACGCGTGA ACAGAACGGC AAATTAGCAT CATGATATGA TACGAAT
|
Protein sequence | MELFRSERMS LARVIVPEEA ARDTIERVGE LGVMQFQDLN SDTPAFKRAY STQIRRADEL LRRLRYFRDE ARRATIAVAR SRRRNATGRG SGATTTTTDE LDHVTEELER DLAQALKNYE RLMRTHSELM ELQLVLEKAG GIFEEKMAEL DAAGSSGRSG DGASASSNSA AGASAVRLGF ITGVILTNKV ISFERILFRA TRGNMFLKQS QILGTVVDPT TGEKCEKTVC VVFFAGERAR EKIIKICEAF NVNRYPFPED YTRQRQMYAE CTARLVELQS TLDASTQHRD DVLRKVGDSL EDWIQIVLRE KAIYHTMSMC SVDVTRKVLV AQAWIPDYAL SSVQTALTDA NHSSLASVGT IFQQIETKES PPTHFQTNKV TSVFQGIVDA YGVASYREVN PTVFTIVTFP FLFAVMFGDF GHGFLMLFAA LYLVMNEKKL AASGLNEIIQ MAFDGRYAIL LMSIFSIYTG LLYNECFSVP MNWFGASKYV CDPNDPTAST TCDSAYKTGL VNNGDGAYAF GVDPIWHGSR SELPFLNSLK MKMSILMGVT QMMLGIFMSF LNQVYTNDKL SMYCEFFPQV IFLGALFGYL SLLILIKWCT PGSTADLYHV MIYMFLSPGN VDCAGEGENG GPGCPENVLF PGQAGFQNFL LFLAFVAVPV MLFPKPYILK KRHEASRGGV RRGGVRYARL DAEDDDDEAF LQASDAENSS PSAEEEEEFD FGEIMVHQGI HTIEFVLGAV SNTASYLRLW ALSLAHAQLS AVFWDRVFMG AVASGNVVAI VMGFAVWAFA TIGVLMLMES LSAFLHALRL HWVEFNNKFF KGAGYAFVPF TFVGLSDKSD DA
|
| |