Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42103 |
Symbol | |
ID | 5006301 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 247482 |
End bp | 252113 |
Gene Length | 4632 bp |
Protein Length | 1226 aa |
Translation table | |
GC content | 52% |
IMG OID | 640421722 |
Product | ABC(ATP-binding) family transporter |
Protein accession | XP_001422137 |
Protein GI | 145355800 |
COG category | [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0166966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.134886 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGACG GAGGCAGTGC GTATAGCGTG ACGTTAGACA CATCTCTCAA GCCATTTCCT TGGTTGGGGT TCGACTACAA CGTCGGTGGC GTCATCGCCG CAGCCTTTTA CGGCTTCGTC GGCTCGCTCG CGTTTCAAAC CAACGTCGTC CTCGTCATGA AATCCATCGT CGTTGAGAAG GAATTGCGAT TGCGAGAAGG GATGAAGATG ATGGGTATGT CAAACACGAT GTTCTGGTGG TCTTGGTTCT TCACGCACTG GCTTTCGGCG ATGATTTCCG TCGTCTTGAT CACGTTGGTC GGGATTTACC CGTTTTCGTA CACGAATCAA TTCATTCAAT TCATCTTTTA CACGTTTTGG GTGGCGTCTT TAACGCTTTG GAACTTTTGG ATCTCGACAT TCTTTTCCAA ATCGATCACC GCCACCATAG TCGGCTGTTT CGCATACGTC CTCACCATGG TTCCTTCCAT CGCGGTCCGC ATCACGCAGC CCGAGGGCAG CGGCGCTTGG ATCTTAGCGT GCATATTTCC ATCCGGGGCG ATGAACATGT GGGGCGCGGC GCTGGCTATT CTGGAGGTGA ATAAGAAGGG TATCACGATG GAGACATTCA ACGAAGACGT AACGCTCAAG GGGAACGTCA CGTGCGCTGG TATCTTAGGC ATGGTGATTT TCGATTGCTT ATTTTACGCG TTTCTCACCT TTTACTTCGA CGCCGTGTGG AAAACCGAAT ACGGCACGCG CCAGCCTCCT TGGTTTCTAT TCACGCGTAA GTACTGGTGC GGTGACGCGT CGAAGGTCAT CGATGATGAA ACGCTTGGAA TGCACGAGCA AGAGTCTGGG GACGCCGTCG AGCCGCTCAC GAAGCAACAG ATGAAGTCAG CCAGCGTCGT CGTTCGCGGT TTGACGAAGA AGTTTGGCGA CGAAGTCACT GCGGTGGATA ATTTGAGTAT GACCTTTGTT CCTGGCCAAG TGAGTGGATT ACTTGGTCAC AATGGTGCTG GTAAAACGAC GACGATTAGT GTTCTCACGG GTATGATTGA ACGGACCTCT GGTCGAGCGA CTATTGACGG GTACGACACC AAAACCCAGA TGCGAGAAAT TCGCGCCGGT CTCGGTATCT GCCCTCAATT CGACGTCCTT TGGCCGACAC TCACGGTGCG CGAACATTTA CAATTATACG CCGCATTTGC GGGTATGCAA AAAGAGGATG TAGAACGAGA ACTCAAGCAA GTCGTGGAAG AGGTTGCGCT GACGGAGAAG ATTGACGCGA ACTCGAAGGA TCTATCGGGT GGGATGAAGC GTAAGCTCTC GCTCGCTATT GCGTTTATCG GCAGTCCGTC CGTAGTGTTT CTCGACGAAC CAACGTCTGG GATGGACCCT TACTCGCGAC GATTCACGTG GGACGTCATT CGCAAACGAG CCGCCAACTG CACAGTCTTG CTCACGACGC ACTTTTTGGA CGAGGCTGAT TTGCTGTGCG ATCGAGTCGC CATCATGAGC GCTGGTCATC TTGCATGCAT CGGTTCGCCA CTATTCCTCA AGTCTAAATT CGGCACTGGT TATCTGCTCA CGTTTGCGCG TCATAGTCGC GCGTCATCGA CGCAAATCGC GTCGATGACG CAGCACAACT CGAACGCCAT GCTGCGAGTG ATTCAACACT TCGTACCCAA AGCCGTTGTT CACAGCGACG TCGGCGCCGA GCTATCGTTT TCTTTACCAT TCGAGTCTAC CGGCGACTTT TCTGCGTTAT TCAAGGCTCT CGATGAGCAG ATCGGGAATT TAGGGTACGC AAGTTACGGC ATCTCGTGCA CCACGCTCGA GGAGGTATTT CTATCTCTCG CACATGGGGT GAAGGCGGCG TCCACGGCGC AATCGAAAGG CAAGGTTGTG GACTTAGACG CACCCGTGGA AGAAGCCGAC GAATGTGAAG ACCTCGATGC AAAATTGGTC GCTGGCAACC GCAAGGACAT TCGCCAAACC TACGCGAAGG GTGGTGCCTT ACTCAGAATC CAACTCAAAC GCTTGCTCTG GAAGCGTTGG CTCAATTGGC GTCGCAGCAT GCGGTCGATA TTCATGCAAC TTCTCGTACC TTGTTTGCTC GTCGTGCTCG CGCTTTATTT GACCACGTTG AGTTTTGAAC CTGGGAGCAC GATTTCGGCG AAAGAAATCA GTCGAAAGCT ACTCAATAAC AAGAAGACCC TCGTGACGTA CTCCCCTTCG GCGACTGAGG CGGTCGCCGT CACGACGAAC TTGATGACAG ACACTTACGC GTTACGCTCT CAGTATCAAC CCGAGATTTC GTGTATGTGC AACTGCTTAG CGAAAGATCA AACGATCACG ATGTCTGCCA TGACGTGTTG CGCGTATAAC AGAACCATTG AGTCAACGTG TCGAGTAGCG GCATTGGCAC GTCTTGCGGG CGGGACGTAT TGCGTCAAGT CTCCAAAAGG CTACGATGTA CAAAGAGACA TCGTTTCTAT GGGGCTATCG TGCTCGTCCA TGATAGATGA CAGTATCGAT AGTTACTTAC TAGACGTCCA AGAGCCATCG GTGCCTTGTG ACATGCAACC GACGGGATCG CCGTGCGACG TCTTGTACGT TGACGAGTAC GACGGCACGC GCTATAGTCA CACGCTGTAC GGACATCAAA CAGCCTTACA CGGTATGCCG ACCGTCATCA ACAACGTGAA TAGTGCTATT TTGCGCAAGC GCACGTCCGA TTCTAGTGCG AGCATCACGA CTACGATTCA CTGGTATCCG AGCGCGGTGA ATACACTGAA GGAAGGAGAC ATCGAAGAGC CGGACAACTC TGGAACTACG TTCATCGTTT CCATGTTCGT CGTCATGGGG CTCGCTGTTC TCAGTGCTGG GATTTCAATC TTTCCAGTGT ACGAACGATG CAATAACTCC AAACACTTAC AGCTCGTGAG TGGAATCGAT AAACGAATTT ATTGGCTGGC GCACTACGTC GCAGACGCCA TACAGCTCGT TATTCCGTTC GCCGTCATCG TCGTCATTTT TGCCGGTTTC AATGCGTCTT ACTTCCAAGG TCAACTCGGG GCGATCACTG TTTTGTTGGG TTTTTTCATG TTGACCTCTA TTCCACACGC GCATTATCAA GGATTTTGGT ACACTAGCGA ATACTACACC TTTGTTGGGC AAATCGGAAC AAACACCACC GTTGGCGTGA TCACCACCAT CGCCGGTATC GTCACCGATG CCCTGAAAGA CTTGAATAAA GAGACGTTGT TGGTGAGCAA AATCTTCAAT TACACGTTTC CGCTCATCAT TCCGCACTTT AGTTTCGGGA AAGGTCTATA CGACATCGCA CAAAATGGTT TGGACAAGAC GCGGAAAATT TTCCGCGAGG ATTGCATGTG CCTCGTCCCC GTCGTACCAA AGGGTTCTTT CAACGTCATC GCCGACGATT TGGGCTATTT GATCGGAACA TTTTTCATGT GGAGCGCTTT GTTGTTTTAC AAAGAGTATA AAGAGATTTT CGCAAGTTGG ATACTGATGA AACGTGGAAA CTCGCAAGAG GTATCGCCGA GCGTCGACGA AGATGAAGAC GTGCGAGCCG AGCGCGAACG CGTGCTCTCA GGAGACATCG ACGGTGATGG CGTCATCATG GACCGCTTAT CCAAGACGTA CAAAGGCTTG TCAGCGAGTT CTACAAAGCT CGCCGTTCGA AATCTCAGCG TGGGTCTACA TCGCGATCAA TGCTTTGGTT TGCTCGGCAT CAACGGCGCG GGAAAGACGA CGACATTCAA GATGCTCACG GGCGAGTTCC CACCATCTGC GGGTGACGCG ATCATTCAAG ATCGAGACGG TGCGTCTCAC AGTGTGCGTA CTGATCTTAA CGACGCGCGA CGATTGATGG GATATTGTCC TCAGTTCAAC GGCTTGCAAC CAAACTTTAC CGCACGCGAG CACATTGAAT TCTACGCCGC CATTCGCGGA ATGCCGACGG AGATGATTCC GCGCGTGACG GAAGATCTGC TGCAGCGTAT GGGTTTGACT TTGTATGCCG ATAGACAAGC TGGAACATAC AGTGGAGGCA ACAAGCGTAA GCTTTCTGTC GCGTTGTCGC TCGTCGGCGA ACCAGAAGTC GTCTTCTTGG ATGAACCATC AACTGGTATG GATCCCGAAG CGAGACGGTT CATGTGGGAC GTGATCTCGT CCATGATGGT TGGACGCACG ATTGTTCTCA CATCGCACTC TATGGAGGAG TGTGAGGCAC TCTGTAATCG CATCGGTATC ATGGTGAGTG GCGAGTTCAA GTGTCTTGGC TCCTTGCAGC ACTTGAAGTC GCGCTTTAGT GAAGGATACT CCATCGATCT GCGCTTTTCC GACGGAAAAG GAAACGCCGT TATGGAAGCG CTACGAGCCA AACACGGAGA CATTGGAGCG GAGATAGTAG AAACACACGC CACAGAAATT AAACTTCGCG TGATGAATCC CGAAATGAAA CTTTGGCGTA TTTTTGACGC CGTCGAAGCT TTGAAGCAGT CGGACGACGA CGGCGCAAGA ATTGACGACT ACTCCGTGTC GCAGACGACT TTAGAGCAGG TCTTCATCCG ATTTGCAAAG GAGCAAACCG AGGAGATACA CGCGGCACCG GGATTACAGT AG
|
Protein sequence | MKDGGSAYSV TLDTSLKPFP WLGFDYNVGG VIAAAFYGFV GSLAFQTNVV LVMKSIVVEK ELRLREGMKM MGMSNTMFWW SWFFTHWLSA MISVVLITLV GIYPFSYTNQ FIQFIFYTFW VASLTLWNFW ISTFFSKSIT ATIVGCFAYV LTMVPSIAVR ITQPEGSGAW ILACIFPSGA MNMWGAALAI LEVNKKGITM ETFNEDVTLK GNVTCAGILG MVIFDCLFYA FLTFYFDAVW KTEYGTRQPP WFLFTRKYWC GDASKVIDDE TLGMHEQESG DAVEPLTKQQ MKSASVVVRG LTKKFGDEVT AVDNLSMTFV PGQVSGLLGH NGAGKTTTIS VLTGMIERTS GRATIDGYDT KTQMREIRAG LGICPQFDVL WPTLTVREHL QLYAAFAGMQ KEDVERELKQ VVEEVALTEK IDANSKDLSG GMKRKLSLAI AFIGSPSVVF LDEPTSGMDP YSRRFTWDVI RKRAANCTVL LTTHFLDEAD LLCDRVAIMS AGHLACIGSP LFLKSKFGTG YLLTFARHSR ASSTQIASMT QHNSNAMLRV IQHFVPKAVV HSDVGAELSF SLPFESTGDF SALFKALDEQ IGNLGYASYG ISCTTLEEVF LSLAHGPDNS GTTFIVSMFV VMGLAVLSAG ISIFPVYERC NNSKHLQLVS GIDKRIYWLA HYVADAIQLV IPFAVIVVIF AGFNASYFQG QLGAITVLLG FFMLTSIPHA HYQGFWYTSE YYTFVGQIGT NTTVGVITTI AGIVTDALKD LNKETLLVSK IFNYTFPLII PHFSFGKGLY DIAQNGLDKT RKIFREDCMC LVPVVPKGSF NVIADDLGYL IGTFFMWSAL LFYKEYKEIF ASWILMKRGN SQEVSPSVDE DEDVRAERER VLSGDIDGDG VIMDRLSKTY KGLSASSTKL AVRNLSVGLH RDQCFGLLGI NGAGKTTTFK MLTGEFPPSA GDAIIQDRDG ASHSVRTDLN DARRLMGYCP QFNGLQPNFT AREHIEFYAA IRGMPTEMIP RVTEDLLQRM GLTLYADRQA GTYSGGNKRK LSVALSLVGE PEVVFLDEPS TGMDPEARRF MWDVISSMMV GRTIVLTSHS MEECEALCNR IGIMVSGEFK CLGSLQHLKS RFSEGYSIDL RFSDGKGNAV MEALRAKHGD IGAEIVETHA TEIKLRVMNP EMKLWRIFDA VEALKQSDDD GARIDDYSVS QTTLEQVFIR FAKEQTEEIH AAPGLQ
|
| |