Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45629 |
Symbol | |
ID | 7200666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 762862 |
End bp | 769342 |
Gene Length | 6481 bp |
Protein Length | 2144 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179711 |
Protein GI | 219117848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCATT TTGCTCTGGT GCCAAAATCG AGCACGACTG TGCCTTTGCA TGATCAAGAG AACAGTTCTT GTATTCGGGC TCCTCGGGTT CAGGCTGAGC TAACGAGCCT ATACGAGACA TACGCTTCCT TGTATCCCAA CCCACGTCTC ATGGAAGAGT CCCAGGATCG AGACAACTTG TTACGCGCAA TATCATCCGC ATCGCATGCC GTACAATTGA ACTCGCCCAC ACAGAAGAAA ACAGGCTTGT TCGGCTGGTT CAGATTTTCG TCGGATCTGG CGAAGCCCCT AGCGATACAA GCTTTCTCAG ATCAAGGAAA GGATGACAAA CGATTACGGT ATGCGGAAGG AATGAGCGAG GACAATCTCC AGCTGCTAGG AACCCACCTG GCGTCTCTTG ATCCTTCCTG TATACCTGCA GAGGCTTACA GCATGGCGCG ATTTCACCTC GTCTCAGGAC GAACGAAAAG TGTGAACGTG GCTCGGGCAG CGATATGGCA GCCACCGCAT GATACCACGA CACTTTCTGT TGTGGTTACA GGCCTTGGAA GCATAGTGGA GTTGGTCTTA GATGAAAGCA ACGAACATGA ATTGGATGAG GAGAAAGGAG AAGTTCAATA TACTTCAGAC CGGAATGAGC TGCTAGAACA CTTTCGCCAG ATATCGGCTA CTTCATATTC GGCCATCTAT GCTCGCGCCA GCACTGTGGG TCCTAACTTC TTGGTTGTGT CATGGGGACT GGAAGATGGT GTTGTGGTTT GTTACCGTCG GGTCAAACTC GCAGGCGACC AGAACACACA CTCATGGCAA GCCTGTGGGA TGCTCAGTCC AACCGACGCA GTACAAACAC ATTTGGGTGA CGACGTCTTT GTCCAAGATG AATCGGCTTC TTTGCGAGTG TCGGATATAG TACCCTTGAT TGTCGAGACG GGAAAGGTGC CAGCAGCGAC TATCGTGGTA TCTCGTTTGG GGGGGACCAT TGAGATCGTT CCACTGCCTT CTCGTCTCTG GTACGGCCCC ACCCTCCAGC AGCCCGCGTT AAAGCCGCCC CGAATACAAG TACGACGGGG GGAAGCACTT CACTACGCAA TTGGCGAGCT GACAGACTTG GCATCCTTTC AAAATCGGAT CACGGCGTTG ACGACGAGTG ACTATCATAC TGACGTGCTA GGCTTGGAGA CGTTTCGCAC AAGTGTCACT AGAGGAACGA CGTGGGATAA GGAAGCGTAT CCTCAGGGGC CGCCAGCGGA ACATATTATG GCCGCATTTG GTGCAAAAAA CGGTAGAGAA GCCGTAACGT TTTGGTCTGT CAGCACCATG CTGATGGATG AGACAACAAC AAAAGAGAAC TGTGATGAGT TCGGCTTTTT GCTTCATGCA AATATGCTCG AAGCTATTGA CGTCGGTGCA GTTGGACCAC CAGTCAGTGT GTTTGCTAGC GATCCAATTA TGCGGCACTG GAGGCGGCCA CGGCATGTGG AGCTTTGGGC CGACGCTGAG CAAGGGGTGT TGGATGAAGA AAGTTCTAAG CGAATCACCA CAATATCGGT GTCAGCACCG ATAGTGACAA TACGCTTTAC AACCGCGAGT AGCTCATTGG AAATGGCCAT GTTGGACTGG AACGGTGGGG TAACGATGAT GGATTGCCGC TTGCTGGAAC ATACAGCATC GCAAACCTTA GCGGAAAACG AATTTAAGAT TGTTTATCAA CCTGGTAGTC CCGAAAATAT CGAACCGTTG GTGGAATCGA TCGTAGATCG TTCGCAGTCA CTCAAGTATT TGGTGGATGA GTATGGTGTG TCTCGAGCGA TCGATTTACG GTGGTGGTGT CCTCCTGAGT GTTTGCCGTT GCTGGCAATC TTGATAGAAT CTCCGCTGCG TCTTTGCTTT CTTTCGTTGG AACAAATCGC CAAGGGCACA GTTCCTTGCA GCCTTCGACT CGGAATGGCC TCTGGAACGT TACTGTCTTC TTCCTTAAAT GACGACTTAC TTTTGACATT GCTAGGATGC AAAGGGGGGA GAAGGCAGTT GTCTCTCTTT GCCCTCCATC CGCTCGACAC TCGAACGACT GTGGAATCGC TCTTTAACGA ATCTAAATTC GAAGAAGCAA TCGCCGCCAT TAGTCGGCTT GCCGTGCGTG ACCAAGTGAT TCTCTCAGAT ATGGTTGAAG AGTGTAACAA GCATTTGTGG CGAACCACAC GGGACTTTCG CTACTTGAGT AACAGTTCTG ATGATGCTTT TGTCGTGCAA GAGGCCATGG TTGATGTAGA ATATCTTGCC GAATGCTACG ACTTCGATAC CTATCTTCGG TTGCAAAATG TTGCTCTAAA TCGGATCGAT TATGGAAGGT TGGAGGCTTC ATTCGCATTG AAAGGATTCT TTGGGCCGGA GTCCATCAGG CGCAACTTGA GACAACGATT GGTGCTCTTG GGGACATATC GACTCCTCTG TAAGCGCTTA TTCTCAACAC CGTCGCTTGC ACATTTTCTG GAAAAGTTCG TCTCAGTCAA CATAGCCGAT CTGGCTACAA GCCTGGCTAG AGCCGGAGAT ATTGAAGCTT TGTCAATTGT GGTGTTTCGC CACGAAATCG ACACTGTCAG ATACTTGAGG GTGCTTTGTG AGATACATCC GACAATAGCG TTGTGTCGAT ACCTTCATCT TCTTCCAGTT TGGCAAGAAA ATGTTGAAGC AAAAGGCTTT CTCTATACGG GTAGTAAGGG ATATGTTTTG CGAAATTGGG CCGAAATGCC AGATACGTTG TCGAGCTCCA TTGGCTTCCA ACTCCTCGTG GACGAAGAAG ATCGTGATGC GGTCCAGAAA CTGTACGCAG AAATCGGACC CGATGAAAGA TTGAAAAGTA GTTTGTCGTC CTGCTGGCAA ATACTAAGAG ATTGGTACTC TTCATGGGCG CATATACTTT GCGACTTTGT CTGCGACCTT TCTTCGACAA TCAAGTTCTG CAATGTTTCA CTTAAGGCCC TCCAATGCGA GAGTGCAGAG AAGAGAGACG CTTTGGACGT TTTGGATGCC AGTCAGAATC ATTCAGAAAT TCATCGAATT TTGCGCCGGT CGGAGGCTCT ACAACTTTTG CTCGTTGAAA TCCTTGATGT CAACAATCCT CCAGTAACAC ATTCGACGAT GGGCCTGAAG GAGCTTGAGT CGCTCGACAC TCGCTCCGTA GTACAGCTGG CGCTGACAGG AGTCGTTGGT TTCGATCAAG TTTTGGCACG ATTTCAATAT GTTCTTATCC CATTGCTGCA TCCAACGCAC GCAACGAGAG ACGACACCGG CTTGTACGCT GAACTAGACA GGGAACTTGC TTTATATTGT TTTACAATCT TTGAAGCCGC TCCCATGGAA AATTCTAGTG ATACGTGGGA CCGTACTCAA AGAGCCATAG AGATCTGCAC CGCAATTGTC TCTTTGAGTA CCACAGGAAT TCCTGCCAAG CAGCGGGTTG TAAAGGACAG CCGAATTTTG ATGGACATTG TTACGAGACT GTTTGAAAAT ACCGTTGCAA TCATTTGTCA ACTATGTCCA AAGCTTACCA ATGCGAGAGC GATTGTGGAA ACGCTTTGGA TGGCCTACGA ATCGTTACCA AAGCATCTAG GGAATGGTAG TGCCGAAGAG GTTGACAACA TGTACCGTTC GCTTGTACTA ATTGATATTC TTTCTCAATG GCCAGACTGC TCACCATTTC TTATTGTGTC GAAAAATGAT TCATTGCGGT CTGGCCGGGA AGGCCAAATA CGTGCCGGGA AGCAAGCTGT TGAAAGCATT TGTACGTCGT TCTGCATGCA GGTAGGCTTG AAATTCTCAA GAATCGAAAA CGTTGAGCAT GGACAGCTTC TTTCGATGCT AATAGCAGAC ATCCAGCAAA TCAATCAACA ATGCTGTGAC GGGAATCTAA GTACAGTGGA AATCGTGACT GTTCGTCTTT TTGAGGTTCT CCTACAGCAG CACCAGGTAG TACTCATACA CAAACTTCTC ACATGCTATG GCGCCCTCGT TGACAAAGCA ATGGCGTCAA ATTCTGTGAT GGCGTTCGTC AACGAAGCAG TATTTGAAGA AACAAATGAT GAATCAACAC TGTCAGCGGC AATGGCATGC CAAGATGTCT TATATCCTTG GTTTCCGAAC TTGCATTCGG AATTTAAATC GGTCCGTGTT TATATGGACG TTGCAAAATT CATAAATTCA GCGTTGGCAA CTCATTGGGT GTCTCCTGGA GTCCTTCAAA AGCAGCTGCC ACTGGATGTA GTTGAGTCTA TGCTGATCTC TTACCCATCG GTCGTAGTCT ACGGTTACAA GAATTGGGCA GATGAAAAGT TTGCAAAAGA TGCCAACAAA TTGATTCGTG ACTTCTTTAA AGTGCAGGAC TTAGCTTCTG AGGATGTGGC GGCGTGGGAT GGCCATAAGT TGCCCTCGTT GCCAGGCACC TTTATCTTAC ACATGGCGCA GCGTCTTGAT CTGACTGATA CAAATGCGTT TGTTTTTGTG AAGAACAGGA TGATACACCA TGCACTCATC GGAAAGCACT GCGGTGCCGC CGCGGCCATT TGCCGAAGCC TTTTCCACGA TGAGCTAGAC TCTGTCTCGC GTGATTTGTC CTTGGCCGTC GTTGATGCGG CATCTGCCAT TATCCGAGAG GATTCTTTTG ACGATGCTGA GACCAAATAT GAACTATGTA AACTCGCTTT GTCAAGCCGT ATTGGACCAC TATCCGTGGA GTCGCCACCA GCACATAACG AGCTACTGGG ATACTATTCA AGCATTGAAT ACTTGTGTAC CTCTACCACT CCCACAAATG CTTTCGCGGC TTCAAGCCTT TTTCAGGATA CTCTTGAGCA GTACTCTATC AACTTTTCGG AATTGTTGCG CTCTCTACGT CATCAAAGCA TGTACAACAC GGTCGACGAT GCCCTCTTGA GCACGATCGC TAGATATTCC ATGTTTTGGT GCATATTTCA GTGTTCAAAG TACGAGGAAG CTGATCACGT TGCTGTGCAG CAAGCTGAAG TGACCAGCGC TGCTCAACTC GCCTGCGGAC TTCTTTTCTA CATCAAAGAT TCTGATATTG CAATAGGATG CCTTAAAGAG GTTCAATCCG TGCTTCAAGA AGAGTTTGCG GCTATCGTGT CATCACTTCC TTCCCCGAAC TCAGACTATG TCCGGCCTGA TTGTAGCATC GTTCACAAGC TTGTCGGGCG CGGCTACACC GAAACGGGCG CCAAAAGAGC CTCATTTGTG ACAAACAATG AGAGCTACGA AGCTGCTCTT CAGTGGGCCG TAGCACACTC ACTTGATCCA GATTTTGACA ACCCTCTTGT TTTGACGAAA GCCGCGGACA CAAAATGGAT AGATCAAGCT ACTATGCAAC AGCTCCGTGC GACACTGGAA GCATCGACTC GTATTATATC TAAGAGCTCT GAGCTAGACG AGCTAATTGC GAAGTGGAAA GAGGGTAGTA TACTACCCTT CCAAAGTGTT CAGTCCGGAA AGCTTGAGTC TTTCTCGAGA CGCTACCATA GGCATGATTT TCCCACTGGA AGACGGACTG AAGAACATTC AAAAATTCTT CCTCATATCG GATCCTCTTC GCAGAATTTG TTGGTAGAGC AAAGTTGGCG AAGCGAAAGC GCAGACATGT CGTTCGTCGA CGAGTTGGAC GTCAGTGCTA TTCCCAATCG AGCAGGAGAT GTTTACGAGA AGGTATTGTT GTTAGACGAG AAAGATGTGG ATGGCAAGCA AGAGTCACCT GACCCCATAG ACCGGATTCG TCAAAAAAAC TCAATCGGCA AGGAGCAGCC CCTGGTCCCC GGACAGAGTG GAGCAATGAT ATTTCAGGCT GCGCAACCAA AGGAAAACCT CGATATCGGT TACAAAAGAA AAGTTCTTTT TGCTCGCCAG GAAGACGGTA TCCAGACAAG CAACGAAAGA AGCACTTCGT TGGCAGGCCA CGATGGTACC GAACGATCAC GGGAAGCTAG TAATACGACT GGCGCAACAC AACTTACTAC TGAGAATGTT TTAAAGCCAT CTCATTGGAC CGCCGAAGCG AGTAAACGAA ACCAGGCTTT GCGAGATTCT TTTGAAAGGC ACGAGTCACT GGGAGTGGGG GAAATTAAAA GCCCTGACAT ACCTTTTCTA CGGGAGCAAC GGCGCGCTTT GCTCCGTTCT AGTCGCGTTG CACGAGCCCC ATCGTCCGCA CCAGCGACGG ACGCGGACGA ACGCCGCCGC TTAATCGAAG AAGGTCGCCG CTTGCTACTA CAAGCCCGAG GTGCTGGCGC CATCGCATCT CAGCGCACTT CTCATCGTAA TCCGCTCCTT TCTAACACGG CTGATTCGGC GTCACCTAAA CCTCTACTGG CACAAGATGA ACCAAATCTA AAAGCACCCT CCTCCAATGA AGGAGGGGAC ATTGACGATG TTTGATAGCT GCAAGATAGG AATTTAAAGC ATGAAAGAAA ACACGTTGCT G
|
Protein sequence | MSHFALVPKS STTVPLHDQE NSSCIRAPRV QAELTSLYET YASLYPNPRL MEESQDRDNL LRAISSASHA VQLNSPTQKK TGLFGWFRFS SDLAKPLAIQ AFSDQGKDDK RLRYAEGMSE DNLQLLGTHL ASLDPSCIPA EAYSMARFHL VSGRTKSVNV ARAAIWQPPH DTTTLSVVVT GLGSIVELVL DESNEHELDE EKGEVQYTSD RNELLEHFRQ ISATSYSAIY ARASTVGPNF LVVSWGLEDG VVVCYRRVKL AGDQNTHSWQ ACGMLSPTDA VQTHLGDDVF VQDESASLRV SDIVPLIVET GKVPAATIVV SRLGGTIEIV PLPSRLWYGP TLQQPALKPP RIQVRRGEAL HYAIGELTDL ASFQNRITAL TTSDYHTDVL GLETFRTSVT RGTTWDKEAY PQGPPAEHIM AAFGAKNGRE AVTFWSVSTM LMDETTTKEN CDEFGFLLHA NMLEAIDVGA VGPPVSVFAS DPIMRHWRRP RHVELWADAE QGVLDEESSK RITTISVSAP IVTIRFTTAS SSLEMAMLDW NGGVTMMDCR LLEHTASQTL AENEFKIVYQ PGSPENIEPL VESIVDRSQS LKYLVDEYGV SRAIDLRWWC PPECLPLLAI LIESPLRLCF LSLEQIAKGT VPCSLRLGMA SGTLLSSSLN DDLLLTLLGC KGGRRQLSLF ALHPLDTRTT VESLFNESKF EEAIAAISRL AVRDQVILSD MVEECNKHLW RTTRDFRYLS NSSDDAFVVQ EAMVDVEYLA ECYDFDTYLR LQNVALNRID YGRLEASFAL KGFFGPESIR RNLRQRLVLL GTYRLLCKRL FSTPSLAHFL EKFVSVNIAD LATSLARAGD IEALSIVVFR HEIDTVRYLR VLCEIHPTIA LCRYLHLLPV WQENVEAKGF LYTGSKGYVL RNWAEMPDTL SSSIGFQLLV DEEDRDAVQK LYAEIGPDER LKSSLSSCWQ ILRDWYSSWA HILCDFVCDL SSTIKFCNVS LKALQCESAE KRDALDVLDA SQNHSEIHRI LRRSEALQLL LVEILDVNNP PVTHSTMGLK ELESLDTRSV VQLALTGVVG FDQVLARFQY VLIPLLHPTH ATRDDTGLYA ELDRELALYC FTIFEAAPME NSSDTWDRTQ RAIEICTAIV SLSTTGIPAK QRVVKDSRIL MDIVTRLFEN TVAIICQLCP KLTNARAIVE TLWMAYESLP KHLGNGSAEE VDNMYRSLVL IDILSQWPDC SPFLIVSKND SLRSGREGQI RAGKQAVESI CTSFCMQVGL KFSRIENVEH GQLLSMLIAD IQQINQQCCD GNLSTVEIVT VRLFEVLLQQ HQVVLIHKLL TCYGALVDKA MASNSVMAFV NEAVFEETND ESTLSAAMAC QDVLYPWFPN LHSEFKSVRV YMDVAKFINS ALATHWVSPG VLQKQLPLDV VESMLISYPS VVVYGYKNWA DEKFAKDANK LIRDFFKVQD LASEDVAAWD GHKLPSLPGT FILHMAQRLD LTDTNAFVFV KNRMIHHALI GKHCGAAAAI CRSLFHDELD SVSRDLSLAV VDAASAIIRE DSFDDAETKY ELCKLALSSR IGPLSVESPP AHNELLGYYS SIEYLCTSTT PTNAFAASSL FQDTLEQYSI NFSELLRSLR HQSMYNTVDD ALLSTIARYS MFWCIFQCSK YEEADHVAVQ QAEVTSAAQL ACGLLFYIKD SDIAIGCLKE VQSVLQEEFA AIVSSLPSPN SDYVRPDCSI VHKLVGRGYT ETGAKRASFV TNNESYEAAL QWAVAHSLDP DFDNPLVLTK AADTKWIDQA TMQQLRATLE ASTRIISKSS ELDELIAKWK EGSILPFQSV QSGKLESFSR RYHRHDFPTG RRTEEHSKIL PHIGSSSQNL LVEQSWRSES ADMSFVDELD VSAIPNRAGD VYEKVLLLDE KDVDGKQESP DPIDRIRQKN SIGKEQPLVP GQSGAMIFQA AQPKENLDIG YKRKVLFARQ EDGIQTSNER STSLAGHDGT ERSREASNTT GATQLTTENV LKPSHWTAEA SKRNQALRDS FERHESLGVG EIKSPDIPFL REQRRALLRS SRVARAPSSA PATDADERRR LIEEGRRLLL QARGAGAIAS QRTSHRNPLL SNTADSASPK PLLAQDEPNL KAPSSNEGGD IDDV
|
| |