Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_17772 |
Symbol | |
ID | 7196838 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1858805 |
End bp | 1862965 |
Gene Length | 4161 bp |
Protein Length | 1313 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177389 |
Protein GI | 219111275 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGCCATTC ATTCATTGTC ATCATTTTCG ACACCGATAC AAGTTGGAGA AACCATTCAC CGTACAATGG AGACAATCGT ACATTATTAC CGAAAAACGG AGCCCCCGCA TTCCTTGTTT CCCTCTATTA CAGAGGAGCT TAAGGTTTTA GGGCTCGGAG CGGACGGTAA CAAAATCGAG GCGGTCGAGA CGGAGAGCTG TTTTAATGTT CTTCTGGGTA CGGAACTCGT CGCCGAGCAG CAAGAGAAGT TGGAATGGCT GCTCGCGGAA ACTTTTGATC GCGATTCGCT TCAAATCGGA AAGAGTTCTT TTGATACCAG CGTCGGAACG GACACTTGGA AAGTGGAGTT TGGCCCACGG ATGACGTTTA CTTCGGCATT TTCGTCGAAC GCTGTAGGTA TCTGTCAAGC TTGCAACTTA CCTATTTCTC GCCTGGAACT GTCTCGTCGC TACTTATTCA CTACAAGCGA GGCATTGTCG GATGAAGCGA TAGTCGCAGT TAAAGCCATG CTGCACGATC GCATGACGGA GGAGGAGTAC GCTGCTCCCT TGAAGTCGTT CGATAGTGGC GCTCATGCAA AGCCAGTTCG GACAATCCCC ATCATGGCAG AGGGCCGAGG AGCCTTAGAA ATAATCAATA AGGAGATGGG TCTCGGATTT GATGACTTTG ATTTGGATTA TTATACCAAT CTCTTCAAGG TAAGTGCGTT CGGTGCGAGT TGTGTGAAAG CTGTATCAGC TTTAATTCAC ACGTCGAAAC ACCTGGTCTT TAGGAGAAAT TGGGACGAGA TCCCACCGAC GTCGAGTGTT TCGACATGGG ACAGTCAAAC TCAGAACATT CGCGCCACTG GTTCTTCAGC GGCAAGATGG TAATTGACGG GAAAGAGAAA CCTCATACGC TTTTCCAAAT GGTCAAAGAC ACACTTCCCA AAGATATTCC CAACAACTCG ATCATTGCCT TCCACGATAA CTCTTCTTCG ATTCGAGGGT ATGAGTGTAG CGCTCTTCGT CCTGTCTCGT TGGACAGTGC AGGGCCAGTG CACGTGGGAA ATCAAACTTT GCATCCTATT TTGACGGCTG AGACGCACAA CTTTCCCTCT GGTGTCGCCC CTTTTCCCGG TGCCGAAACA GGCACAGGCG GTCGCCTTCG TGACGTGACG GCGACCGGCC GTGGTGCCTA TCCTGTGGCA GGAATCTCAT CGTATTGTGT TGGAAACCTA CAAATCCCTG GATATGATTT GCCTTGGGAA GACAAGTCGT TTGTGTACCC GAGTAATTTG GCATCGCCAC TCAACATTGA ACTTCGAGCC AGCGATGGTG CATCGGACTA TGGAAACAAG TTTGGAGAGC CCGTCATCCA TGGATTTACC CGATCCTTCG GACAGCGACT TTCGAATGGA GAACGTTTCG AGTGGGTCAA GCCGATAATG TTTTCAGCTG GAGTTGGACA GCTCGACGGA GATCATACAA CCAAGGGAGC GCCTGAAAAA GGGATGCTAG TCGTCAAGAT CGGAGGCCCA GCTTATCGTA TTGGAATTGG AGGGGGGGCT GCTTCTTCTC GTGTACAGAG TGCGGAAAAT GCGGACTTAG ATTTTGATGC CGTTCAGCGC GGAGATGCTG AAATGGAAAA TCGAATGAAT CGTCTAATGA GAGCGTGCTG CGATCTCGGT GAAAGAAACC CAATAGTGTC AGTGCATGAT CAAGGAGCAG GAGGGAATGG AAATGTTCTC AAGGAAATTG TAGAGCCTGC CGGAGCTTCA TACGATATTC GCAAGGTGAG TGGTCGCCAG AACTGAAAAT TTTTTTCGTC TAGTTCTTAG GAAAACTCAA TATTCTTATT GTGATTCATT GCAGGTATAT GTTGGGGATG AAACACTTTC GGTTCTTGAA ATATGGGGGG CAGAATATCA AGAGAATAAC GCACTACTTA TTCGTCCTAC TGATCGAAAT TTGTTTGAAG CTATTGCCAA ACGCGAAAAT TGCCCTGTCC GCATTCTAGG TGAGGTCACT GGAGATGGCA AGGTTGTCGT TCACGATTCG AAAGACAATT CGACTCCTGT GGATCTTCCT CTTGAACTGG TTTTAGGGAA GATGCCTCAG AAAACATTTG TCGATGATCA TATTGCGAAT AAGCTCGAGC CCTTACGCCT TCCTGAAACT GCAACGGTGG CGTCCGCTTT GGATCGCGTT CTTCGATTGC TTTCAGTGGG ATCGAAACGA TTCCTAGTGC ACAAAGTAGA TCGTTCGGTT ACCGGTCTTT GCGCACAGCA ACAATGCGTA GGCCCTTTGC AACTGCCTTT GTCCAATGTA GGCGTCACAG CGCATACTCA TTTTGGAATC ACAGGAACGG CTGTTGCGTG CGGCGAGCAA CCAATTAAGG GACTCGTGGA CTCTGCAGCT ATGGCAAGGA TGACTGTGGC CGAGGCAATG ACTAACATAA TGTGGGCAAA GCTTTCGAAG ATCGAGGATA TAAAAGCTAG TGGAAATTGG ATGTACGCTG CAAAACTTCC TGGTGAAGGT GCTAAAATGT ACGATGCATG CGAAGCGCTA AGAGACGCCT TGCTTACTCT TGGTGTTGGA ATCGATGGTG GTAAAGATTC TTTGTCAATG GCCGCAAGGT GCGGTAACGA GGTAGTTAAG GCTCCAGGTG AGCTCACGAT GACCTGCTAC GTGACCTGCC CTGATATAAC GAAGACCGTC ACACCTGACT TGAAATGTCC AAGCGGCGGG TCCACTTTAA TATACGTGGA TCTCGGGAAT GGTAAGACGC GCCTCGGCGG ATCAGCACTT GCTCAGGTGT ACAGCCAAAT CGGTGACGAA TCGCCTGATA TTGACGACTT TGGTGTGTTG AAAAATGCCA TGGTGGTGAC GCAAGATTTG ATTGAGAAGC GCACTCTTCT TTCTGGGCAC GATCGGTCTG ATGGTGGCCT CATCGTCACT CTAGTGGAAA TGGCAATCTC CGGTAATTGC GCCATTGACA TTGCACTACC GCCTGCCGGC GCCAATGCTT TTGATGTCCT ATTCAACGAA GAAGCTGGCA TCGTTCTGGA AGTTTCTAAC GAAGATGGAG GGGCCGTTAT GAAAGCTTAC GCGGATGTCA ACATACCCGT ATTCAAAATT GGTACGGTTT CTTGTGGCGA CTCGATCAAG GTTTCAATTG GAAATGAGTC TCCCTGCATT GACAGCAAGA TGACTGTGCT TCGTGATGTA TGGGAAGCAA CTTCGTTTCA ACTGGAGAAA CGGCAGAGAA ATCCAAAATG CGTTCACCAG GAAGAAGTAG GTCTCAAACT ACGTCATGCA CCACATTGGA AACTAACCTT TGAGCCACTT CCTACTGATA TTTCTATCAT GAACTCGACC TCCAAACACA AGGTGGCGAT TATACGTCAG GAAGGAAGTA ATGGGGATCG AGAGATGATT TCGGCGTTCT TGTCCGCGGG CTTTGAGTCT TGGGATGTCA CTGTCAGCGA TCTCTTAAGC GGCTGTATTA CTCTCGACAT GTTTCGCGGT ATCGTTTTTG TCGGAGGCTT TTCGTTCGCT GATGTCCTTG ATAGCGGAAA GGGATGGGCT GGTGTCATTA AATTCAACGA AAGCGTCTTT CATCAGTTCC AGAAATTCCG AACCCGGAAG GACACGTTCA GCCTTGGTGT ATGTAACGGA TGTCAACTTA TGGCGTTGCT CGGATGGATT CCGTCTACGG ATGGCTTGGT CGAAGAGAAT CAGCCTCGCC TCTTGCACAA TGACAGTGGC AAGTTCGAAA GCCGCTTCTC GAGTGTCAAG ATCCAGTCGA GCCCTGCTGT TATGTTCAAA GGAATGGAAG GCTCGTCTCT TGGGGTTTGG GTGGCGCACG GAGAAGGCCG TTTCCATTTC CCTGACCCGT CTGTTCAAGA AATGGTGAAA GAAAAGGACC TTGCCCCGCT TCGCTATGTG AACGATACAA ACGACGTAAC GCAAGAATAT CCATTTAACC CGAATGGCAG TCCTGACGGT ATTGCCGCTC TTTGCTCGGA AGACGGACGC CACCTTGCAT TGATGCCTCA CCCAGAACGT GTTTTCACGA CCTGGCAATG GCCCTGGACT CCCGCAGAAT GGAAGGACTT TAAAGTTGGA CCATGGCTGC ATATGTTCCA GAACGCCCGC ATCTTTTGCG ACGAAAACTA A
|
Protein sequence | METIVHYYRK TEPPHSLFPS ITEELKVLGL GADGNKIEAV ETESCFNVLL GTELVAEQQE KLEWLLAETF DRDSLQIGKS SFDTSVGTDT WKVEFGPRMT FTSAFSSNAV GICQACNLPI SRLELSRRYL FTTSEALSDE AIVAVKAMLH DRMTEEEYAA PLKSFDSGAH AKPVRTIPIM AEGRGALEII NKEMGLGFDD FDLDYYTNLF KEKLGRDPTD VECFDMGQSN SEHSRHWFFS GKMVIDGKEK PHTLFQMVKD TLPKDIPNNS IIAFHDNSSS IRGYECSALR PVSLDSAGPV HVGNQTLHPI LTAETHNFPS GVAPFPGAET GTGGRLRDVT ATGRGAYPVA GISSYCVGNL QIPGYDLPWE DKSFVYPSNL ASPLNIELRA SDGASDYGNK FGEPVIHGFT RSFGQRLSNG ERFEWVKPIM FSAGVGQLDG DHTTKGAPEK GMLVVKIGGP AYRIGIGGGA ASSRVQSAEN ADLDFDAVQR GDAEMENRMN RLMRACCDLG ERNPIVSVHD QGAGGNGNVL KEIVEPAGAS YDIRKVYVGD ETLSVLEIWG AEYQENNALL IRPTDRNLFE AIAKRENCPV RILGEVTGDG KVVVHDSKDN STPVDLPLEL VLGKMPQKTF VDDHIANKLE PLRLPETATV ASALDRVLRL LSVGSKRFLV HKVDRSVTGL CAQQQCVGPL QLPLSNVGVT AHTHFGITGT AVACGEQPIK GLVDSAAMAR MTVAEAMTNI MWAKLSKIED IKASGNWMYA AKLPGEGAKM YDACEALRDA LLTLGVGIDG GKDSLSMAAR CGNEVVKAPG ELTMTCYVTC PDITKTVTPD LKCPSGGSTL IYVDLGNGKT RLGGSALAQV YSQIGDESPD IDDFGVLKNA MVVTQDLIEK RTLLSGHDRS DGGLIVTLVE MAISGNCAID IALPPAGANA FDVLFNEEAG IVLEVSNEDG GAVMKAYADV NIPVFKIGTV SCGDSIKVSI GNESPCIDSK MTVLRDVWEA TSFQLEKRQR NPKCVHQEEV GLKLRHAPHW KLTFEPLPTD ISIMNSTSKH KVAIIRQEGS NGDREMISAF LSAGFESWDV TVSDLLSGCI TLDMFRGIVF VGGFSFADVL DSGKGWAGVI KFNESVFHQF QKFRTRKDTF SLGVCNGCQL MALLGWIPST DGLVEENQPR LLHNDSGKFE SRFSSVKIQS SPAVMFKGME GSSLGVWVAH GEGRFHFPDP SVQEMVKEKD LAPLRYVNDT NDVTQEYPFN PNGSPDGIAA LCSEDGRHLA LMPHPERVFT TWQWPWTPAE WKDFKVGPWL HMFQNARIFC DEN
|
| |