Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44908 |
Symbol | |
ID | 7199603 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 609454 |
End bp | 612506 |
Gene Length | 3053 bp |
Protein Length | 877 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179032 |
Protein GI | 219116474 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTTTTCCG GCACTCACAT TCAAAGAATG ATATATGATG CGCGAGTCCG TGCTACTGTT GCATTTCCTC TTATTGGGTG CTACCGATCT CTGGACGGCT GAAGCTGCTA TCGCAACTAG CGACACCGTT ATCTCGTCAC GGCGGATCGA CACGGCGGTT GCCGCCGAAT CGGCATTGAT TGATAAGGAA ACCGGCCGAA TTTATTGGGA AGGAGGTTCG CAGACGACTT TGTCGGCAAC ATTGGGGCCA ACGTTAGCTC AGTTTGGAAT CCAACACGTA AAGGCGACTC TATTGTCGTT ACTGCTGGTT TTTGCCGTCG TGGGCGTTCT CTTAGGTTGG CTGCGACACA AAGTAGAAAC GTCACTATTG TTCCAATCCG ATCATCGGAG AGTTTACATT TCGATAGTTT ATCATCTGCT ACAATGGACG ATATTGAGAA CGCCACGGCT GCCCCCAAAA CTGGTTACGG CGATTGTGTT ACTGTACTTC CTGGAAGCCT TCCAATGCAG CACTCGGACG TACTTGGCTA ACGCAATTTG CAGTCCCGAA GAAGTTGAGC GCTACATCGA AAATCTAAGG AGTCAGGATC CCAAAATTCA ATGGACCGTT CGGTCTTTTC ACTATGAGCC CTTCTATACG GCGCTGTTAC GAATATTTCA GCGGCAACGT AAATCTACGA GCGAAATCGA TTGTGATGCT ACTGAAGGCA CTACTTGTTT TTCAAAATTT ACAGGCCCAA ATACTGACAC AGATGAAAAT ATTCGGGAAT CAAGGAGAAA AGGGCCGACT TTGGATTCCA ATCACTGGTG GGTGCGCAAA CTCATTACTC ATAACGCCAC TGGCACATAT AATTACCAGC AAGTGACAGA TCTCACGACA GCCGGAGTTT GGCGACGTGC TCCGGCCTCT CCCATTGCGC CATTCTCCAA ATTAATCCTG AGCAAGCATG TTGTCCTATT GGACGGCAAG ACAAGGGGCG ATTATCTTTC CCAGCAAGCC GACTTTGCTA CAAAGCACGG CCAAGAGGAC CGGATGGCGG AATACGCCAC CAACCTTGCC GTCGAAGGCT TTCAATCCCG TGTCCTGGCC GTGAGAGCGA ACAGCAACGA TTTGACAGGG GAAGGTTGCT GGTGGACTAC CCGCTTTTTT CAGTCTCACA TGTTTTGGCT GGCTACGGCC TTCGGGCTCA CAGTCCCGTA TCGGTATTGG TTTGCGCGAC ATTGCGACGA AATTCGTATC CGCGTTGTTA AAGAGATATC AGCCGCTCCA GTTCCAGCAC CATCCTGGTC TTGGTTTGGA CCATCGAAGA ACAACGTGGC TGATAGCAAG ACTTGTCGAA CGTCAAAAAT GGGCAACGGA GATAGTGACG AAAACTACCG TTCACTCATG CAGACACTGA GGTTGTACGG CACAACCAGC GTTAAAGACA TGAAGCCGTC TTCAACAACC AAACTAGTAG AACCAACAAA AATCTCGGGA GAGAATAACA ATGTGACTGT TGCTGAGCTG CAGCGCGAAG TGGATGATGC CAAAGAAGCT GCTTCGCTGT TTTCCGATTT GTTGGCGTCT GACACGGAAA CAGCTTCGGA GGCACTACAC GAAGAGGCTG CTCCACCGGG GGGCACAAAA GACAATTCTG TCTCTGAAAA AGATGCTAGC TCTATCACTT TGGCAGAGGA CCTGTCGGCT AGTTCTACAA ATACAACAAC AACTAGCAAA AAAGACCAGT AGTATAACTC ACAGTCTGTT TTGAGCGAGT AGTTTCCACG TGTCACATTG CCATCAAAAA GGGAATCTGG ATTTGCCCTG CAAAACAATT GAAGATCCGA ACTTGAAGTT CCTTTGGTCA GGCTATGCAA CCCTTTCCAA CTCATTGGAA CGAACCACAT ATTGTTGGCG ATCACGATGA CGATGATGCG AGTGGGTTGC CTTTTTCTCT TGGTCGCAGG GTTTGCAGGC GCCTTTACTC CGCTCCAACC ATTCAACTAT GGATCATCCA CGTCGGTGGT TAAGAGCCGT CGTTCGGCCT TGAGCGCGAT GCCCGATGGT GGTGTCGTGA TTACTGGTAT GTGCAACTCG AGGGGTGTGT TCCAATTTCA ATTCTACCGT GCAAGCACGT GTCACACAGT CAGTGCAACT CTGCTGGGGC TTTCTATGGA TCTGTCTTCA TTTACTTACA CAATGTAATT TAGGCGCCGC GGGCGGGGTC GGCTTTGCGT ACGCTGGGGA ATTCATGCAG CGAGGCTACG ATGTTGTAAT TTGTGACGTA CGGGATTGTT CGTCGGCCGC CAAGGCCCTG GAATCACGAC ACCCCGAAGG AGGAAAAATA CATCACGTTA AATGTGATGT GTCTTCCCAA AAAGACGTCC TAAACTTGGG AAAATTTGCC AAGGAAAAGC TCGGAACAAT CGGATATTGG ATCAACAATG CGGGCATAAA TGGTGGACGA CGGGATTTAC GGGAAGTGTC GATGGACCAA GTAGAGATGG TTGTCCGGGT GAACCTAGTC GGTGTGCTAC TTTGTACTAA AATAGCAATG GAAATTATGG GCGAACAGGA AGAAGTGGTC GGGCACATTT TCAATACAGT CGGATCAGGG GTCAAAGGTG GCGGTACACC AGGGTACGCC TGTTATGGTG CCACCAAACG TGGTTTGCCA CAATTGACTG CCACACTCGT CAAAGAACTC GATGAAGGAG TACAGGGTTA CGAAAAGAAA AAGACCAAGG GAACAATTCA GGTTCACTCG CTATCGCCTG GTATGGTTTT TACTAAATTA CTGCTGGACG ACTCAACTCC CGAACTACGC AAGTTCCCCT TTGGAGTTCT GGCCGCCCAA CCCGAAGAAG TGGCAGCAGA TTTAGTACCC AAAATTTTGG CCCAAAAGAG CAACGGTGGG TCGGTCGAGT TTTTGACGAC TGATCGTATC CTGAATAAGT TCTTTGAAAG ATTCATTTTA CAAAAGAAGT CCGCTTACAT TGATGATGAC GGTAACGTCA TCAAAATGCC GGGCGAACAG TACGACGAAA CCGGTGCACG AGCATTATAC TAA
|
Protein sequence | MMRESVLLLH FLLLGATDLW TAEAAIATSD TVISSRRIDT AVAAESALID KETGRIYWEG GSQTTLSATL GPTLAQFGIQ HVKATLLSLL LVFAVVGVLL GWLRHKVETS LLFQSDHRRV YISIVYHLLQ WTILRTPRLP PKLVTAIVLL YFLEAFQCST RTYLANAICS PEEVERYIEN LRSQDPKIQW TVRSFHYEPF YTALLRIFQR QRKSTSEIDC DATEGTTCFS KFTGPNTDTD ENIRESRRKG PTLDSNHWWV RKLITHNATG TYNYQQVTDL TTAGVWRRAP ASPIAPFSKL ILSKHVVLLD GKTRGDYLSQ QADFATKHGQ EDRMAEYATN LAVEGFQSRV LAVRANSNDL TGEGCWWTTR FFQSHMFWLA TAFGLTVPYR YWFARHCDEI RIRVVKEISA APVPAPSWSW FGPSKNNVAD SKTCRTSKMG NGDSDENYRS LMQTLRLYGT TSVKDMKPSS TTKLVEPTKI SGENNNVTVA ELQREVDDAK EAASLFSDLL ASDTETASEA LHEEAAPPGG TKDNSVSEKD ASSITLAEDL SARFAGAFTP LQPFNYGSST SVVKSRRSAL SAMPDGGVVI TGAAGGVGFA YAGEFMQRGY DVVICDVRDC SSAAKALESR HPEGGKIHHV KCDVSSQKDV LNLGKFAKEK LGTIGYWINN AGINGGRRDL REVSMDQVEM VVRVNLVGVL LCTKIAMEIM GEQEEVVGHI FNTVGSGVKG GGTPGYACYG ATKRGLPQLT ATLVKELDEG VQGYEKKKTK GTIQVHSLSP GMVFTKLLLD DSTPELRKFP FGVLAAQPEE VAADLVPKIL AQKSNGGSVE FLTTDRILNK FFERFILQKK SAYIDDDGNV IKMPGEQYDE TGARALY
|
| |