Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51018 |
Symbol | |
ID | 7202210 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 8458 |
End bp | 13746 |
Gene Length | 5289 bp |
Protein Length | 397 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181116 |
Protein GI | 219121527 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.322976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGTGTGGT GTATGGTATG CCGTCTCACG CCCCGTTTTA CATCGAGCAT CTGCTATATG CGCCTTCACA ACAATCTAGC AAACCAGAAA CCATGGCCGC CCGTCCTCTC GTCAGCGTCT TTTCTCTCTC CGGTGACAAG TCCGGAGATG TGAGTCTTCC TGCTGTAATG ACGGCTCCTC TGCGCCCGGA TATCGTTCAG TTTGTGCACA CCAACATGAA CAAGAACCAT CGTCAGGCGT ACGCTGTTAA CATTCGCGCC GGAAAGCAAG TTGTCGCGTC GTCTTGGGGT ACTGGACGTG CTGTCGCCCG TATTCCTCGT GTTGGTGGAG GTGGTACTTC CCGTTCTGGA CAGGGTGCCT TTGGTAACAT GTGCCGTGGT GGACGCATGT TCGACCCCAC CAAGACCTGG CGCAAGTGGA ATAAGAAGAT CAACATTTCG CAGAAGCGTT ATGCCGTTGC TTCTGCCCTA GCTGCCACCG CCGTCCCGGC TCTTGTGATG TCCCGTGGGC ATGTCGTCGA CAACGTTCCG GAAATTCCCC TCGTTGTGGA AAACGCCGTT GAGTCTGCCA AGAAGACCTC GGCCGCGAAA GACATCCTCT CCGCCATTGG TGCCTTGGAT GATGTCGAAA AAGCGGGAGA ATCGAAGCAA ATCCGTGCCG GTAAGGGTAA GATGCGTAAT CGTCGTTACA CCCTCCGCCG TGGACCTCTC GTTATCTACA AATCGAACGA TGGCGTGGAA CAGGCCTTCC GCAATCTTCC CGGCGTAGAG CTATGCTGTG TTGACCGACT GAACCTTTTG CAGCTAGCCC CGGGTGGTCA CATGGGACGA TTCTGCATCT GGTCTCAGGC TGCTTTGGAA GAGCTGGACA TTATTTACGG AGAAAATGGC AAGCGCATCC CCCAAGCAGC CATGACCAAC GCCGACTTGG CCCGTATCAT CAACTCCGAT GAAGTACAGA GTGTCGTCAA CCCCGCCAAA CCTGGACAGA AGGACAACGC TCCCAAGCAA AATGCTATTC GTAACGTTGA GGCGTTGGAG AAGCTCGATC CATTTGCCGC CGAAAAGCGC CGTGCCCAAG CTCGCAATGA CGAGGCTCGT GCTTCGAAGA AGGCCGAAAC TTTGGCGAAA AAGCGTGATA GCCGCACGGC TAAGAAAGCT TTCAAGGAGC AAGGAAAATC GTTCTATGCA AAGGTTTCCC AGCAAGGAAC AGTCTGTGAG AATGGCTTTG CTCTAGAGTA AAGACCAAGC AAACAATTAA AGATATGGCG CCATGGATTT TTGCGATCTG AATATACCAC CTAGCATTTG AGGAGTCGTT CCTTCGTCAT GAAAGCTGTC TTTCCCGTAG TATTAGCCTC AACCAACTAG GCCACCATGT AGTTCACAAT CAGTGAAATA ACAAGCGCGA ACCACATCAA TTCGAAAGAA CTAATGTGAG AGGTTACACT AGTCTATTTA TGGACTAGTC AAGCCGTTGT ATCCATCTCG TGACTGGATT CGAACATACG TCATTACATT GTTTGCCGCG ACTCTGACAC ATTTCGGAGA GGTTGGGACC CGATTGCCAT CGCAAGATCG ACATCGAACA AACTTCACTG CTAGGACGGT AAATTACTAT TCACTGTAAT AACATTGCGA CGTGATTCAG TATGGCCATA CGGACCCGGA TAACTTGTGA CGTAGTCATT GGGGTTATCC TTCGGTCATG CAACTAGTCT AAACCTTGGG CACATCGGAT GATGTGCTCA CTTATTTTAT TATAAAGCAC TTTGCAAGAT GTACTTCTAG TCCCCTTGTG TGCTTGGTTG ACCAACATGC AAATTGATTG TCTCCGTGGA TGCGGGACAA TGGTCAACGA ACTAATTCAA TATAGTCCGG ATATCGTGTT GACCAGAAAT GTACCTACAC GTGGATATCA AATCAGACGA CATTGAGAGG ATCTGCGATT TTACCTAAAT CTGTACATCC CACAGATCTG AAAATCCACT ATCGGATCGA GTCTCTACAA CTACTGCCAT GAAGTACGTC GCCACTGCTC TCTTCCTCTC CCAGGCTACT GCCTTCACCA TCGTTGGCGC TCCTCGTCTC CAGACACGCC TCGCTGCTGC TGAGTACGAA GCGATGGATG GTGAAGGAAA AATTAATCTC AAGGTCTGTA CTTGCAAAAA AATCAGCATG CTTTCAAAGT TCCTTCCTCG CCTAAGCTTT GTTTTTTACT TTCTAAAGAT TGATTTGGAC TCACCGAAGG TTGCGACGAT GGATGACATT GAAAAAGGCA AGAAAGTCTA TTGCCGCTGC TGGTTGTCAG GAACCTTTCC CCTTTGCGAC GGTACCCATC AGAAGCACAA CGATGCTACG GGCGACAATG TTGGCCCACT AATCGTATCC GTGAAGAAGG AATAGGCAGC CTCGTCGAAG GAAGCCCCGC TTTTTGAATA AAGGACCATT GACTGTAATT CAATCAAAAG CCTGTCTTGA TGCCAAAAAA AGAACATACC TAAATCCAAT TATTCACGAT ATCGCCCACA CATATAGCAT GATAATAGTT CTTTTGGAAG TATTCAGACA TCAAGCATGC TTTGGACGAC TTGCTCTTGT ACGCTTGCCG AGTGTTTTTT TAGTTCAACG GATCGAATGC GCTTGATAGA TTCCGTCTTC CACATTGGAA CTCCTTCGCT TTCTTTCAGC ATCGCGTTTT CCGATATAAA GGCCTGGGCA AGCGCTGTCG CCAGCAATTG GCGCAATTGG ATAGTCTCGT CTCCCGGTGA TTGTCTCGAA CTCAGCGCAC ACAGTCTTGC AAGACATTGC ATTAAGATGG CTACACCTGT CTTGTCAAAG GCCGACCGTC GGGCAAACTC AGCTAGACAT GATGTGCTCT TCTCCTGCAA TTCCAATACT TTCTTGGCTA CCTGAAGCAC AGACCAGTGT AGCGAATAAG GAAGTGCGCA GGAAAGATCA CCAGTGTCGA CCTTCTCGTC TCGCAGACAT GCGACAATGT GCTCTGCTGT CAAGACAAGG TCATTGGACA GCATGGCGTC AAACAGTAAC ACAACACACT TGAGTCCCTT TGCTTCTGGT ATTGCGATTA AATCGCTATC CAGCAAATCG ACTGCTTCAC GAGCACTAAC CGCTACCTTG GAATTCGCGG GTGAATCGCC TACCAAGATC GTAAGCAGTA ATCTGGCTGT TTCAAGGCAA ACAGATGTCA ACTTTCGATA ACAGGATCCT TGATGAACAA CAGGCAAAAG ACTGGCTTCC GATTCAATAC CGACTATTAG CCATTGCATA GCCTCCCGAA GGCAATCGTT CTCGATGTAG TGTCGATAGC GTGACAAGAT TATCGCAACG TAGGCGTCCA AACCAAGCAA ACTCTTTCGC TCAACAAATG GACGGCAAAG GGATAGAAGC TCCCGCGCCT CAGCATCAGT GGTAACCAGA AGGTGCTCAA AAATAGAGGC AATGCTTGCG ACTGTCTCAG GGCAGCCAGC GTCGAGAGCT ACCGGTGAAA GCAAGCGAGT TGCTAAACGT GCGCCAGCGT CGGAATTGTT ATCTTGAAGA GAAGAAATAA TCGCTGAAGC TGCCTCGCAA ATGAACTCAG TCTTTCGAAA AGCTGATTCA GAAGGAAGGA TGTTGACTTC GATGTAAGAA TTATAGAGGA TAGAAAGCTT CATAAATACA GCAAGAGAAA GGTTTTCGCG TGAGGAAGTA GCATCTAGAT GTGCCGAAGT CCAGCGCTTT TGAGCTCGCT TACCAAGGTT GTCGGCAAAG CAGCTCCACT CGCTCCTCCG TAAATATACA TTATTTCAAT TGCCCGACCT GCGTTCCCAT TCATAATACT GAGTAGTTCT TCCATTGGCC CATTGTCAAT TCCCCGACGT TCCATCTCTT TCATAAACCG GCTGGCACTC GCACTCATAT TCCTCGGTCG TATATCTGGT CGAACGAAGA GGAGCTCAGC GCACATCTGC TCGGCCCACG AATCGAAGGA AAGCTGCTTC AGTTGTCCAC AGAGAATAGC CATGATGGAA TCGAGTTCAG GAATACGGCG CGAAAGCTTG AAAGACCCCC GACACTCGGA TACATACACC TGCCATGATC GATGTTTCTG CATCGCCGCG TCAGAGTTGT ACACCATTGA GAAATTCGTT TGACCGCTAG TCGCGCCTTC CCAGAACTTG TAATCCGAGC GTTCTACAGT CAATCCCTCC AAGAAGTATT CTGTCTGTTC AATATCGTCG TCGTGCCAGT CAGGTGTGTC GAGGCAATCA TCATAATAGT CGTTGCGTCC ACCCGGAAGG GGAGCTCTCA GCATCAACTC TTGAAGAATC ATGAATCCCT CACGCGTTTC GTCGAGAGTC GATACCAAGT AAGGGTCATC CGAATACGAA TTGAAATGCT TGTTTGAGCG AACAAACATA GAATGCGTAG ACAAAACATC CCAGGCCTCT TCCAAACATC CACGTAGCGT TAGGGTACGA ATACACGCCC AAAACAATTC GCCGTCCCCG TATTGCTCTG GCTGGGCGGA AGCACTCATT TCCTGTATCG ATGGATAGAT TTTTTCAACC TCGAGCATGT GTTGACATCG AAGATAGCGT ACCATATCCG CTGTGGCCAC TCCTGGCTTA TCGAAGGGGT CCTGCTGAAA GTTAATATCG TTGTGAGACG GCAACAGTGG TAAAAATACG TCAGAGAGAT GCATCACGGT GTAAATCGCT TTGAGAAACT CGAGACTGTT GCAATCGTCC GAAGACAAAG CATCTGGACC CCCCTGCTGT TGCTCGTCAT CCCATCCTTT CACACAATTT TGAACAGCCG AGCGACATTG CAGTGAAAGC TGTTGAAAGG ATTTGAAAGA TTCTCGCTGG TCGTCTTCGT AAAAATCTTC GTGCGCTCGG ATAATCGCGT ACAATGCGGA CAACTCGTGC TTGTTGCAGC CTGCACCTGG ATAGGCAAGT TGCTGAAGTA GAGCTGGATT TCCATCTTCT GAAGAAACCA AGTAAAGGGA AGCGGGTGAA GATGTGTCCC ACAAGAAAGG GCTACTCTCA CGAATATATG AAGCCATCTT TGAAGTGATA TACTTGCAAT CTAAACGGGT AAGAAAGAAG CTGCCTGTCG ACGTAGTAAT TGTATTTGTC TTTGATCGGG ATTGCCGGCA AACATTGCCG TGTGCCGGTT GCGTGCTACA GGATGCATTG CGCATGTCCA AAGCATGCTG CAAGTTTTCA TACGCAGGAC AGACGTGATA TTTTACAGTT GGCTTGTTAA GGTGAATGAA AAGCGATAA
|
Protein sequence | MPSHAPFYIE HLLYAPSQQS SKPETMAARP LVSVFSLSGD KSGDVSLPAV MTAPLRPDIV QFVHTNMNKN HRQAYAVNIR AGKQVVASSW GTGRAVARIP RVGGGGTSRS GQGAFGNMCR GGRMFDPTKT WRKWNKKINI SQKRYAVASA LAATAVPALV MSRGHVVDNV PEIPLVVENA VESAKKTSAA KDILSAIGAL DDVEKAGESK QIRAGKGKMR NRRYTLRRGP LVIYKSNDGV EQAFRNLPGV ELCCVDRLNL LQLAPGGHMG RFCIWSQAAL EELDIIYGEN GKRIPQAAMT NADLARIINS DEVQSVVNPA KPGQKDNAPK QNAIRNVEAL EKLDPFAAEK RRAQARNDEA RASKKAETLA KKRDSRTAKK AFKEQGKSFY AKVNEKR
|
| |