Gene Information |
Locus tag | PHATRDRAFT_14391 |
Symbol | |
ID | 7203138 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 542454 |
End bp | 544136 |
Gene Length | 1683 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182249 |
Protein GI | 219123890 |
COG category | |
COG ID | |
TIGRFAM ID | |
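The coordinate fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic, using the hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
length = gene_length(542454, 544136)
print(length)  # 1683, matching the listed Gene Length

# A 525 aa protein needs 525 * 3 = 1575 coding bases, so part of the
# 1683 bp genomic span is not translated, as expected for a spliced gene.
coding_bases = 525 * 3
print(length - coding_bases)  # 108
```

The 108 bp difference is only the gap between the genomic span and the translated bases. The record does not say how that gap divides between intron sequence and any stop codon.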
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0183197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCA ATGTAGACGA AAACAGGGGA GACGCTCGTG GAGCTGCACT CTTGTTGGAG GGAGTCACAG TGTATCGCGG TCCAGCCGAG ATTCTTAGAA ACATCGACTG GCGCGTGGAA CCACGAACTA AGTGGGCTCT AGTTGGTGCA AATGGAGCCG GAAAATCAAC ACTTTTAAAA GCCCTTGTTG GGGAAGTGGA TTCGCGTGGA AAAATTGTGA TCGGAAACAA GGAACAAGTG GGGTACTTAC AGCAGACAGC TGTTGCTGGA AGCAATGGCA CCGTCTTTGA AGAGGCCTCA TCTGGAATGC GCGAACTGAA TACGGCTAAA CAAGCAATGG AAAAATCTCA AGAAGTGGGT GATTTACAAG CCTTAGAGAG GGCAACGACA AGATTTGAAC TCATCGATGG CTACAAACAA GAGCAGAAAG TTGCCAGTGT TTTGAAAGGT CTGGGGTTTA CAAACTTTGA AATGCGTTGC CACGAGCTGT CCGGTGGATG GCAGATGAGA GTAGCTTTCG CACGATTGCT TCTCAGTGAG CCAACTCTTT GCCTGATGGA CGAACCCTCC AATCATTTAG ATGCGGCTGC CAAGAAATGG CTTGCAAAGT ATCTTGCTAC GTACGATGGA GATGGAGCCA TGATTCTAGT CACCCATGAT GTGGACCTAC TTAAATCTAT GGATCATATT GCTGAGGTTG TACCTGGAGC AGGAAGCTTA CAGATTTACA AGTCGTGCAA TTACAACCAG TACTTGGATT TGAAAGAGCA ACGGGCAGCT GCCGCAATTT CTCAGTATGA ACGAAGTACG GAAAAAGCTG CCAAGCTACA AGCTTTTGTG GACCGCTTCG GTGCTTCGGC AACGAAAGCT TCAGCCGCAC AATCCCGTGT CAAGATGCTT GAGAAAATGA AGCGAGACGG ATTGCTGAAT GCACCAGCGG ACGATATCAT TGCACAACGC TTCAAGCCTT CGTTAATACT CCCGGATCCT CCCCGAGCCA TTGGTGAAAA GTTGATCTCT CTGCAAAAAG CTGGTGTGGG CTATGATGGA GAGGTGCTTG TATCAGATAT CAACATTGAT ATAATGAAAG GTATGAAACT TTTAATTCGT GGGCCGAACG GAGCTGGAAA GTCGACGGTG ATGCATTCCC TTCGTGGCTC AATTTCATTG ATAGATGGTG ACAGAAGTAC AAACCCCGAC TTGCGGCTCG GGGTGTTCAC CCAGGATTTA GCTCAAGAGC TTGACCCCAG TGCCCGGGCT GTCGACTTAG TCACAGCGTA TGCTCGTACA GGGCTGGATG GAGATATTAC TGTCTCGGAA CAAGAGGCAC GGGCGGCGAT GGGTAGACTG GGTCTACAGG GCGAAAAGGC TTTACGTCAC ATTTGCGATT TGAGCGGTGG AGAAAAGGCA CGTGTAGCTT TGGCGATGTT CGCTTTGAAG GCTAGCAATG TTTACTTACT GGACGAGGCG TCTAACCATC TCGACTCAGA ATGGTACGTT ATAGAAGCTT CTTTCAGGAG GTTCTGTTGT AATTTTATGC ATCTATCTTT TATTGTGTTT TACGCATCTG TCTTACTCAA CAATGCACTG CTTTTACATA GCGTTGAAGC CCTTGGTGAA GGGCTCGGAT CCTGGGGCCA CGACACTGGC GCAATGGTCG TAATTTCTCA TGACAAGTCG TTT
|
Protein sequence | MTTNVDENRG DARGAALLLE GVTVYRGPAE ILRNIDWRVE PRTKWALVGA NGAGKSTLLK ALVGEVDSRG KIVIGNKEQV GYLQQTAVAG SNGTVFEEAS SGMRELNTAK QAMEKSQEVG DLQALERATT RFELIDGYKQ EQKVASVLKG LGFTNFEMRC HELSGGWQMR VAFARLLLSE PTLCLMDEPS NHLDAAAKKW LAKYLATYDG DGAMILVTHD VDLLKSMDHI AEVVPGAGSL QIYKSCNYNQ YLDLKEQRAA AAISQYERST EKAAKLQAFV DRFGASATKA SAAQSRVKML EKMKRDGLLN APADDIIAQR FKPSLILPDP PRAIGEKLIS LQKAGVGYDG EVLVSDINID IMKGMKLLIR GPNGAGKSTV MHSLRGSISL IDGDRSTNPD LRLGVFTQDL AQELDPSARA VDLVTAYART GLDGDITVSE QEARAAMGRL GLQGEKALRH ICDLSGGEKA RVALAMFALK ASNVYLLDEA SNHLDSECVE ALGEGLGSWG HDTGAMVVIS HDKSF
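The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before computing values such as length or GC content. The following Python sketch shows one way to do that. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not state how the database computes its own GC content figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used as a short demonstration.
fragment = clean_sequence("ATGACAACCA ATGTAGACGA")
print(len(fragment))         # 20
print(gc_content(fragment))  # 40.0
```

Applied to the full gene sequence, the same two calls give values that can be compared with the Gene Length and GC content fields of the record.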