Gene Information |
Locus tag | PHATRDRAFT_44485 |
Symbol | |
ID | 7197714 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 705372 |
End bp | 708493 |
Gene Length | 3122 bp |
Protein Length | 859 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178559 |
Protein GI | 219115527 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
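The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive positions (which reproduces the stated 3122 bp), and the helper names `gene_length`, `cds_length` and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of the gene on the replicon, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding nucleotides implied by a protein length (3 per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as used in the grouped sequence display) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    span = gene_length(705372, 708493)   # coordinates from the record
    coding = cds_length(859)             # protein length from the record
    print(span, coding, span - coding)
```

Under these assumptions the gene span is 3122 bp while an 859 aa protein plus a stop codon accounts for 2580 coding bases. The remaining 542 bp is consistent with the record marking the gene as spliced. `gc_percent` can be applied to the gene sequence in the Sequence section to compare against the listed GC content.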
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCAA ATTACGTCGC GCGTCGCGGG GAAAGCCTCC ACACCCTTGA CTCTGTCCTT CGTCCGCTGC GAGAGCAAAA TCCTCACTTC GCACCAAATG TTGGTCTGGC CGTATACCGT ATTCCGCTGG GCCCGCTATC GACGAACACA CCAACAGCAG GTGTCATCCA CGAGACGGCG ACAACGGTAC AACTCTGCGC GTTCCGAAAC ATACAAACAT ACTCACATAC TTTATTCGCG TACAGACCAA ACAACACGAA CGAACGAACA GACAAACACA CATACACACG CTAGCACTAG TAGAGAGATA TCGCCACCAC CAACAATAAC GATAGCAAAT AGTATCAAGT GTTTCAGGGT CATCACCGTG GTTAATCTTC GGACAGCTTT CACACCTATT TCCTTCGTTT CAACAGACAT TCTTTCTACA TACCTTCGCA ATGCCGTCGA ACAATACCGG CGGTAGCCGA GCCAAGAACA AGGACGGCAA GCTCATTGAT AGCGTAAGTC TTCGAGGATT GATATTGAGT GATAAAGCCC TTTTGGGGAC AGCGGGTTTC GACATAGTCT CCATGGATCC ATCGTCCTAC TACCGCGTTC AATACGTACG GTTCCAAGAA GAACGCCTCT GGAATGAGTT TCACCGGGGC TTGCTGCTCT CTTCGTGACC CTCCTTGGTC TATCCGGATC TTGTTTACTC TGCTTTCGAC GACACCTTCG CTTCACATCA ACCTTATAAA TGTATTCTCT CACCCTTTTC TTTTTGCGGC ATGCAATACA ACACAACATT CAAATACCAA CCCGTATGTC GTTAGCTTCG GGAAGCCTGT CCTCCACACA AACACAAAGA CCTCGCGGCG CTGGTCAAAT CGCTCAAGGG TGACGAGGAA AAGATCCGTC AAAAGATTAT GGAATGGTGG GAGGAACAGC CCGTTTCGAC CGAAGAAGAA TGGGAGGACG TCAACAAGCG CATCGCCAAG AAGAAACCCG AAGTTCGTGG GGGTCGTGGA CGCGGAAGGT CCGAAGGCCG TGGACGTGAG GCGGGACGCG GAGGCCGCGG CGATGGCGGC CGAGGCCGTA CCGGTGAAGG GCGCGGGGCG GGTCGGGGGC GCTCGACAGC CCCGCGAGCC AACGACAGGC CCAAGAACAA TACCACGCTC ACAGAGACAA GTGCCACTGC AGATAAACCC GTTGCAGATC CCGAGATTGG AATTCCCAAC TTGAACTCGG TCCCGGCTCC ACTCGGAGCG TGGGCGAAAA AGACCGGCGA CTCCATTCCG GCTGAAGTTT CTGCTCCCGA TCCGGTCCCG ACTCCCGCGC CAGTGGCTGC CGTAACACCT CCTTCTCCCG TAGCCCCAAT GGTATCGACT CCTGCGGCTC CCGGTATCCG TGCGACAAGT GGAGGGAACG TCTGGGCCAC CAAGGGATCG GCGCACCTTA TCCGTGCGGA AAAGCCCAAA CCGCCGGCCC CGGCTGCCCC ATATGTACCA AAAGCAGAGG CACCCCGGGT GCGACGGACC GGGGTCTCCC GCGAGGCACC CACGACCACT GCACAACCAC CCGCTTCTGT GCATATGAAT GTGCCAGTCG CTCCTGCTCC TCCTGCACCG GCTACCACTG CACCAACCAC AACGGCCAAC GCATGGAGCA AGAGCAGTGC GGTTTCAGAA ACATCCAAAG TAGACCTTCC ACCCTCAGCT GTGGGTAGCA AACATATTGG ATCCATGTCA CCGGCTGCTC CTCCAGCTCC GGCCGCCCCT CTAGAAATGC AGCAACAGAC CAAACCGCCG AAGGCACCTG 
GACCTGTCTT GAATATGGGT CGCTGGGAAA CAACTGATGC CGACGACGCT AATTTGGACT TTGGATTCGG CTCCTTTGAT GATGCGGGTG GCCCAGGTCA CCAGGTGAAC GCCAGCGTAA CGGAGAACGA AATCGCTCCT CCCGCACCGG CGGCGTCTCC GGCTAGACCT CCGCCTGGTC TCTCCCTTAC AGGCATTCCG CCGATGCCAA GCAACGCTGT CATGGTGCAC GAGCTAGAGA ACAAGCTCGA AGGAGCCACA CTCAACGCCA GCGCTGGTAC CGGTGACAAC AATTCTCATC CGCAAACCAG TGGACCTTCC ATGAACAGCA CTGCACCGGG CATGTACCAG GGTGGATACG GCCAACCCTA CGGTATCCCT GGTAGCAATA ACATTGCGTC CTCCATGGGT ATGTACAACT ACAACGCGCC CGGCGCACAG GGCAATGCAT TTGCTGGCAT GCCCGGCGGT GTTCCAGGGC TGGGTGGACC CTCTCAGCCA AAACTTGGCG GTGGCATCCC TCCAGTCCAG GCCGGCGGAT TGTACGCCGC TGCACAGCCT GAGCCAAGCT CCGGCAACGA ATCTGGATCG ATAGCGGCAT CGAATCCGAC TGATCCTAAT GCAACCCCCG GTATGCCACC AGGCATGCCT AACATGCCTT ACGGAAACCC AGCGCTTTAT TATGGAGGTC AAGCTCCTTT CCACATGGGA CAGCATCAAG GCGGTATGGG TTACAACTAT GGGTATGGTG CCCAATTTGG AGGCGCTGTG CAGGGTGGAT TTGGATACCC TCAAGGTATG GGGCAGAGTG CTGGGTATGC TCCCCATTAT GGTGATCAAC ACGAGCAGCA AGGATCACAC GGCAACAGCG GTGGCTACCA GAAAAACAAT GGCAATTACC GTGCACGCAA TCAGCATCAC AATAATAATC AGTATCACAA CCAATATCAA CAGCATGGTG GTTATGGCGG CCAACCATAC AATATGGGTT ACCAAGGTGA TCATTTTCAA CAACGTGGAG GATACGGCCA GCACGGAGGC ATGCCCGATC CGTACAACAT GCAACAGCAA CCTCAACAGC ACCAGGGAGG GGGAAACTAC GGAGGAGGTT TCCAAGACGA CGAACAGTAC AAGGGTAAAA AGGGTGGCAA CCGCCCCTTT CAACAACAGG GTCCGCCTCA GGCTCTGGGC ACTGGACAAC AGACATTTGG CTTGCAAGGA CAAGTTGCCG ATTCAAGCCA ACCGTCTAGC GGCTGGTCCA ATCAACAGGG AGCTACTGGT GGATGGGGCG GTGGTACGCC AAGTTGGCAA CAAAACAAGT AA
|
Protein sequence | MRANYVARRG ESLHTLDSVL RPLREQNPHF APNVGLAVYR IPLGPLSTNT PTAGVIHETA TTTFFLHTFA MPSNNTGGSR AKNKDGKLID SLREACPPHK HKDLAALVKS LKGDEEKIRQ KIMEWWEEQP VSTEEEWEDV NKRIAKKKPE VRGGRGRGRS EGRGREAGRG GRGDGGRGRT GEGRGAGRGR STAPRANDRP KNNTTLTETS ATADKPVADP EIGIPNLNSV PAPLGAWAKK TGDSIPAEVS APDPVPTPAP VAAVTPPSPV APMVSTPAAP GIRATSGGNV WATKGSAHLI RAEKPKPPAP AAPYVPKAEA PRVRRTGVSR EAPTTTAQPP ASVHMNVPVA PAPPAPATTA PTTTANAWSK SSAVSETSKV DLPPSAVGSK HIGSMSPAAP PAPAAPLEMQ QQTKPPKAPG PVLNMGRWET TDADDANLDF GFGSFDDAGG PGHQVNASVT ENEIAPPAPA ASPARPPPGL SLTGIPPMPS NAVMVHELEN KLEGATLNAS AGTGDNNSHP QTSGPSMNST APGMYQGGYG QPYGIPGSNN IASSMGMYNY NAPGAQGNAF AGMPGGVPGL GGPSQPKLGG GIPPVQAGGL YAAAQPEPSS GNESGSIAAS NPTDPNATPG MPPGMPNMPY GNPALYYGGQ APFHMGQHQG GMGYNYGYGA QFGGAVQGGF GYPQGMGQSA GYAPHYGDQH EQQGSHGNSG GYQKNNGNYR ARNQHHNNNQ YHNQYQQHGG YGGQPYNMGY QGDHFQQRGG YGQHGGMPDP YNMQQQPQQH QGGGNYGGGF QDDEQYKGKK GGNRPFQQQG PPQALGTGQQ TFGLQGQVAD SSQPSSGWSN QQGATGGWGG GTPSWQQNK
|
| |