Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47701 |
Symbol | |
ID | 7202887 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 573838 |
End bp | 577110 |
Gene Length | 3273 bp |
Protein Length | 795 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181935 |
Protein GI | 219123237 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATGC GCGTTCGTCG TGAGGAGCCC CGAAAATCCG AGCACCCGAA GGAATTGCCG GGACGATACC GACGGGGAGA AACGAAAATG TCGTTGGAGG CTGAGGGGGG TTAAAGCGTT ATCCCAAAGG GAGCCGGCCG ACGCGCAGCC CTCAACCCTC TGCTTGGATC GACGGAATGC CCAACCAGAA TTCTACTCGA GTCTTGGACA GGACTAAAAG TTCAATCGTG CACGTCGTCT CGACGAAATG CTATCTACCA GTACGGTTGC CCGACATAGT GTCCCCTTTA TCTCTATATA TGTTGACTTT GTCTAACTCT ACAGGACAGA CAAATTTTGC TGGTATGACT GCGAATCCTT AAGTGTCTCA TCGTTGTTTA CACAAAAGGC GTCGGGGCTT GACTTTGTCC CTGGTGTCCT TGTTCTCATA CGGGTCATTC GTAAGTAGAA ATTTAGAGCG GCTTGCATCG GTGCTTCCGG CCAAGCTTTG TCCGCCCTTG TGCAGGATTT TCCGTTCGAG CCGTCTCTTT CGTTTGGGCC GAACGCCCAA AAACCTCAAA ACACATGGAT GGTACTCGAC GTGTAAGTAG TAGATACGTA GGTAGTTTGT TGGAGAGAGA GACCGACCGC ACACATCAAT CACACACATA CACACACATA TACACACATT ATCATACCTG ACGATCCATT GTCCCGGTAC GGACAGCTCT CGGGACACGA CAGTTTCCGC CAAGTTCGGG CGAATCCACG CGTTCCGTTC GCTCCCTGCC TCCCTTTCTA GAGTATTGGG CCTTTCCCCG CAGACCCATT TGCGTTGGAT TTGCTCGAAC GAACACACTG TTCCAAACGC AGTCAGTCGA CACAAACACA CGAGCATATC TACTTGGACA GCCGTTCGAG CCCTTGTCCT TTGTTTTTAG ACCTTTTAGG AGACCAAAAA ACGGGAGCTT ACTGTGAAAA CTTGTCGCTT CCGAACAAAC CGAGAGGCCC ACACCCCTCC TTCGTCGTGT CACTACCAAC ACACACACAG CTCCCCATCA TGTCTTGGCT GCGACGGAGA AGTGTTCTTC CCTGGACACT CGGACGCCAT CGGTGCGTCT CGTGGTGCCT CGTTCTCGTT GCTCTCTCCG AATGGAGCGG ACCCCACTCG TATGGTACGT ACACGGATTT AGTTGTTCTC TGTTTCTGCA GCGTTACTTC CGGATACATA GCCATGGGTC AACAATCCAT CCCGTACTCG TCAGCTAGGC TGTTTTCCCT TTTTCTCTGT CTGTCATTCA TCATCTCACG CTCCTCTTTG TATTTCCCTG GAAGTAGCAT GGGCTGAGGA TGTCCAAACA CCAGTCGCGG CGCCAACGGT TTGGTTGCCA GAACAACAGA TGACGGAACT AGTTCCGATT ACGGACCAAC AGTTTACCTC GGCACCGGCA GCAGCGCCCG TGGCTGTACC TGTGGTAGCA CCCACACGAG CACCCACGCC GGCTCCGACG CGAGCCCCAA CCCGTCCACC GGCGCGAGCC CCAACCCGTC CACCGACGCG AGCCCCCACC CGTCCGCCCA CGCTGCGGCC AACGGTTGCA CCCACCAAAG CACCAACCGT ACCTCCCACA CCGGCGCCGA CACTCTCGCC GTCGTTTGTC CCGTCGGATG CTCCGTCGAT TCTGCCTTCG AAGGCACCCA CCGGCATGCC AAGCTCCCTT CCTTCCACCC TCGCTCCCAC CACAATCGCG CCCACGACTA TCACTCCCAT GCCCACATCC TCCTCGCCAC CATCGTCCAT GCCGTCGCAG GGGCCGAGTC GCGCACCAAG CCTCGCACCG TCACTTGCTC CTACCGGCAA TCCGTCAGCC CCACCCACCA ACAGACCGAC TGCAGCACCG TCGACTGCGC CAACGCTGTC TCTCCAGCGA GATACGGTGG ATATTACAAT GCGCATGACA TCCATCCCGG GACGACTGGA AAGATCTTCC GCCATTCAAT GGGAAGCCGC CACGGCCGAG CACATCCGTC GGAGTATCTT GGCGCAAACT ACCGACAGGC CACTCATGGA ATTGATGATA CGGACCAACA TTGAGTCCCA ATTCACGCAA GCGTTTAATG CCGGTCGACG AGTCGTAGTG CACATGGCCG AAGAGGGAAA CGCGATCGGC GGTCCGCGGT TTTTGCAAGA AGTCGTCATT GCGCCTCTGC GAGTTTCATT TTTTGCAACA GTCTCTTTCC GATCCACTTT TGACAATTAC GACTGGGCCA GTTTGATTGG TGACGCCTTC AACAGCGACG ATGAACGATC AGCCTATGTT GCACGTTTAC GGGCGACCGG AGACAGGGCG TTTGACCCAC TCGGGAGTGT AACTCTATTG GTGGAAGGGG AAACGCCCAT TGAAGAATTA CCCGACCAAG ATTCCGAGGA CAGCGGCGGG AACAATTTGT TGATTGTGAT CGTTGCGTGT ATCGCCGGTG GCAGCGTCCT CCTAGCCCTC GTGGGACTCT TTATATATCG CCAGTCCTCC TCCGCTCCGG ATATCAAGGT AACTCCCAAG CTTGTTGAAC AGCACCACAG TACGAGCCAA AGTGTACCGA GCCAGCGTGC GGGCTATTCG ACGGAAATTA ATGTTGACCG ACAGGACGAC ATTAGCACGC TTGGCGACCC TATGTTTGGT ATGGGCGGCA TGCACTTTGG TGCTGGCGAT GGCTTACAAC GGGACGAGCA AACTGCCAGC GTTGGCAACG ATTACGACTA CAACAAGGAG TATCTGCATA GCCAGGGTAT TGCCTTGTCG ATGGAGGAGA GTAGTCGAAG TCGGCTCACT TCGACGGACT CGGATCGCGT TTCGGGTAAC TCAACTTTTT CCAAGATGGG TAAACTCAAT CCAACCGTGT TCGCCGACGA CTCGTCGTTT GAAGAACAAT TCGTCGAGGA GGAGGAGGAG GAGGAAGAAG AAGAAGAGGT GGAACGGTTC ACCGTGAACG TACCGGCCGG AAAATTGGGG ATGGTAATAG ATACGCCGGA AGGTAGTCTT CCTATTGTCC ACGCCATCAA AGAATGGAGC ATTCTGGAGA ATACCGTCAA GATGGGCGAT AAACTAATAT TCGTAGACGA CGAAGACGTG ACGGAAATGA CTGCCGTGGA AATCTCCAAA CTGATTTCAC TCAGATCTGA TCGGCCGCGC TCACTCGTCT TTCACCGCGT TCTTCCACGC AGCGATTTCA TTGATATGTA CTAAAAGTTT CGACAACCAA CAGGATTGCC CCTCGTGTTG TTGTCGCCAT GTC
|
Protein sequence | MAMRVRREEP RKSEHPKELP GRYRRGETKM SLEAEGGQTN FAAACIGASG QALSALVQDF PFEPSLSFGP NAQKPQNTWM VLDVSPSCLG CDGEVFFPGH SDAIGASRGA SFSLLSPNGA DPTRMPWVNN PSRTRQLGCF PFFSVCHSSS HAPLCISLEV AWAEDVQTPV AAPTVWLPEQ QMTELVPITD QQFTSAPAAA PVAVPVVAPT RAPTPAPTRA PTRPPARAPT RPPTRAPTRP PTLRPTVAPT KAPTVPPTPA PTLSPSFVPS DAPSILPSKA PTGMPSSLPS TLAPTTIAPT TITPMPTSSS PPSSMPSQGP SRAPSLAPSL APTGNPSAPP TNRPTAAPST APTLSLQRDT VDITMRMTSI PGRLERSSAI QWEAATAEHI RRSILAQTTD RPLMELMIRT NIESQFTQAF NAGRRVVVHM AEEGNAIGGP RFLQEVVIAP LRVSFFATVS FRSTFDNYDW ASLIGDAFNS DDERSAYVAR LRATGDRAFD PLGSVTLLVE GETPIEELPD QDSEDSGGNN LLIVIVACIA GGSVLLALVG LFIYRQSSSA PDIKVTPKLV EQHHSTSQSV PSQRAGYSTE INVDRQDDIS TLGDPMFGMG GMHFGAGDGL QRDEQTASVG NDYDYNKEYL HSQGIALSME ESSRSRLTST DSDRVSGNST FSKMGKLNPT VFADDSSFEE QFVEEEEEEE EEEEVERFTV NVPAGKLGMV IDTPEGSLPI VHAIKEWSIL ENTVKMGDKL IFVDDEDVTE MTAVEISKLI SLRSDRPRSL VFHRVLPRSD FIDMY
|
| |