Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47663 |
Symbol | |
ID | 7202867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 453022 |
End bp | 454920 |
Gene Length | 1899 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181910 |
Protein GI | 219123185 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.55455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATCGTCGC AATGGATTTT CACAGGCTTG TTTCGAACGT GGGGTCCGCA AGCTACTTCG TTCAAAACGG GTCAGATTAG AAGCGTGATG ATAGAGTACC AAGACTTGCT CGTAACTGAC TAAGATCGAG ATCACATCCG ACCTCCCTCG CAATCGATTC GATTTGAAAG AAAGAAAGTC ACTCGTTACA AAATGAGTGA CCCTTCCAGT AAGCAATGGA TTCACATTCC TGATTCATCT AGCTTCTGGA CTAGGTGGCG TATGATTTAT TGCCTTTTTA GGTGTTTCGG TGTCACAGTG TCTGCAGCTT TCCAGTTTAC TCCATTCAAT CGCGGAACAA AGTTGGCTGT AGTACCTATG GCAAATTACG CTCCGATCTC TGAGGGAGTG AATGATGAAA CACGACCGCG CAGTATCTTT TTCCGTTCCC CATTGCTGGA TTACGGTTAC CTTCCCGTCG TCGAGGAATA CGAAAGCGGT AGTCTAGCGA GAAAGCCGCT GTTACTTTAT CTTCCAGGCT TTGATGGATC TTTTCTAAGC GCGTTTCTGC AGTATCCGGA ACTTTCGACT GCTTTTGATG TTCGGTGCAT GAGTATTCCA GCTTCGGATC GATCAACATT CAATGAACTG AAAAGATCAG TACTACAATA CCTGCGTATG GAGATAGCGG AATCAATAGT TGGAGATTTG GATCAAAGGT CCAGTCGGAA CAAAACCCAA CCTATTCTAA GCTCTAGTCC ATTCGATCAA ATATTCTCTT TTACCAAGGG CGCCTCCTCA AAAGCGGTAT ACAAGAGGAG CAGCCGGTCA GTATACCTTG TCGGCGAATC ATTTGGTGGC CTTCTAGCCA GTGAAATTGC CTTGTCGATT CTTGAGAGCG AGAAAAGCCA TGCGAATAGC ACTATCGATT TGCAAGGACT CGTGCTCGTT AATCCAGCTA CATGCTATGA CCGGTCTCGC TTAGCCGCCT TAGGACCGCC TGTGGCCAAC AGCGTACCAT GGATGTATCC AGCCAACTTG GCAAAGCTCC TGCCCCTCTT TACCGACGAG TATTCTTTGG CTCAATTGAG ACTAATCGTA CAAGCCAAAG CCTTGCCCTC TGTAATTGAT GATGCTCCCC GTGAAGCCTA CTTGGGACGT GTGGCATTAT CATTGCCTTT CATCTTTCCC TCCATGCCTC AAGCCACTTT GTCGTGGCGG CTGTCTCAAT GGTTGGAATT TGGATGTGCT AGTGCCGAGC AGAGGTTGAC GGGTCTGGCT GCTTTCCCTA GCTTTCGTGT ATTGATTGTC GCGGGGGAAT TCGATGCCTG CTTACCATCA ATCGACGAAG CCGAGCGTTT GGTTAGTGGC GTCTTGCCCA ATGCCAAGGT GCACGTTGTG GAGGGTGCTG GGCACGCGAG TACCTGCGGT AGTCGGATGG ACTTGACAGC TGTTATGCGC AACTGCTTTG TTGAACTACA ACAGAAAAAT GGACGCCGTT CAGTGACCTT GCGGACGGCC ATGAAAAACG AAGCGGCATC AGGCATAGAA GAGTATTTCG GCATGCAACC GCGATACGAT AACGCGACAA TTGGATTGAA TCCGTTACGC TACTGGAGTC CGGAATTATA CCTAAAGCAC CGACCTAAAA CCGGCCCAGG TCAGCGGAAA ATTTCTCGTA CCACCAGGCA CAAAGGATAG GTAGGTACAA TCGAGCGACT GGAATTTTAA ACCGATATAC TGTTATGGTA CCATTGGGGC TTCTCTTCGG AAAAACATCT TACAGAACTG TGCCGAAAAA TAAAGGTGAG ATTCCTATAT CCGTTAATAG TAAGGAGATA GACGTTCGAG GATGATCTGA GGCTCTGTCT GGAAATTTTG AAATTGACAA CAGGTATGAC CTTGCTGTTA GCGTTTGGA
|
Protein sequence | MSDPSSKQWI HIPDSSSFWT RWRMIYCLFR CFGVTVSAAF QFTPFNRGTK LAVVPMANYA PISEGVNDET RPRSIFFRSP LLDYGYLPVV EEYESGSLAR KPLLLYLPGF DGSFLSAFLQ YPELSTAFDV RCMSIPASDR STFNELKRSV LQYLRMEIAE SIVGDLDQRS SRNKTQPILS SSPFDQIFSF TKGASSKAVY KRSSRSVYLV GESFGGLLAS EIALSILESE KSHANSTIDL QGLVLVNPAT CYDRSRLAAL GPPVANSVPW MYPANLAKLL PLFTDEYSLA QLRLIVQAKA LPSVIDDAPR EAYLGRVALS LPFIFPSMPQ ATLSWRLSQW LEFGCASAEQ RLTGLAAFPS FRVLIVAGEF DACLPSIDEA ERLVSGVLPN AKVHVVEGAG HASTCGSRMD LTAVMRNCFV ELQQKNGRRS VTLRTAMKNE AASGIEEYFG MQPRYDNATI GLNPLRYWSP ELYLKHRPKT GPGQRKISRT TRHKG
|
| |