Gene Information |
Locus tag | PHATRDRAFT_42953 |
Symbol | |
ID | 7196782 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1616609 |
End bp | 1618699 |
Gene Length | 2091 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176818 |
Protein GI | 219110133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
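The length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names (`span_length`, `coding_length`, `gc_percent`) are hypothetical and not part of the source database, and it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding sequence ends with a single stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def gc_percent(sequence: str) -> float:
    """Percent G+C in a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
gene_bp = span_length(1616609, 1618699)
cds_bp = coding_length(557)

print(gene_bp)           # 2091, matching the listed Gene Length
print(cds_bp)            # 1674
print(gene_bp - cds_bp)  # 417 bases not accounted for by the translated product
```

Under these assumptions the coordinates reproduce the listed gene length of 2091 bp. The 417 bp not covered by the 557 aa product is consistent with the gene being marked as spliced, although this arithmetic alone does not locate any intron. `gc_percent` can be applied to the Gene sequence field to compare against the listed GC content.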
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.677298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCT TTCGTCTCGC CATTTTCGCC GTCTTTGCGA CCTCTGTGCA AGGCAAAACC AACGAAGATC AAAAGGCAAC TAGTAAGCTT ATGGTGCATG TAAGTTGCGT AAGATTTTGA GCTGTTCTTC ATCTATCGGG ATTCACTTTT TGAATCAACT ACTATTTTCG CTACGTTGAG GAAGGCGAGA AATGTAATCA CGTTGATGTC TGAAGCTGCC TTTAGGAATC AAATTGGTCA AAATTGTAAC CTTTCTGTAT CAGGATGAGT TGTAATAAGC TTTTGATGGT TCCAGACCTT CTTTTCAGTG CATGAAACTG TTTTTCGACA AGTTTTTTCA AATCCCGTTG TCCAGTGTGT ACGTGACAAA TACACTCACT TACTCTGTAT TGCTTATGCT TTCAGATTCC CCACATGCTT TACAAATCCG CAGGATACGA TCACCGCGAA GCTCTATTTG GCATGCCAGC GTATGGTGGT TCGATCTCTC AGAATGTGTA CTACGCCGAC AGCGACCTCT GTGATCCGTC CGAAGAAATT GAAGGCTATC CACAAACCGA CTCGGATGGC GACGACGACG ATGTAGCACC ATTTCCAGCG CCCTATATTC TCATGGTTAA CCGCGGAGGA TGTACCTTTG TGCAAAAGGT ACGGAAAAGA CGACGATAAA AATGTCCTTG AACCGGCGAA GTTCTCACCT TTTTCTCATC CACAATGTTC TCGGCGACAG GTGCGCAACG CACAGCACAT TGGAGCATCA GGTGTTTTGA TCGCCGACGA CACCTGTCTC TGTTCGGATA AAGTCTGTAT GGCCAACAGT GAAGACGACG AGGACGCCTG CCAAGTCAGC GAACCCATCA TGTCCGATGA CGGTTCGGGT GCAGACATAT CAATCCCGTC TTTCTTAATG TTCAAGATGG ACTCGGAGCG AATCATTGAA GAAGTCAAGA GCAATCGACC TGTTCAGGTT GAAATGGCTT GGAGTCTTCC GAACCCTGAC GATCGTGTAG AATATGATCT GTACACGTCC CCGACCGACT CCATTAGCAA ATCTTTTATC CAAAGCTTTA AACAGCTTGC GGTGGCTCTT GGAGGCCGCG CGTACTTTAC GCCGCATATG TACATTTTTG ACGGCATAAA ATCACAATGC CACGGATCGG ATGGGGAGAG TCATTGCCAT ACTCTCTGCA CCAACAACGG ACGGTACGCC ATATACGCCT CCAACCTAAG CTTGAGGCGA CAAGAATTGG ACACGCTTCT CACCCTTTCC TTTATCCTTT CCTATAGATA CTGTGCCACC GACCCTGATG GTGACTTGGA ACGTGGTATC TCGGGTGCTG ACGTTGTCAC CGAAAGCTTG CGTCGTATCT GCATCTGGAA TCACTATGGC GCCCCCAACG GTATCGGAGA AATCTGGTGG GACTACGTAA TCGAATTTGA ACAGCGCTGT GCCGCTTCAG ACTACTTTTC TGACACAGCC TGTATCCAAG AAGTGTACCA CCGCGCCCAG GTTGACGGTG ACATGGTCGA GCGGTGCATG ACGGATAGTG GTGGAACTAT AGCGGATGGG GCCAACACCA AGCTTGACTT TGAACTAAAC GCCCAGACGG ACCGGGGAGT AGTTATTCTG CCAACTACTT TCGTCAACAC GGCTGCTATC CATGGTGCCT TGACCCCGTC GAATGTTTTT AACGCAGTGT GTGCGGGTTT CGCCGATGGC ACAGCCCCCG AAAGTTGCAA CACGTGCAGT TCCTGTAAAG ACACGATTTT CTGTGTCGGT CAGGGGTACT GCAAAGCGAA 
CGATTCGTCC GGTGGCCCCG CGGAAAGCGG AGTATCTGGA CATGCCTTCG CGACTTCCAT GCTGATTGTG ATCGGGTGTT TCTCCACCTT GGGTGCGTGG TACTACAAAC GCACCAAGGA CGAGCTGCGA GACCACGTTC GTGGCATCAT GGCCGAATAC ATGCCTCTCG ACGACAACGA AGGAGACCTC GGAAATCCAA TGGATTTTTC AAACAATGGC GATGCGACCA CTTCACTCAT GATGGGCCCG GACTCGATTT AGAAAACCTC GTGCAGCGTA GACGATACGT C
|
Protein sequence | MTLFRLAIFA VFATSVQGKT NEDQKATSKL MVHIPHMLYK SAGYDHREAL FGMPAYGGSI SQNVYYADSD LCDPSEEIEG YPQTDSDGDD DDVAPFPAPY ILMVNRGGCT FVQKVRNAQH IGASGVLIAD DTCLCSDKVC MANSEDDEDA CQVSEPIMSD DGSGADISIP SFLMFKMDSE RIIEEVKSNR PVQVEMAWSL PNPDDRVEYD LYTSPTDSIS KSFIQSFKQL AVALGGRAYF TPHMYIFDGI KSQCHGSDGE SHCHTLCTNN GRYAIYASNL SLRRQELDTL LTLSFILSYR YCATDPDGDL ERGISGADVV TESLRRICIW NHYGAPNGIG EIWWDYVIEF EQRCAASDYF SDTACIQEVY HRAQVDGDMV ERCMTDSGGT IADGANTKLD FELNAQTDRG VVILPTTFVN TAAIHGALTP SNVFNAVCAG FADGTAPESC NTCSSCKDTI FCVGQGYCKA NDSSGGPAES GVSGHAFATS MLIVIGCFST LGAWYYKRTK DELRDHVRGI MAEYMPLDDN EGDLGNPMDF SNNGDATTSL MMGPDSI