Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45551 |
Symbol | |
ID | 7200620 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 521402 |
End bp | 523159 |
Gene Length | 1758 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179665 |
Protein GI | 219117752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAAATCCA TGCAAGAAGT TCTGCGAACA CTGGTACGCT TATTGCCATG CGCAAAGGAT CACCCTTCAT TTCCGTAAAT GTTCGCCGCC ATGAACGGGA GGAGATTGAG CAACGGGACC GATTCCACTC GAAGGCTGCT TCATCGTGAA GATTGTGGCG GTTCTAGTGG AGTCCGATCA TCGCGACCAG TACCGAATTC ATCGAGACCG CAGACACAAA GAGTCCGCTT TCCGGATGAA GATCGGGAAA GTGTCGATGG AACTCTGGAT AGCTTATTTG GTTTGATCAC CGCCGCCTTG GCAACTGCTT GTTCAGCAGG TCTCTTTACT ATGTTACCAT TTTCCTTGGC TGCGTATCGG AGATTGGCTT CACAATTGGG AGCGTCATCG ATCCTCGATG CATTGGCGCT GTTGCTCCCG AATACGAGAA TCTGTTTATC TGGGGATTCT GATATACCGA GTCCAGTAGG AACTTCGATT CTGGTGTCTA ACCATTTGAT GGATGGGGAC TGGTGGGCGC TATTAATGCT GGGGCGGTGC GTTGGGCTTC GAGGGAGTAT TAAGTTCTTC CTTCGAAACG AGTATTTTAA TCTCAAACTT CACAACTCAG ATTCCGCGAC AAGTCGATCT AACTCAACCA CAATTGCTAC GAGTAAAGCG GTGGGAACTG CTGTTCACAT TCGGAACGAG AACGCCCAGG CCAGTTCATC GTTGCCGCGA GTCTCGCACC TGCGCGAGGG CTCTACCTCT CACGGCATTG CCATCATGGC AAATCTTCTC CACCAGTTTC TTGAGTTCCC GCTGCTGAGT GGGGACGACC ACACTGCTGA CAGAGAACAG CTAGTTCGAC TGCTGAGGTC GTTTGCACAC GACAATGCAT CGGCCCCTGT TCATTTGCTG TTCTTTCCAG AAGGATGGTC GCTCCACAAT GGTGCTGACC GAACAGCAAT ATTGGCCAAG AGTAACGAAT TTGCTCAACG AGAAGGCCGC CCGCAATTAA AGCACCTGTT GCTGCCCCGT GCTCGTGGTT TCAACGCAAG TCTTGAATGT TTACGAGAAT CCAGTCCAGT AGTCTACGAT GTCACGATGG TACGTCATGC TGTTGTGGAG GACGTCCTGT CGTTTCTTTA CATTTTTCTA ACACAGAAGC CTCTCCTCCA GGCCTACAGT GGGTACAATG GATCGCTTCC ACCTTCTATT GAGCTTACCT TTCCCGCCTT GTGGAAACTG CTTCGTGGGT TCCCTCGTGA AATACACATC CGAATCAAGC GATACAGCAT GGAAGAGGTT ACTCAGGACT CATCTTGGCT AGATCAAAAG TGGGCAGAAA AGGATCGTCT TTTGAGTCAC TTTGCTCGGC ATCAAACCTT TCCTGCTGAT AACCGAGGCT ACTGCCGACA TCGAGTCTTT GATACGAGAA CGCATGCGTT TGAATCTTCC ATCATCGCAC TTGGACGCTT GCTGCTATTG CCATTGGCTG TTCCGTTGTT CGTTTTGGTA TCTATCCCAA TATTTTGGGC CTTGATGTGG TTGTGGCTGG CACACTGGGC TTATCGGCAA CTATTTGGTC GGGTAGAACA GTCGTCGTCC AACGGAGGCT CGTCTGGGAG TGTCGGAAGT GCTGGTGCAG GTACCACACC TGGTACTTCG TCCGCTTCCG GAACACCCTT CTTCCCAGCT ACTCCGTTTG CGTCTCCAAC GGTGACTTCC TGGCGTGACA TGTTCTCAAA GAGTGCCTCA TCATCGTCGC CATCTTAA
|
Protein sequence | MFAAMNGRRL SNGTDSTRRL LHREDCGGSS GVRSSRPVPN SSRPQTQRVR FPDEDRESVD GTLDSLFGLI TAALATACSA GLFTMLPFSL AAYRRLASQL GASSILDALA LLLPNTRICL SGDSDIPSPV GTSILVSNHL MDGDWWALLM LGRCVGLRGS IKFFLRNEYF NLKLHNSDSA TSRSNSTTIA TSKAVGTAVH IRNENAQASS SLPRVSHLRE GSTSHGIAIM ANLLHQFLEF PLLSGDDHTA DREQLVRLLR SFAHDNASAP VHLLFFPEGW SLHNGADRTA ILAKSNEFAQ REGRPQLKHL LLPRARGFNA SLECLRESSP VVYDVTMAYS GYNGSLPPSI ELTFPALWKL LRGFPREIHI RIKRYSMEEV TQDSSWLDQK WAEKDRLLSH FARHQTFPAD NRGYCRHRVF DTRTHAFESS IIALGRLLLL PLAVPLFVLV SIPIFWALMW LWLAHWAYRQ LFGRVEQSSS NGGSSGSVGS AGAGTTPGTS SASGTPFFPA TPFASPTVTS WRDMFSKSAS SSSPS
|
| |