Gene Information |
Locus tag | PHATR_43944 |
Symbol | |
ID | 7204173 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 498908 |
End bp | 500582 |
Gene Length | 1675 bp |
Protein Length | 493 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186072 |
Protein GI | 219112977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.323695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGATTCGAT TTATCCCTGG GTATAGCTTT AATCGACGAC GACGATGATG ATATTGTCCT CTACCGCCGC TCGTGCAACT GCCAAGGCTT TGAGTCGACC CGGTTCTCGG GCGTTGGCTT CGCAAGCGGA ACACAACTTG ACGAGGTGAG TTGGGCACTA AATAGACGTA GAAATATGGA CTCCAGGAAA TTTCGCCATT CCGAATGCTC ACATTCTCAT CACGCATATT CTCTCTCCCA GTGGTTTTAA CTTTTTATCC GACCAAGACC GTATTTTTAC CAATTTGTAC GGCGAGCAGG ATTGGCGTTT GCCCGACGCC ATTAAGCGCG GTGATTACCA CTTGACGAAG GAAATCATGT GCATGGGTCC GGACTGGATT ATCCAGGAAA TCAAGGACAG TGGGCTCCGG GGACGCGGTG GAGCCGGCTT TCCCTCCGGG CTTAAGTGGA GTTTTATGCC CAAGGAGACA GACGGAAGAC CCTCCTTTCT GGTTGTCAAC GCGGACGAGT CCGAACCCGG TACCTGTAAG GACCGCGAAA TCATGCGCAA GGACCCGCAC AAACTCATTG AAGGTTGCAT CCTGGCGGGC TACGCAATGC GTGCCCGAGC AGCCTACATT TACATTCGTG GAGAGTATTT CAACGAAGCC GTAGTGCTGG ATGAAGCCAT CCACGAAGCG TACGCCGCCG GACTCCTCGG GAAGAACGCC TGTGGTTCGG GGTACGACTA TGACATCTAC CTCCATCGAG GTGCCGGCGC CTATATTTGT GGAGAAGAAA CGGCTTTGAT TGAAAGTCTA GAAGGCAAGC AAGGAAAACC TCGTCTCAAG CCTCCGTTCC CGGCGGGTGT CGGTTTGTTT GGATGCCCCT CGACGGTAAC CAACGTGGAA ACCGTGGCCG TGGCCCCGAC GATTCTTCGT CGCGGTGCGT CCTGGTTCGC CTCCTTTGGG AACGAAAACA ACCGCGGCAC CAAACTCTTT GCGATTTCCG GTCACGTCAA GAACCCCATG GTCGTGGAAG AAAGCATGTC CATCCCGCTC CGTGACTTGA TTGACAAACA TTGCGGTGGT ATGCGCAACG GCTGGGAGTC CGTCCAAGCC TGCATTCCGG GTGGTTCCTC GGTCCCGGTC CTCAACAAGG ACCAGTGCGG CGAAGCGCTG ATGGAGTTTG ACGACTTGCG CGCCAAAGGA TCTGGTCTCG GTACGGCCGC TGTGACAATG TTCGACAACA CGGTCGACAT GGTGGGTGCC ATTCGTCGCT TGTCGCACTT TTACAAGCAC GAGTCCTGCG GTCAGTGTAC ACCCTGTCGC GAAGGCACGG GCTGGCTGGA AGATATTCTC ATTCGGATGG AAAAGGGAGA CGCCGACAAG CGCGAAATCC CTATGCTTGA GGAAATATCC CGCCAGATTG AAGGCCACAC GATTTGCGCC CTCGGGGACG CCGCCGCCTG GCCTGTACAG GGACTTCTGC GTCACTTCAA AAAAGATATC GAAGACCGTA TTGACAATCC AAAAGGCTTT GATCACGAAG CCGCTTTTCA AAAGGCCTGG AGTGGCGATC CTTTCGACAA CAACGCCTGG ACTAAGGAAC ACGGTGACGG CAAGACCTAC GCCGCGGCGT AATAAAACGG AATTGTGCAA TGGATAAGAG TAGGAATATC GTTGACTGAA AATAC
|
Protein sequence | MMILSSTAAR ATAKALSRPG SRALASQAEH NLTSGFNFLS DQDRIFTNLY GEQDWRLPDA IKRGDYHLTK EIMCMGPDWI IQEIKDSGLR GRGGAGFPSG LKWSFMPKET DGRPSFLVVN ADESEPGTCK DREIMRKDPH KLIEGCILAG YAMRARAAYI YIRGEYFNEA VVLDEAIHEA YAAGLLGKNA CGSGYDYDIY LHRGAGAYIC GEETALIESL EGKQGKPRLK PPFPAGVGLF GCPSTVTNVE TVAVAPTILR RGASWFASFG NENNRGTKLF AISGHVKNPM VVEESMSIPL RDLIDKHCGG MRNGWESVQA CIPGGSSVPV LNKDQCGEAL MEFDDLRAKG SGLGTAAVTM FDNTVDMVGA IRRLSHFYKH ESCGQCTPCR EGTGWLEDIL IRMEKGDADK REIPMLEEIS RQIEGHTICA LGDAAAWPVQ GLLRHFKKDI EDRIDNPKGF DHEAAFQKAW SGDPFDNNAW TKEHGDGKTY AAA
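The derived fields in this record (Gene Length, GC content, Protein Length) can be cross-checked against the coordinates and sequences. The following is a minimal sketch in Python, not the database's own method: the helper names `clean`, `span_length`, and `gc_fraction` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is consistent with the Start bp, End bp, and Gene Length values listed above.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Coordinates taken from the record above.
gene_length = span_length(498908, 500582)
print(gene_length)  # 1675, matching the Gene Length field

# Only the first two blocks of the gene sequence, used as a short illustration;
# paste the full Gene sequence field to compare against the GC content field.
first_blocks = "TCGATTCGAT TTATCCCTGG"
print(len(clean(first_blocks)), gc_fraction(first_blocks))
```

Passing the full Protein sequence field to `len(clean(...))` gives a figure to compare with the Protein Length field in the same way. Because the record marks the gene as spliced, the genomic span is not expected to equal three times the protein length.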