Gene Information |
Locus tag | PHATRDRAFT_46148 |
Symbol | |
ID | 7201365 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 409398 |
End bp | 411427 |
Gene Length | 2030 bp |
Protein Length | 503 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180430 |
Protein GI | 219119335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.703059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTAGTTAG ATACACCCAA GCCGAGTATA GAATACATAC TCACAGTCAG CGATCTCGAT TGTCGTTATA CCTTCTCGCT TTGCTCAGTG TTTAGATCGC GCCGTGCGCT ACTAGATGGT AGTATTTCCT TGGCTAAGAA CAGCGTTAGC AGTATGCTGG TGCTCGCTTT CTTCCGATTC GACGAGCTGC CGAACTTCAC GTGTTCTCAC GGCGCTTCCT TTCGAGAACG AAATAAATCA CAACCAGATT GTTTCGTCGC AAGAAGATCC GCTCCTATGT GCTCTCCGAG AAATGGATTT TTCGGGTGCT CAACAGGACA ATATCCTCAC AGAGTTGAGT AAGGTGGGGC TTTGGAATAC AGAAACGCTC TGTAGCCGAG ATTTGTTGGC CGTCTCTCGT GATTTTGTGG ACGCTCCGAT GGAGCTAAGT CAAGTTCTGC GAAGAGATTT TGGTCTCCCA GTACTAATAT CCCACCAGGC ACGGGCTGTG GTTTTGTACG CACTCAGATC TCAAAAGGAT CCAAAATTGA CTCCGACGGA AGACTCACTG TCAACACAAT CATCACGCGT CTCGTTACAG CGTGCGAACA AATCTCAACT CACAGAACCA ACACCTCTCG AATCAAATCC GCAGGAGACA GCGTTGACCC CTTCAGCTCC AAAACGACAA CTATTTAAAC AAGTTGTAGT TAACGAAATG TCGCAGAAAC GCCATCGAAA GGATCATACA TACGACTACG GCTTGCCACG GGACTACGCT ACGGTCTTTC CTACTCTCGG CGCTGAGCTA GACGCCTTTT TCACTTTCAT GACTAAGCCC TCTATATTGT CGTCAGAACC CCCACTTCGT GTTGTCACGG CCAACGTTTA TTTGCGACAC GCCAAGTTAT TCCTGGGTTG GTACGTTAAC ATATATCAGT GTCAGGGGGA ACTCGACCCA TCCATGGAGC CAAAATTTGG AACGACCGAC CCATCACTCT ACAATATTAT TCCCAATCAG GAAAAGAATT CGGCTACGTG CTTTATCGAC TTTCTTCTTT GGCTCCGATC AAACCGACAG ATATCGGTAT CTTACGAAGC CAACGTATTG AGAGGCATTA CCAAAATGCT CAAATTTCGC TTTGCTCGGG ATTGTTCGCC GGACAGAACT GGCGACCCCA GCAAACCATT TGATGATGTT CCCATTCTTC GTGAACTTCG TAAATTGCAC CGCCAAGCTA ATGGACAGCA GCGCTTGGCC CCTCGTGTCA GTGACGAAAC CAAAAAATGG TTGTCCTGGC CGGAGTATCT TGGTGTTGTC CAACAATCTC TGACAGAAGT GGATCAACTT TTACGAGCGT ACAATGGACC CGACCCAGAA ACAGTTGAAG CAAGCTTTAC AGTCGAACAA CGAAGAATTT CTGTTGCTTA TCAAAAGTAT CTGATTTTGG CTATTTTTGC CAGCATTCCC GATCGACAAC GGACAATACG GGAACTTGAG TTGGATCGTT CGTTCGTCAA GGACATTGCG AGCGATAGCT GGGGTATTAA GCATGCACCC GACGATTACA AAACTGGCAA AACATATGGG GAACGACCGT TACTGCAATT ATCGCCTTCG CTAACGCCAG CTATAGAGGG CTTTGTGTCA CACTGGCGCC TTTGTTTGCG ACCCCAGACT AAGCATGTGT TTGTCCAACC CAGAACTTGC AATCCGTTGA CACAGGATTC CGTCTACCAA ATTGTATCTC GGGCATGCTT TCAGTACACT GGGAAGCGAA CCAATCCGCA TTTGCTGCGG GATATGATTG TGACGCACGT 
ACGGGAGTCA GACGCATCTG AAAAGCAATT GGAAGCATTG GCATTATTTA TGGGCCATTC CATTCAAGTG CAACGTGCAT CTTACGATCG GCGTACATTG AATGACAAGG TGGCACCTGC CGTCGAGCTC ATAAGATCTT TGAATTCTGT CTCATTGATG AAATAAAATG TAAGGAATTC GGATCCGAAT GAAGTCCAAG GTACGCTACG AGATACAATA TAATCGGGAT
|
Protein sequence | MLVLAFFRFD ELPNFTCSHG ASFRERNKSQ PDCFVARRSA PMCSPRNGFF GCSTGQYPHR ETALTPSAPK RQLFKQVVVN EMSQKRHRKD HTYDYGLPRD YATVFPTLGA ELDAFFTFMT KPSILSSEPP LRVVTANVYL RHAKLFLGWY VNIYQCQGEL DPSMEPKFGT TDPSLYNIIP NQEKNSATCF IDFLLWLRSN RQISVSYEAN VLRGITKMLK FRFARDCSPD RTGDPSKPFD DVPILRELRK LHRQANGQQR LAPRVSDETK KWLSWPEYLG VVQQSLTEVD QLLRAYNGPD PETVEASFTV EQRRISVAYQ KYLILAIFAS IPDRQRTIRE LELDRSFVKD IASDSWGIKH APDDYKTGKT YGERPLLQLS PSLTPAIEGF VSHWRLCLRP QTKHVFVQPR TCNPLTQDSV YQIVSRACFQ YTGKRTNPHL LRDMIVTHVR ESDASEKQLE ALALFMGHSI QVQRASYDRR TLNDKVAPAV ELIRSLNSVS LMK
|
| |
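The length fields in this record can be cross-checked against the coordinates and sequences. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive, which is consistent with the stated 2030 bp, and that the spaces in the sequence fields are display grouping rather than data. The helper names `gene_length` and `ungroup` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def ungroup(seq: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of 10."""
    return "".join(seq.split())


# Coordinates copied from the Gene Information table.
print(gene_length(409398, 411427))  # 2030, matching "Gene Length"

# First two blocks of the protein sequence, as displayed in the record.
protein_head = "MLVLAFFRFD ELPNFTCSHG"
print(len(ungroup(protein_head)))  # 20

# 503 codons need at least 3 * 503 bases of coding sequence,
# which must fit inside the 2030 bp gene span.
min_coding_bp = 3 * 503
print(min_coding_bp <= gene_length(409398, 411427))  # True
```

Applying `ungroup` to the full Protein sequence field and taking its length should give the stated 503 aa. The same approach applies to the gene sequence once its two display lines are joined.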