Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39832 |
Symbol | |
ID | 7195668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 138943 |
End bp | 140484 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183933 |
Protein GI | 219127419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCCA CGTCCGTTTC GACCCCCAAC ATCCACCTTT GGTCATCGCG GCACACTGTG GTCAGTGGCA CAACCGGGAC AGACCCATCA GTACCAAACC AGTCGGGGTC GGCCTCGTTG CCGCCACGCA GTGCGTGGGT GCGTACGTGT CTGTGCGCGC GCAACCTTTT GATGTTTGCC ATTGTTGCGC TCCACTTCTC CTCCTTTCTG TTCGTACTGC AATCTCCGGC CGATCGAGAT CCGGAATCGT TGGTGCCGGA CGAGCGGAAT GTAAGGAGAC GGCACCAAAA TCAATCCGTG TCGATTCCGG TGTTCATACC CACTACGAGT ACCCCTGCCG CTGCCGCTAC TTCCTGGCTA GCCACTGGCT CGTCCCAGCC CCGAGCACCA TCGTCACCGT ACGCCTACAC TTTTGTTATT GGAGCCATTC ACGAAGATCG ACCCGCCTAC AAGGGATTCA TCTACGACAT TCTCGTATCC TTTCATTCCT TGCAAAAACT CGGCTCGCAA GCCGATTTCT GGGTCCTCGC ACAGCTCGCC CCCGATTCGA ATCTGACCGA GCTTCCCGCG GAAGACATGC GCGCTCTCCA TACATTGGGC GTTCGCGTGA AAATGCTGCC CCAATCCGAG TCCGACTCGT TCGCCGAATT CGTGTACAAC AAATTCCTCG TCTTCCGGAT GACCGAATAC CGACGCGTTC TTTTTCTCGA CGCCGACCTT ATCCCCCTTG CCAATCTCGA CTACCTATTC CATCTATCGG ACCAGGGCGA AGCATCCCTC ATCCGACCCA ATCTCATTAT TGCTACTCGA GGCGAGCCCT GTAACGCGGG GTTTTTCATG GTCGAGCCCC ACGAAAAGGC CTGGCAACGG CTGCAAGGCA TTGTGGCCCG GCACCACGAG GAAGCCAAAA CCTTGCCTTA TCCTCACTTT GACTGGCGGA ACGGCTGGGG TCACAACTTT CAAAAGGCAG GAGACGAATG GCGCGCAGTC CTGCGCAACG GTCAAGCCTG GCGCTTCCAC GCCGGTCACT CCGACCAAGG ATTGCTGTAC TACTTTTTGA AGTACGCCCA ACAGGACGTA ACTCTGGTCA TTGGCGAACG GGTGGAAAAC TGGGTGCCGG GGACAGTCGA CGGCCAACCC CAAATGGCGG CCAATCTCAC CACTCCGTTC AACAATTACA CAACGGACGA AGCCAGGGCA AAGAAGTCGT CCTGTCTCGT TCAAAATACT GCCTACGAGT GCGTGCCTCC CTACCGTGAT TTTATGCACT TTAGCGGTTC CTCCAAACCT TGGCAAGGGA TGCTGCCGAG GGCGTTCGTG TTGGCGCATG AGGGACTGGA CCGGTGGGAA AAGTTACCAA CTCCGTTGTG GCCCAGTCTT CTTTGGTTCA AGGAATTGAT CGAAGTGAAC GATCTACTCC AACTCGGCTT GGACGTAGAA AATTGGAACG AACAGCACTT GCCGCGAATG AAAGATTCCC CCATGGGATA CATTGCCAAA TTTGCGGATC ACGCCAACCA CGTGCACGGC GCTTCTTCAT AA
|
Protein sequence | MKATSVSTPN IHLWSSRHTV VSGTTGTDPS VPNQSGSASL PPRSAWVRTC LCARNLLMFA IVALHFSSFL FVLQSPADRD PESLVPDERN VRRRHQNQSV SIPVFIPTTS TPAAAATSWL ATGSSQPRAP SSPYAYTFVI GAIHEDRPAY KGFIYDILVS FHSLQKLGSQ ADFWVLAQLA PDSNLTELPA EDMRALHTLG VRVKMLPQSE SDSFAEFVYN KFLVFRMTEY RRVLFLDADL IPLANLDYLF HLSDQGEASL IRPNLIIATR GEPCNAGFFM VEPHEKAWQR LQGIVARHHE EAKTLPYPHF DWRNGWGHNF QKAGDEWRAV LRNGQAWRFH AGHSDQGLLY YFLKYAQQDV TLVIGERVEN WVPGTVDGQP QMAANLTTPF NNYTTDEARA KKSSCLVQNT AYECVPPYRD FMHFSGSSKP WQGMLPRAFV LAHEGLDRWE KLPTPLWPSL LWFKELIEVN DLLQLGLDVE NWNEQHLPRM KDSPMGYIAK FADHANHVHG ASS
|
| |