Gene Information |
Locus tag | PHATRDRAFT_39794 |
Symbol | |
ID | 7195611 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 47106 |
End bp | 48411 |
Gene Length | 1306 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183919 |
Protein GI | 219127389 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACG GTGTAGAGAG AGATCAGGGT TATGTTGACT ATTCTGCAGA AGTAGAGGAG ACTCCCGACA AGGAAGGGAA AACAAACGAA AAGTTTATGT CAGTTGGCGC ATGGCAACGC ATAAAGGGTG GCAAGCCAGC CTTTCTTGTT CAGCTTCATT TCCATCTAAG CCAAGCCGCA AATACAGGGC TGGATGACAT ATTCTCGTGG CAGTCTCATG GCCGATGCTT TGTAGTGCAC AGTCAAAAGC GATTTGAACA GTACGTACTC CCCGTGTAAG TTCCCAATGT GTGCGCTGCT TTTAACAAGG GTGCTTACCC CGTGACATGC TACAGTCTAA AAATTTCTCT TCAAATGTCA CGCTTTGGTG CAGTTGGTTT CGACAAACGA AAATATCATC CTTTCAAAGG CAGCTGAACC ATTATGGATT CAAAAGACTG ACAAAAGGTA AGTCCGTGGT CTGAATATGC AGGTTAAGGT ACTGTATCTG CTTCTTAGCT TTAGTCGACT CTTGCTTCCA AACGCCTGAA TTTGAAAAGG TTTTGATAGG GGAGGCTACT ATCACGAGCT TTTCCTGCGC ACTAAGCCAT TCTTGGTCCA TCGTATTAAA CGCAAAGTCA AAAAGGGAAC CGGTAGACAA CCACCCGACA TGCCTAGGCA GGAACCCAAC CTGTATTTGT ATCCCTTCTT GCCACAAACG GTATTTCAAG TTTCCGGCAG AAGAGCACAG ATTTCCGACA GTTCGGCTGG ATTCGTCACA ACGACTAAAC CAGAAAGTCA GGGACAAGTG TTGCCACTAA TCCCGGTCGA GGATTCCCGC AGCTTGCCGA AGTTCGTTTC CGAAGTCTGC ACTAATAACC AAACGGTCCC GGTAGACCGC TGCCTTTCCA CTAAAATCCA TGACCAGAGT CTCCTGGAAG AATCTGTCCG AGCTGTTGCA AACTCACGTT CTCTACCGCC GTTTGAGCGT GCCGTACCGT CTACATTGGC ACTATCACAG GCGACGTATG ATTCCAATAT CCGCATGTTG CTGTTGCTGC GGGAGGAACA GGCGCAAGCT GAAGCATTCG AACAAACCAG AGTACATCAG CAGCTACTGC TCGAAGCGAA TCTTCTCATT GGTTGGCGTA GCGAAAGACA ATCGGTCTGG TTCGAAGATG GAAATTGCAA CCGCGGACAT GATTGTGAAA TGCGGCTTGC CAACGCTGCA GCTTACCAGC AACAGCTTTC ACCACAATCA CCCTCGGATC CCCAAAAAAT TTCGCTTCCC AATTTGTATC GAATTTTGAG GCAAAATGGT TATTGA
Protein sequence | MSNGVERDQG YVDYSAEVEE TPDKEGKTNE KFMSVGAWQR IKGGKPAFLV QLHFHLSQAA NTGLDDIFSW QSHGRCFVVH SQKRFEHWFR QTKISSFQRQ LNHYGFKRLT KGFDRGGYYH ELFLRTKPFL VHRIKRKVKK GTGRQPPDMP RQEPNLYLYP FLPQTVFQVS GRRAQISDSS AGFVTTTKPE SQGQVLPLIP VEDSRSLPKF VSEVCTNNQT VPVDRCLSTK IHDQSLLEES VRAVANSRSL PPFERAVPST LALSQATYDS NIRMLLLLRE EQAQAEAFEQ TRVHQQLLLE ANLLIGWRSE RQSVWFEDGN CNRGHDCEMR LANAAAYQQQ LSPQSPSDPQ KISLPNLYRI LRQNGY
|
| |
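Note on derived fields: the record's Gene Length follows from the inclusive coordinates (End bp - Start bp + 1), and GC content can be recomputed from the Gene sequence once the spacing between the 10-base blocks is stripped. The sketch below is illustrative only; the function names are hypothetical and not part of any database API. It assumes the annotated span runs from the start codon through the stop codon, so the gap between the span and the coding length is only an estimate of intronic sequence for this spliced gene.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and normalise case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly space-separated) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values taken from the record above.
span = gene_span_length(47106, 48411)   # Start bp, End bp
cds = coding_length(366)                # Protein Length in aa, plus stop codon
non_coding = span - cds                 # estimate under the assumption stated above

print(span, cds, non_coding)
```

To check the GC content field, pass the full Gene sequence string from the record to `gc_fraction`; the block spacing is handled by `clean_sequence`.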