Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44917 |
Symbol | |
ID | 7199608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 631502 |
End bp | 633446 |
Gene Length | 1945 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178817 |
Protein GI | 219116044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTACTT TTTGTAAAGA AAAATTATGC AAGATGCGTA ATCGACTTCA CGGTACTTAA ATTTCTTTCT AAGATCTTCG AGAAAATGGC ACATACCAAT CCAGATAATT CTAGAAGACC AACAGACATT CGATGATATT CGATGCCGAT CTCGATGAAG ACGACGACCA AAACAAAGTA TAATGCCACT TCCGTTTAAA ATGGAACAAC CGAAAGCAAA GGCACAGCGG CCATCAAACA CCGAAGCTGC AGATGTACCA AATTCTCAAG ACCAAATGCC ACCAACTAAC CTAGATGCTG CAAGTGCGTC TGAAAACGAT ATAGCAGACG AACCAAATCA GTGCTTCACG ATACTCTCCC ATCATAGCTA TCATGATTAT GCAAATATGA ACGAGCCGCC AATATTGGGG ACAGGTGTCG TTGGGTGCGC CACGAAGACG CGAGGAAATT CAATGAATCC GTTCCCTCTG ATGCTTCATA AGCTGCTCGA AGGAGCAAAG AAGGGAAACT ATAGCGAAAT AGTTTCATGG AAACCGCACG GACGAGCCTT TCATGTACAC ATGAAGGATC GCTTCGTGAA AGATGTGATG CCTCTATACT TTCGACAGAC AAGGTTCGCC TCTTTCCAAC GACAGCTGAA TCTATATGGG TTCCGGCGCT TGACTGGACG GGGACCGGAT GAAGGGGCTT ACTACCACGA ACTCTTCCTC CGAGGGATGC CCGAACTCTC ATCAAATATG GTCCGAATGA AAGTGAACGG AAATGAGGTC CGGCTTGGTT CCTCCCCTCG AACAGAACCG AACTTTTATG CCATGTCGGT TGTTCATGAA CCCAGTACAC AACAAAAGCA ATCAACCCCG AAAAAACGAC CGGCCATCAC GCAGCGCACA GCAGAGCGAC CCTCTTATCC GTTAGACAGA AAAATGTTGG CCGACTCTTG GAAAGTTCCC GGCGACATGC AAGATGCAGC GATTGGCTCT TCTTCCAAAA TAAGCATAGA AAAACCATCC AGCGTTGACA TAGAATATTT GAGAAGCCAC TTGAGCCAAC AACGGCACAC GGCATTCCAA GATTTTTCCG GCCACGTTGT TCCTGGGCAA AAAACTACCG TAGCCCCACT CAAAGACATG GTTGTAAATC AGCCTCCTTT GGGGAATGCC TCGGACTTGG TGCGTCAGCG CACGTGTCCA GACGAAGACT TTCCTGTTCC TGAACCCGTC CCTTTGCGTC GACATGATGT CGGGACGGAG GACGATGACC AGACATCAAT GGTGAACTTC TTGAGTGATA TTGACCTACA GCCATCGTCA GACGAAAACA CTCTGGATCC CTTGAGGATT CATCATTCTT CCGACGACAG CTCTGTGTTG AAACACCTGC AATTCTCTTC TGATGAAAGT ATGAATCCGA TCCCTAAAAT TCCATCAGCT CAGCAATCTC GGTCGTCGTC TTCCAAGGAG GATAGCAGAT AGCATAAAGA GACACTTTAG CATTCGCTTC AGAAACGTCC TATGTTCAAA ATCGTGTTGC TCACTCTCTC CAACAAAAGA AACGTCTCGG CTTGGGTTGC CTTCGTCTAG GACGGTCAAC TCAAGTGATT TGAGCGAGCT ACAATGGTTC TATCTGAGCA CGACAGTGAT CCGTGTGATT GCATGTTGAT GCAGAATATG TATTGTAGGT AACAAGTTCT GGGTTGTCCA CATCGAGCAG GAGTACATTG AGCAAGAGTA GCACACAGCC AGTCAGCAGA GGCTCTAGTT CGCCAGCGAA ATATCCAAGG CCATACCAAA GTCTCGAGAA ACGACTACAA ATGGACGAAA CAGATAAAGC TCTTCCTCCA GCCTCCTAGC TACGCTCTAA TCATTTCTCA GGAGAGTACT GTGTGCCGCG CTTGATCATT GGTGCGAATG TTAATGGCTC TCGAAGATAC CAGGTCACTA GAATG
|
Protein sequence | MPLPFKMEQP KAKAQRPSNT EAADVPNSQD QMPPTNLDAA SASENDIADE PNQCFTILSH HSYHDYANMN EPPILGTGVV GCATKTRGNS MNPFPLMLHK LLEGAKKGNY SEIVSWKPHG RAFHVHMKDR FVKDVMPLYF RQTRFASFQR QLNLYGFRRL TGRGPDEGAY YHELFLRGMP ELSSNMVRMK VNGNEVRLGS SPRTEPNFYA MSVVHEPSTQ QKQSTPKKRP AITQRTAERP SYPLDRKMLA DSWKVPGDMQ DAAIGSSSKI SIEKPSSVDI EYLRSHLSQQ RHTAFQDFSG HVVPGQKTTV APLKDMVVNQ PPLGNASDLV RQRTCPDEDF PVPEPVPLRR HDVGTEDDDQ TSMVNFLSDI DLQPSSDENT LDPLRIHHSS DDSSVLKHLQ FSSDESMNPI PKIPSAQQSR SSSSKEDSR
|
| |