Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25551 |
Symbol | |
ID | 7197290 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1472389 |
End bp | 1474688 |
Gene Length | 2300 bp |
Protein Length | 692 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177840 |
Protein GI | 219112177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0800445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGATGGCCTA CGATCTTACC CCTGCAGAGC AAGGTGGCTG GCATTCGAAT GCGAGCTACG GCTTTTCATC GAAGTTTGGA TACGGAATGC CGTCACCAAC GTTTGACGCA CGGTCAGCAA TGACTCCTTC ACCGCATCTT GCACTTTGGC AGAACCCTGA CACCGGAATA TATGGAGGCA CACCACAAGC CAGCCAGACC GAGACGACTC ATTCGGTATA CGTAGCTTCT GTGCGGCCCC CGTCTACAGC AACAGCTAGT TCCGACCAAG GTCCCAGAGA TCATCAAGTG TCCAAGAAAG AATCAAAACG CGGAAAACGG GGTAAGAAGA AGTTAAAGGA AAAACCATTA AATGCACCAC AAAATAAAGG TAAAGAAGAA GACAGTAGAC CTGGTTCGGC GACACACAAT CAAGAAACAA CTGAAGACCC AGGGGAAAGG AAACGAGCTG AGCTCGTAGA AAACGCGGCA ACGCGAATTG CTTTCAAGGA ATTTTATCGG TGCTTTCGAA GTGAGGAGCG CTTTTCGTTT CAGAAAGCTG AAGAATTTGC TCTGGATGCT CTAGACAACG GGTCACTTCC AGTGTCCGTC CACTGGCGAG TGTATCTCGA ACTCGCTGAT TTGGCAAAGC GTTCTAACCG TTTCGTCGAG GCCAGGTCGC TTTATCAGCG TGTTTGCCAA CAGCAACCTT ATGCAAGTCA AGGGTGGTTA GAATATAGCA AGCTAGAGGA AGAATGCGGG CACATGAATC GGGTTACAAA TATTTTGCAT GCCGGTCTTG AGTATTGCGA GTATAGCGAG AATCTTTTGA CCAGAGCAGT AAAACATCAA GAAAAGATGG GCAATGTTAA TGGAGCTCGA GAGCTTCTTG CCCGCCTTAA GCACGTCGGT ATCGACAAAG TTTGGAGAAC CGTCCTAGAA GGAGCGCTTC TCGAATCTCG CGCGGGAAAC GCGTTCATGG CACGGCGTGT CCTCAAGTAC TTAATGCATC ATGTTCCATG GTATGGTCCT CTCTATCTCG AAGCGTATAA ACTCGAAAGG GATCTTGGCC GCCCGACCGA TGCCTTACAG ATTGTGCAAC GAGGATTGAA CGAGATACCA CGATATGGGC CGTTATGGTT TGGTGCTTTT AGACTATGCG AAGAAATCGA CCTGTCAAAG CTTGACTTTC ATCTTCCCGA GGCGTTTGTG ATGATAAATC GTGCTACCCT CAACATCAGT AAGGAGCTTG TATGGAAGGT TCATCTGGAA GCGGCACAAA TGCTTGAACG AGCTGCTCTT GAACAGAGTG GAAAGACAAC CCCTTTAAAT TCTGCCTTCG ACATCGCCCG CCACAGGTTT GCTTTGACCG TCCTGACGTG CCCGAGCAAT CTGCGTTGGA AAGTATGGCT AGCAAGCGGG AGAATGGAAT TGGGTATAGG GAATATTAAG GTAGCTCGGA AGCTCTTTCT TCGGGCTCAT CACGTTGTAC CGGATAAAGG GCGATCAGCC AGCCTACTGG AGTGCGCACG TTTAGAAGAA TTCATCGGAT GCACCCACCT CGCTCGCTCC GTTCTATGCA AGGGTCGTGT ACTCTATTGC AACGATTGGA AAGTGTGGCT CGAAAGTGTT CTGCTTGAGA TTCGCACCAT GAATCTAAGA CGTGCACTCG AGATTGTTAC AGTTGCTCTC GAGATACATC AGGGCACAGG TCGTCTGTGG GCTACCTTGA TACAGTTATG TCAGATTCGT GGAGGCGATC AAGCACAGAT CTTCGCCCTC CAACGCGCTC TCAATGCTGT CCCAAAAAGC GGAGAGGTGT GGTGCGAAGG TGCCAGGATT CATTTAAATC CATTTTCAGA TACTTTCGAT GTTTCTCGCG CACGCCGACA TCTTTTCTTC GCCACGAAAT TCACTCCGCA GTACGGAGAC AGCTTCATAG AAGCTCTTCG TCTTGAGCTC CTTCATCAGT GGCTCGAACC AATTGCGACG TACATTTGGA AGAAAACCAA GTCAACTTTT CTACCCTTGG AAGCACAAGA TGCAAAGACA AACTGTCTTT CCAAATATAT CGCAGATATT TCGTTGGCTG TTTTCATTTC GCAAGAAACA AACGAAAACG AGGTGCCGTT TTCCGATCTA ATCCACAAGA ACATAGTCTC GACAGTACGC CAGGAGCTGA CTTCAGATAG CATGCGATCT GCCATCGACT TGGACGATTT ACGCCAAGCA TGTTCGAACG CGGATCCAAA TTACGGATCG CTGTGGTTTT CTTGTCGCCG CCATCCGTGT GATACACCCC AACGAGTCAT TGAGGATGCG
|
Protein sequence | MAYDLTPAEQ GGWHSNASYG FSSKFGYGMP SPTFDARSAM TPSPHLALWQ NPDTGIYGGT PQASQTETTH SVYVASVRPP STATASSDQG PRDHQVSKKE SKRGKRGKKK LKEKPLNAPQ NKGKEEDSRP GSATHNQETT EDPGERKRAE LVENAATRIA FKEFYRCFRS EERFSFQKAE EFALDALDNG SLPVSVHWRV YLELADLAKR SNRFVEARSL YQRVCQQQPY ASQGWLEYSK LEEECGHMNR VTNILHAGLE YCEYSENLLT RAVKHQEKMG NVNGARELLA RLKHVGIDKV WRTVLEGALL ESRAGNAFMA RRVLKYLMHH VPWYGPLYLE AYKLERDLGR PTDALQIVQR GLNEIPRYGP LWFGAFRLCE EIDLSKLDFH LPEAFVMINR ATLNISKELV WKVHLEAAQM LERAALEQSG KTTPLNSAFD IARHRFALTV LTCPSNLRWK VWLASGRMEL GIGNIKVARK LFLRAHHVVP DKGRSASLLE CARLEEFIGC THLARSVLCK GRVLYCNDWK VWLESVLLEI RTMNLRRALE IVTVALEIHQ GTGRLWATLI QLCQIRGGDQ AQIFALQRAL NAVPKSGEVW CEGARIHLNP FSDTFDVSRA RRHLFFATKF TPQYGDSFIE ALRLELLHHM RSAIDLDDLR QACSNADPNY GSLWFSCRRH PCDTPQRVIE DA
|
| |