Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50516 |
Symbol | |
ID | 7199292 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 285385 |
End bp | 288327 |
Gene Length | 2943 bp |
Protein Length | 806 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185463 |
Protein GI | 219130628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTATTCGTT GGTTCGTTTC CTATAGTGCC ATTCTAGTCG AGTGTGTGCT CACCTCTTCA CCGAGAGTAC ACATAGTACC AACAATAACA ACAACCGTCG CAAAAGTACA CACCACCTCC ACCAGAAACG TAGTGTCCAT CACGGCACCA AGTAGTTCAT AGTTGGTGTA TCGATACGTA GACGTTGCTG ATAGCCATCT CTATCGTTGC AACCACTCGT CCATAGAGGG GAGGGAACCG AACAATTCTT TACAATACCC CGTGCTTACT CAGCAATAGT AACAACAGCA ACAACAAGAT GTCCACGTCT CGTCTCCCCG CCGCGGTGAC GAGCTTTGCC TACCGTTGGC TCGCCGAAAC GGAGGCCGAC GGGGAAGTCC ATTCCGAATT CGGAGTCCAC ATTGCCTACG AAGATTTGTA CGACTTTGTC GTCTTTCTCA CCTGCATTTA CCTCGCGGGA CACATCGCCA CTTACTGTAA AATGCCAGGC TTAGTGGGAG AAATCATTGT CGGGATCGTC CTCGGACCGC CGTTGGCGGA TTTCGTTCCC AATCCGGAAG CCTGGGTATT GCTCGGTGAG GTCGGACTCG TCATTCTCGT ACTCGAAGCG GGTATTGATA TCGACGTATC CACACTCAAA CTGATTGGAA CCCGCGGATT TCTCATTGCC TTTGTCGGAT CGGTGCTACC GATTGGTATC GGAATTCTAC TCGCCATTCT CGTCAACGGA ACCGACGATA TCAAGGCAGC CATTGCGGCG GGGGCCACCT TTGGACCCAC CAGCTTGGGC ATTGCCCTCA ATATTCTACG TTCCGGGGGC ATTCTCAATA CACCCGTCGG ACAACTCATC ATCTCCGCCG CGGTACGTAC CAACCGACCG ACCAGCCATT CGTGTCCGCA TTCCGTTGGA CTCACCTTGC GGGTTCTTTC TAGGTGATTG ACGACATGAT TGCCCTCATC GTCCTTTCCC AACTGGAATC CTTGGCCGGA ACCATTGACG CCGCAGGTAT TCTCATTCCC ATTGTCTCGG CCGTGCTCTT TCTGGTCGTT GGTGGTTACC TGGCCCTCTT TGTGGCACCC CCCTTCCTCA ACCAATACGT GCTCAAGCAT TTTGAACAGG AATCGCACGG AAAAATTGAA ATGGCCGTCA TGTTTGGCAT CCTACTCGCC TTGATGCCCG CGACCTTTTA CGCCAAGGCA TCGTACTTGA TGGGCGCCTT CGTGGCCGGA TTGGCCTTTT GTACCTCGCA CGATCTCCAC GAAACCTTTG TACGACAATT TAAACGACTC TTGCAGTGGC TCATGCGCAT TTTCTTTGCC GCCAGTATCG GATTTCAGGT ACCTATCAAG GACCTCTTCC AACACCACGT CCTCTGGAAG GGACTCGTCT TTACTCTGGC GTTGACGGGC AAATTGGCCG TCGGATTTAT GGTACCGAAC TTTACTCAAT CCAAATCCTT CACCGGCATT CATCTGCGCG ACTGTCTAAT TACGGGATTT AGCATGGCGG CGGAAGGGGA ATTCGCCTTT GTCATTGCCG TCTTTGCCGT GGACAAAGGA CTGATCGACA AGGATTTGTA CGCTTCGGTG GTCCTCGCCG TCCTCATCTC GACCATCATT CCACCGTTTC TTCTGCGCTT TACCATTTCC CACTACAATA AAAAGGCGGA AGAAGCCATC AAGGCCCTGG CCAATGCCGA AATGGAACGT CAGCACAATT TGGAACACGA ACTCGAACAT GTCGTCATGG GCGACGATCA CCTCGCGAGG CTGGAGGACG AAATTAAATC CCATCGAGCC GTGTTTCTGT GCATCCAAAC ACAGTCCGAG GCACGATGGG GACTCATGCA CAACATGATG AAGGCGATTG CGAAACTCGG GTACGGAACG CATGTGTTGG TGGTGTTTGC AAAATGTGAA AGCTAATTGA CTCACACTGT CATTTTGGTT ATTGTTGATT CTGTATCCAA TTCCAGACTG GACATTATCG ATCATCGTTC TTGGCATCCA CGTGGCATTC ACTCAACCTT GGTCAACGAA GTGTACGTCA AGGACCAGTT CGAGAAGACT GCCAAAGGGC AGGCTCAGCA AGCCTTGAAT ACTCGCATCC AAGAAATCCA CGATGCGTTG GAAGACACAA TTGATCAAAA AGAAACGGCC CGCGTCAAGG TGCAGCGCTG GTACCCAGGT GTGGTCGAGG AGATTGTCGA AAGCGTTCAC GAAAGAAAGC GTCACGGAAG AACCAAGATC AATTTAGAAG AGCGCCTCTT GTCGGAAGCT TCCAAAGCAT TGGATCGTCG TCAGTCAGCA CAAGTATTGG CGACGAAAGA AAAGACGGTG GAAGAAATCC TGGCGGGGAT GCAAAAGGAT ACGGAAGGCA TGCCGGTCAT TGAAGAACAT AAGGCTGGTC CAATTCCACA AGAGCATAAA AAGCAACAAA GTCGGCGCCG TCGACAAAAG ATGCGAAGTA CGCCGGTCGT GGGTGGTGGC TTGTTCGACA AGCGCGAAGT ACAGGAAGAA AAGGAAGAGG CTCCGCCTTC CAAATCAGGT TCGGGGCAAT GGACGCCCAC TTTCGACTTT AAGGTTCCCG GTCACAAGGC CGAAATTATT GTCAAAGGTG AATCCTATGA CATTCGGATC AACGACGCCA CGTTGAAGAA ACTGCGGTCC GGGTTCAGCG GGGATATGCT CGACCATCGT GGTCTTTCGG CTAGCGAAGG AGTTTCGATC CAGCCCGATC GCTCAAACGT TGTCAACAAT TTGCATGGAT ACGTGCGTAA TCCACAGTTG GGCAAAATTG TCGAAGAGAC GATTGTCGAT TCTGAATCAG AAACGTCAAG CGTAAGCCAT CAAAATGTCA CAAATCCGCA TGATCCCCCT GTATAAGCTG AGAATTATGT TTGTCGTTTC GACAGTATTT ATAATCAACA AGTTTACTTT TAC
|
Protein sequence | MSTSRLPAAV TSFAYRWLAE TEADGEVHSE FGVHIAYEDL YDFVVFLTCI YLAGHIATYC KMPGLVGEII VGIVLGPPLA DFVPNPEAWV LLGEVGLVIL VLEAGIDIDV STLKLIGTRG FLIAFVGSVL PIGIGILLAI LVNGTDDIKA AIAAGATFGP TSLGIALNIL RSGGILNTPV GQLIISAAVI DDMIALIVLS QLESLAGTID AAGILIPIVS AVLFLVVGGY LALFVAPPFL NQYVLKHFEQ ESHGKIEMAV MFGILLALMP ATFYAKASYL MGAFVAGLAF CTSHDLHETF VRQFKRLLQW LMRIFFAASI GFQVPIKDLF QHHVLWKGLV FTLALTGKLA VGFMVPNFTQ SKSFTGIHLR DCLITGFSMA AEGEFAFVIA VFAVDKGLID KDLYASVVLA VLISTIIPPF LLRFTISHYN KKAEEAIKAL ANAEMERQHN LEHELEHVVM GDDHLARLED EIKSHRAVFL CIQTQSEARW GLMHNMMKAI AKLGLDIIDH RSWHPRGIHS TLVNEVYVKD QFEKTAKGQA QQALNTRIQE IHDALEDTID QKETARVKVQ RWYPGVVEEI VESVHERKRH GRTKINLEER LLSEASKALD RRQSAQVLAT KEKTVEEILA GMQKDTEGMP VIEEHKAGPI PQEHKKQQSR RRRQKMRSTP VVGGGLFDKR EVQEEKEEAP PSKSGSGQWT PTFDFKVPGH KAEIIVKGES YDIRINDATL KKLRSGFSGD MLDHRGLSAS EGVSIQPDRS NVVNNLHGYV RNPQLGKIVE ETIVDSESET SSVSHQNVTN PHDPPV
|
| |