Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48695 |
Symbol | |
ID | 7194680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 633181 |
End bp | 636099 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183260 |
Protein GI | 219126009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTGT TTTTGAAAGC AAAAAAGAGA AGTAGTAGCG ATGGCCCGCG CTCAGGCGCC CTAAAGTGCT CTGCCAAGGT GGACCTTTTT AAGCCCACGA AAAAGCTAAC ACGAGCTCCT GAGGATAGCG ACTACAACGT TGAAAGTGCT AAATCCATGG TCGCCTCGGC CGGAATAGCG TCTGGACAAG AACAGTGCTC TATAGACAGA ACCGTCGGGA TCCCAACCTC CACCCTTGCT CTCTCAGCGG AAAAAGGCAA CCACCAATGG ATGTTTTTCA ATGTAACGCA GGGTTCTGGT GTCTACGAGA GCCTCCAGCA GTTTTCTCAC GAATTGGACA AGATTCCAGA TTTTGAGAAA AGCGCGTACA TAGAGGCTTG CCAAACCATC CCAATGATCT TGTATCGAGA GTCGAACCCA GAGATTTTCT TGCGCTTCCG TTGTGAAAAC GTCGCCGCCG CCGCACTCCT CTTTGCACAG CACTGGCAGG TCCGGAAAAA TGTCTTCGGG GAGCGAGCCT TTTTACCCAT GAACTCGACA GGAGAGGGAA CCTTATCACC GACCGATTTG TTGCTGTACC GCACGCACTA TTTGGTCGCT TTGCCCAACA ATCAAGATGG TTGTTCCGTG CAGTTCTTCG AACCTGGAGT CTTAGCCTCC TTTACACCAG CGCGTCTCCG CTGCGTTTTT TATACCCAAT TTCTGGCTAT GCACAACACT AACAATGCCA AAGATGGCTA CACAATTCTG ACGTACATGG ATGAAAGGGG CTTTGATGGC ATCTTTGGTC GTGAGACACC AGCCAGCCTC TTAAAGGCGA TGCCAGTTCG TTTGGGGGCC CTGCTCTTGT TCCACACCTT AACTGGTGAC GAGATGAATG TTTTTGAGCG CTGCGTTCCA CAACTCACTA GATTGTACAG CTGTCCTGTC AAAACATTTT CCTTGGCAAC TTGTACAAAA GAGGAGATTG GGGACTTCTT CCAGATCCGG GCTCTCAAGC CCGAAGCTCT TCCTGTAAAA GTTGGTGGAG CCCTTTCTCA GGCAGATGTT ATCAATCTGG ATAAAGCGAG AACTCAAATG GAGTGGCAAG CCCACGCTGA ACTTGCGCAC TCTGACTTGC TCAACCTTGC CGATACTGAC AAGGTTTTCC AATCCAAAAC AAAGGGCCAT GACTTATTGA ATGTGCCCCA AAGACCCAAA AGTTGGATCT TTCAGACGCA ATTAGACCAC TGTAATTCCA CAGCGCTCGA TGCCTTTGCG AAGGCGCTTC GCGAGATTCC TCAAAGCGAA AAATCTTCTT TTTTGGAAGC ACTTGAGTCT GCTCCAGACC TGCTCCACCG AGAAACGCAC CCAAGGATAT TTCTTCAGTT TGAAGCAAAC GATCCTTTAG CAGCTGCAAG ACGATTTGTT TCTCACTGGC AAACACGCCA AAGACTGTTT GGAGATCGAG CGTTTTTGCC CATGAATACA ACGGGGGACG GAGCTTTATC TAGTGATGAT ATCGAGCTTC TATCCTCCAC CTATATCACA CTTCTGGAGA AAGACAACGT CGGCAGAGAT GTGATTTTTT ATACTCCGGA ATTGGCTGGT GCTGAGTCAA CAGAGCGCCT GAAATGCGGA TTTTATACCC TGACTCGAAT TATGCAAAAT GAAAAAACGG TTGAGAACGG CGCCGTTGTG ATTGCTTTGC TAGGCCAAAC AGAAATGGGA CAGGCGGCAG GGAGGTCTGG AGTTGCTGAA ATGTTACGAC GGTCAATACC AATACGTTTC AAGAGAGTCC ATATAATTCA TTGCTTGAGT GGTTTTGAGG AGCAAGTGTT TGAAAAGAGA GTCCTACCAT CTGTCCTTGG TTTGTTTGGC TGCCAAGTGC TTGCGCACGA TGCAAATGCG CCATCGGAAG TTTTGGCTAC GCTTGTTGCT TGTGGCCTAA AGGAAGAACA CCTGCCTGAT TTGATCGGCG GAAAGATGAG TTTTGCAGAT ATTGTGGCTT TGAACGAAAA TAGAAGGGGC CAGGAAAGAG CAGATATTAC AGGCCTTTTA TCCGGTGTTG GAGAGCCTGG TGAAGTACAT GTTGCAGACG ACCTATCGCT AAACGGTGGG TTTGACCATG AGAACGGCGA TCCGCACTTT CGCGATGAAA TGATTGTCAA AAACCCAAAG GAAACGCAAG AAGTGAGGGA TTGGGAGCAA AGAAACGGTG CAATTGTTTC ATCGTCTGGA GAAGTCGAAG TCAACGAAGA AAATCTCCAC TTTCTTAAAG GAGAAAACCG TAAACGAATT CTCAATGTAA TCGCATCACG GCGAAAGCGG CTACGACGAA AAGAGCGCTA TGAATCTCTA GAAAAATTTT GCAATGACCT TTGCGCACGT AAGGCAGCTA TGCAAGAAGC AAATGCTCGA CTCGAGGAAC TTCTGCGAAA GGCCAGCGGC GTGGTGGCAG CATACAAAAT GTCAATGCGC ACGAGTATGA GCAGCAGCTT TCCCCTCGAC ACTTATACCT TGTCAACAAT GGGGTATGGT GCACATGCTT TACATGGATC TGTAATGTTT TCTCCACATG GTATGCCTAT TCCTGTGACT AACGGTCTGA TCGAAGAACA GCTGTGGAGG CATTTCATGG ATGGACAGCT TGTAACACAG GCAGCGGCGA ACGAAGCAAA TCACAATGCT TCCGTTGCAC AACGTCATAT GCTGCCGATG TTCCTGAGAC CGTCCGACAG GGGGGCCCGA CCGTACGGAT CATTGCCGGT CGGTCCTAAT CTATGTGGCG ATCTGCGAGA TATCGCGGGC AGGGCTCTGA CTGCTAATTC TTTGCGAGAT GGGCTAACCG CTTGGGAAAT TGGCGGCGCG CTTGATGCCA CACACAAAGA CCGTTTGTAT GCGGAGCTCG ATCGTAAAGT GCTACGAAAA CCCGATTGA
|
Protein sequence | MDLFLKAKKR SSSDGPRSGA LKCSAKVDLF KPTKKLTRAP EDSDYNVESA KSMVASAGIA SGQEQCSIDR TVGIPTSTLA LSAEKGNHQW MFFNVTQGSG VYESLQQFSH ELDKIPDFEK SAYIEACQTI PMILYRESNP EIFLRFRCEN VAAAALLFAQ HWQVRKNVFG ERAFLPMNST GEGTLSPTDL LLYRTHYLVA LPNNQDGCSV QFFEPGVLAS FTPARLRCVF YTQFLAMHNT NNAKDGYTIL TYMDERGFDG IFGRETPASL LKAMPVRLGA LLLFHTLTGD EMNVFERCVP QLTRLYSCPV KTFSLATCTK EEIGDFFQIR ALKPEALPVK VGGALSQADV INLDKARTQM EWQAHAELAH SDLLNLADTD KVFQSKTKGH DLLNVPQRPK SWIFQTQLDH CNSTALDAFA KALREIPQSE KSSFLEALES APDLLHRETH PRIFLQFEAN DPLAAARRFV SHWQTRQRLF GDRAFLPMNT TGDGALSSDD IELLSSTYIT LLEKDNVGRD VIFYTPELAG AESTERLKCG FYTLTRIMQN EKTVENGAVV IALLGQTEMG QAAGRSGVAE MLRRSIPIRF KRVHIIHCLS GFEEQVFEKR VLPSVLGLFG CQVLAHDANA PSEVLATLVA CGLKEEHLPD LIGGKMSFAD IVALNENRRG QERADITGLL SGVGEPGEVH VADDLSLNGG FDHENGDPHF RDEMIVKNPK ETQEVRDWEQ RNGAIVSSSG EVEVNEENLH FLKGENRKRI LNVIASRRKR LRRKERYESL EKFCNDLCAR KAAMQEANAR LEELLRKASG VVAAYKMSMR TSMSSSFPLD TYTLSTMGYG AHALHGSVMF SPHGMPIPVT NGLIEEQLWR HFMDGQLVTQ AAANEANHNA SVAQRHMLPM FLRPSDRGAR PYGSLPVGPN LCGDLRDIAG RALTANSLRD GLTAWEIGGA LDATHKDRLY AELDRKVLRK PD
|
| |