Gene Information |
Locus tag | PHATRDRAFT_48573 |
Symbol | |
ID | 7194736 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 239493 |
End bp | 241824 |
Gene Length | 2332 bp |
Protein Length | 648 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183188 |
Protein GI | 219125858 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.017831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAGGA GACAGCTACA TTCTCAGCTG CAATATCAGA ATTGGGAATC CTACGTTTGG TCACTAGGAG TTCGTCGTCA CCGGTTCCGC AGGCTTCTGC GATCTTCGAG CGAGAACGTT CCTCCGGTTG TATGCGAAAA AACTCGAACC CCGCTGACGG AAGCGGGACG AGGGTGGGGG TCTCCAAAAC CAATCCCGTC CGTACCAATT GCCGCAGCCC TGGCCCTTTT TTCGGGATCG GAACGACGTT TTTGGCTCCG CCCAACGCAC GCGAAAAACC ACCGGGGGGC ATCGCCGTCG AACCCAACGC CCCCAACCGC AACCATAACC ACAACTCACA GTCAGGCAAA CACAACCCTC TGCACAACGT CCATTCGGTT CATTGGAAAG GGTTGTTTAC CCCTTTTGTC CTATAGGGTT CATCGTCTCA CATTTTGTGA ATCGTCTACA GTAGCATTGC TCTCCGTCCT TCGTCAAGAC AGTCTCTTTC TCCAACACAA TGGCTAAAAA GAAGACTGCA GCCCCCAAAC CGGTTGAAGA AGCGGCAGCC GAAGTCGAAG CTGTCCTGGC GGCTCCGCCG GTGGAGGAAC CGGAAACGCT TGGAACCGCT CCCGTCCTCA CCCATACGGC ACCCTCAATG GACCCCGGGG CCTTTGAAAT ACTCACGGAC GTCATCGAGT ACGAGGTGGA AATGATCTTT TCCGATCGTG AGAAGCGTGC GGAGCGCTGG AAGACTCTCA CTGTAAGTGC CAACGACACC TGACGGTCGC CTGCGTAAGA TCCGCTAACA CTGGTCTATC TCTTTAGTAT TTTTGGGACC TGCACCCGGC GCGCAACTGT AACTTTGGTC GTCTCTACCT TCGTCGTTAC CTGGCGGAGC TTTTGGTGAA CGCGGTTGGA GCCCCGCCGT ACGGGACGGA CAAGGACGAG GAGCTCGGTT TGGTGACTAC CACACCCAGA GTAACTTCGC CCGACAACCT GGCAAAGTGG GACGTGGAAC GCTGGTCCGA CTTGCAAGAT GATCTCGAAC ATATTCGTGA TATCCTTTTA CGTCAAGATG CCGTCCTCAA TCTAAAAGAC GAAAGCATGG CGGAGAACCC CGAAATGGAC GAAGAACGAG CGGAGATGGC CAGTTTTGGC AAAAAGTATC AAATACTCAT ACAGTGTCTC GAAGTGAACA AGGCTATTGC ACCTATTCGT TGGGATCTCG AGGAGCAGAT CTTGAAGAAA GACGAAGGTC AGCCGGAACT GGATAAGAAA AAGGTTGCTG AAGTTGTCAG GATCATCAGC GAGATCAAGA ACGAGCGCTG TGTGGAAGTA GCCGAAACCC TTTTTGCCAA GAAGCCCGCG AAGAAAAACA AAGGTACCGA GACTTTTGAA GACATATCGG CCCTCGCCAA GATCTTGGAC TCGAAGCCAT TCAAGTACGT ATTCCGGCAA AAACCATCGC ATCCATGTAC ACATTTCTTT CTTTCGACTG CTTACCTTTC CTGTTCTCCC TTTGCAGCTT GAGTTACGTG CAGAATAAGA TCGAGGTCGT CCTCCGCCGT TGGAGTTGTG ATACGTTCGG CTCCGATGAT CCCATGCTGT TTCGTTTACG TTACGGCATC CCTCCCAGAA GCGGTCTCGC CTTGCTCGCC TCTACGCCCG CTTTGGCTCG ACGCACTCCA AGCAAAAGTA CCGGCGGACA ACGCAGTCCC GTCCCCTCTA TCAAAAAAGA AGACGAGGAC GACGAGGACA TTCAGCGCCT TCGCGAGAAC CGTGTAGCTC TCGGCGAAGG ACATGGTGAA GATCCTTTGG 
AGGAAAGCCG CACTTTGGCA GCCGAGGCGG CCGGGGAAGG CCGCAAACGT ACGGCGGACA ACCAAGACGA GGACGAGGAA CAGGACACGA AACGCGTTCG TCGCCTCGCA ATTGACGATG AGTTGGATGA AGACGACGAA GAGGACGAAA AAGAACGTGC GGCTCTCAGC GAGTTGCCCC GTCGAAGAAC TCGTCGCCCC TCACGCCGCT TCACCGGCCC TCCTCCCGAT GACGGTATCT TTGACGAGCA AGGCAAGGTT AAGGCTCGCC GCAAATGGGC CGAAGAGGAA AAGAACGCCG TCAAGGTTGG GTCCCAAAAG TTCGGCGTGG GGAAATGGGC GGAGATCAAG AAGGAGTATG GTGACATCCT GCGCAACCGC ACTTCCGTCC AGATCAAGGA TTGCTGGCGA ACGATGAACA AGAATAATGA AGTCTGAAAT AAGGTCGAAG CCAAAGCGGT GGATGTTGAA ATTTGGGCAA GGAACGCAAA AGTATGTCGG TGATTAATAA TAAAACTATT GGTTTTCCAC GC
|
Protein sequence | MVRRQLHSQL QYQNWESYVW SLGVRRHRFR RLLRSSSENV PPVVCEKTRT PLTEAGRGWG SPKPIPSVPI AAALALFSGS ERRFWLRPTH AKNHRGASPS NPTPPTATIT TTHISFSNTM AKKKTAAPKP VEEAAAEVEA VLAAPPVEEP ETLGTAPVLT HTAPSMDPGA FEILTDVIEY EVEMIFSDRE KRAERWKTLT YFWDLHPARN CNFGRLYLRR YLAELLVNAV GAPPYGTDKD EELGLVTTTP RVTSPDNLAK WDVERWSDLQ DDLEHIRDIL LRQDAVLNLK DESMAENPEM DEERAEMASF GKKYQILIQC LEVNKAIAPI RWDLEEQILK KDEGQPELDK KKVAEVVRII SEIKNERCVE VAETLFAKKP AKKNKGTETF EDISALAKIL DSKPFNLSYV QNKIEVVLRR WSCDTFGSDD PMLFRLRYGI PPRSGLALLA STPALARRTP SKSTGGQRSP VPSIKKEDED DEDIQRLREN RVALGEGHGE DPLEESRTLA AEAAGEGRKR TADNQDEDEE QDTKRVRRLA IDDELDEDDE EDEKERAALS ELPRRRTRRP SRRFTGPPPD DGIFDEQGKV KARRKWAEEE KNAVKVGSQK FGVGKWAEIK KEYGDILRNR TSVQIKDCWR TMNKNNEV
|
| |
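The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the original record: the function names `gene_length`, `min_coding_nt`, and `gc_percent` are hypothetical, coordinates are assumed to be 1-based and inclusive, and the protein-length check assumes one stop codon is counted in the coding sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


def gc_percent(sequence: str) -> float:
    """GC content in percent, ignoring the whitespace used to group bases."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record: Start bp, End bp, Protein Length.
length = gene_length(239493, 241824)
coding = min_coding_nt(648)
print(length)           # 2332, matching the "Gene Length" field
print(coding)           # 1947
print(length - coding)  # 385 positions not accounted for by the protein
```

Passing the full "Gene sequence" field to `gc_percent` gives a value to compare against the reported GC content; the spaces that group the bases in tens are stripped before counting. A gene span longer than the coding length is what one would expect for a gene the record marks as spliced.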