Gene Information |
Locus tag | PHATR_44128 |
Symbol | |
ID | 7203881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1059536 |
End bp | 1062780 |
Gene Length | 3245 bp |
Protein Length | 360 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186458 |
Protein GI | 219113749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
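The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length. The helper names `span_length` and `gc_fraction` are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table above.
START_BP = 1059536
END_BP = 1062780
REPORTED_GENE_LENGTH = 3245


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace is removed first, so the 10-base groups used in the
    Sequence section can be pasted in unchanged.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


if __name__ == "__main__":
    # The computed span should match the Gene Length field.
    print(span_length(START_BP, END_BP) == REPORTED_GENE_LENGTH)
    # GC fraction of the first two 10-base groups of the gene sequence.
    print(gc_fraction("CACAGGCACA GTGATCTTCC"))
```

Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.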
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGGCACA GTGATCTTCC GGCGTGTCTT TGCTATTCGA TGCTTAGTGT TCCACAGATT GAGATTCTGG ACGCGTTGCT TGGTACATAA AGGCTTGATC CGGCCAAGAC AAGCAAAGAT TGGTGGAAAG CAACGGCATC TGTAACTACA CCATGACCTC AACCGTCCCG GATGAGACTA TAGAAGCGCC CACGACGACG ACGACGGCGA ATTTGCCTGC ATCAATATCA AAACTTCCAA TTGTATGTTT AGCGCATGGC ATGCGTCACA AAGCTAACGC TTATGAATTG AAAAATTGGG TCGCGTTGGC GTTGACGTTA GACGGAAGGG ATAAGCTCAC AAAAATTTTA CAGTATGTTG CTCGTTTGCT TTCTTGGTGG TTTGCGGGCC GGGGAAGAAT CAACCAAGCG CAGCGGTTTG CGTCTCTAAA AAACAGCCTC ACAACGAGTC GTAAAGCGTT TCGCATGGGA CGAAGCGTCA TCGAGTTTCA TCGCCTCCGG AAAATGGGGA TTTTGGAAGC ACTCGGCTAT TACTTACAAC AGAATTTGGA TAGGAAACCA AGAGACGAAC CGGCTGGAAA ATGTCTTGAT ACAGGCGGAT ACGGTAAAAA TCAACAATCG CTGGTCCCTC TTCTGAATGG TTTAGCATAC CGATTGCAAC AGAAGATGGC GACAACTTTT TTGCTTACTG AACCCAAACT CCCCGACACA CCACTGTGGG CATTGTTAGG AACAGCTCTG AAAACCCTCG GTTTAATGGG CTTCTGGGCA GCAGACAATA TTTCCTTTCT CATAGCCTCC GGCACTTTCG ACAACTACCG ACTAGGATCT CGAGAACGCT TGGAACGACG GAATAGATAC GCTAGACGAG CAAGTGCAGT AGCGAATCGA GCTTATTTCG GAGGAGCGGT CGCCGGCTTA ATTGTTAATG TTCGAGGATA CCTTGATCAA CGGCAGACAA CCCTCCACTT CTTACAGCAA CGCCGAGAGC AAACCCAGAC AGCGGATGAA ATAGATTTTA CTAAAAAGGG TCTGGAGAAG GCCAGAGAGA AGCACTTTGA TTCGTTTGTG GCCCTCACCA AGAGCGTATG TGACGTTTTG GTCTTCAGTA ACAATCCCGG TATCGATCTC TGGCAACAGC AGATCGGACA CAAGATGCAC GAAGGCTTCC ATTGCCTATT TGGCTTGATC TCTGCTTCCG CTGTCCTTTA CAATAATTAT CCGGCTGCTT GTTGACATGC AGAAACCGTA CTATTGTGTA TTGTAATCGT TGAAGCGTTT AGAGCGCATA CAAAATCATT CCTGGTCGTC CAAACTCTCA AACAAGGGCT GCAAGTACTC GACTCCACAT GGGTCATCCC GCATCTTGTA CGCAGGATCA TCTTTTGGGA AGTCTTTGAA ACCATTAGCC ATGGTGAATG CCTCGCTCTG GATTGCTTCT AGAGCCATCA ATTCCCCAAA TTTGACATGC GCCACCATCC AGAGGCGTTT ACCGAAGTTT TCAACACGAA GATTCTCTTG AATGTTTTTG TGGCCATCGG CAGATCCCCG AATACGTTGA GTGGCCGTCA ACGCCATGCG AGCAACGCAG AACCTTGTGC GAAATTCAAG GGCTTCGTTG GATTCCGGCA GAACATAGCT TGCATTGCCA ATATGAGCCT CTAGTTGATC CGCCGCGGTG CCATTGTAGT CGGTATGCGT CATGACGATC AAACGACACG TCGAAAAGAT TAAACGGGTG TCCATTTTCA CGGTACGTGG GTTCATCGTT GGGATCTTGC TAGTAATGGG CTGATCCGAA 
TTTCGAGCAG GCTCAGACAG CGGACTCGGA GGCTGGACGA AAACATTCCA ACGTTCAGGA ACCTTCAACA AATTCATCAT TAGCCTAAGC TTGCGTTGCT TGAACGCCCA AGCTTCCTGT TGATTCTGAT CGACGCAGGA CTGGTATCCA TCGTCATACT GCAAATAGTG AAGCGCCTGG ATGCCACGTG GTGTATCGTT ACCGTTCGCA CTCAACGGGC TGTGGAAGAT GGCAATACTG CCTTGGATGT ATCCACTGCC ATCGCCATTT ATAAAGCCGT ATTTGGCAAA CAAATGTGAG TCGGTGTGTT TTCCGTAAGA ATCCATGATC TCGGTTCCCA CAGGAATCGT CTTTAATGCG CGTACTTCGA AAACCCGGGC CTCTGTATTG TACGTAAAGC CTACGTTAGG ACGGGCATGG TGGTTGTAAA GATCCAGAAT CGGAACCATT GCATGCGAGC CCTTGGAGAG GTCGATGCCG TGTTGCGCGT ACATTTTGAG TTCCGAATCA AGAGACTGAT TCGGTGCCGC TTCGTTCGCT GGGAGAGGTC CAGTGCCGAA GCTACGAGTC CAAACATTCA AACGGGCAGT CTTATAGGCT TCGCGAGACA CTCCCTCTGT GAAGTCGACC GAAACCTCCT TAAAAGCCTC ATATTCCGAT TCCATTTGGT TCTGCATCGA TTTAATAGTG TCGTAAGCTG ATGTAGCCGT TAGCGCTAAT TCCAGCTCTT TCTGGGACCA GAGAAGAGGA TGGAACTCGG CAAAATCACT GTATTTCGGC AGCACGTTAA TGTAGTCGCG GTAGTGCTCC ATTGCTTCTA CATCATCCGT GTTGTTCTTT CGAATTTGTC CCTGGATTTG AGCCAAACGA GCAGCTAAAA CAGCACCGGC ATCGATGAAC TCTGGGCTCG TCTTTGTTTT TGCTTTCAAA AGCTGGAGCC CTAGATTGCT GCGCAAGGCA TCCAAATCCC AAAGTTGCTT GTCTCGTGGA ATTGTCACAA GCATGGTACC TTTACGAATA TCGTGGGATG CTTGGTGAGT TCTGCGGGTA GCCGCAAGAA CGGGATTGCA ACCTGCCTTC GTGTCCGACG ATGTATCCAA TTCTCCGCAG ATCGATCTCA AAAGCGAAAT CACCGTCTCG TCTACCCAAT CTTTCGCGAT GTGATTCCAA AATAGTGCTG TCGCCATTGC AAGAAACAAC ACCAAGGTAG CTTGCCAAGT AGATTCAGAC ACGCCGGTTA GACCAAACCA CGATCTTCTT CTTTCTCTTC AGCTTCTTCT TGATGTCTGA AGCATCGACT CCTTTGCTCT CTCCTTTGAG TCGGGAAACG GCCATCGTTA CCAAATCTAT TTGGGTATTT TCCGTGAAGA GTCTGGTCCC CCAAAAGGAT CGCCCCAGAA ATTTATGGTT GACTAGTTTA CTGCGATCCC CTCTTGACAT ATTTG
|
Protein sequence | MTSTVPDETI EAPTTTTTAN LPASISKLPI VCLAHGMRHK ANAYELKNWV ALALTLDGRD KLTKILQYVA RLLSWWFAGR GRINQAQRFA SLKNSLTTSR KAFRMGRSVI EFHRLRKMGI LEALGYYLQQ NLDRKPRDEP AGKCLDTGGY GKNQQSLVPL LNGLAYRLQQ KMATTFLLTE PKLPDTPLWA LLGTALKTLG LMGFWAADNI SFLIASGTFD NYRLGSRERL ERRNRYARRA SAVANRAYFG GAVAGLIVNV RGYLDQRQTT LHFLQQRREQ TQTADEIDFT KKGLEKAREK HFDSFVALTK SVCDVLVFSN NPGIDLWQQQ IGHKMHEGFH CLFGLISASA VLYNNYPAAC
|