Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43550 |
Symbol | |
ID | 7197581 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 791838 |
End bp | 793912 |
Gene Length | 2075 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178004 |
Protein GI | 219112505 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCATTTACA ATCCTTGTGC ATGTACGTAT CCCCGCTAAG CCCGTAGTGA TTTATGAGAA AACCCGCGCT GCTGTTGTTG CATTCACCAG CATCGTTGAA TTGGTTTTCC GGAACCTTTG TTTCCGCCAG AGGAGTTCGC CCTGAGCGTG TCGCTCCTAG ATATCCAATC TGGACCAAGT CTCTGCATCG ACACGGCATG TCTAGCTATA GCGGCGGTGA TCGAGGAGGC CGGGGACGAG GCGGTGGAGG TCGTGGTGCC TACTATAAGA ATAAATACGG CGGTGGCGGA CGAAACAGTG GCGACGCTCG CGATCGAGGT CCTCTGGGTG GAAACGGTAA CCTGGGAGAC AATCACCGCG CACGGACTAG TACCAACGGT GGAACCTTCC AAGATTTAAA GCAACTGTTA CAACACATCG ACGGTCGTCA GTATCCCGCC TATCATGATT TAGAAACTGC TCCAAATACG GGATGGGTTC ATCCCGAGGG ATTTGTCTTA CAAGTCGGAC GTGCTCAGGC AGATCCGTTC GCCCCGCCTA CCCGGTGTCG CGTCACCCTT CCACCTTCCG TCTCGCGCAT TTCAAACTCT TTCTATACAA ACGCTACGCG GCGCATGGCG ACTGGTGATT TCTTGCTGCG GCGCTTGTAT GGCAACTGCA AACGTGTGGG AGCCGATCAT AGCTTGCGTA GCAGTAGTGG TGGTAAGGGT GGATGGAGTG GACCCAAAGG AGGCGACGTG CAAGTGCTGG AGCCTACACA GAATGTTATC GAGCAGTCAG CGGTTCAAGT TGACGAACAA GGTAACATTC TATGCCAAAT TACCATCAAT CTTCCCGCAA AGGGGAGATC AATTATGGGT CACGCCGCAC ACGAAATCAT GGACGCTGTG CTACCGCAAC TGATCAGTGA TAGTCTGATG TTCACCTCGA TGAACTTGGA CAGTATACGC ACGCACATCG AGTCGGTAGA GGATCAAGCC TGGCTGCAAC AACAGCTGGA TACTGCCGGA CTGGTATCTT TCGTGCGGAA CGGGGCAATT CTGCCCCGTG TCTCCGGAGT TGAAGATCGC CCCATGGCGG GATCGGTCGT TGCCTTTAAG TCTCCTCCGT CGCTGCAAAA GGAGTTTACC CTCCCCATCA GCGGCGTGGT AGTCCAGGGC ATGGGAATTC GCAAGGGTGT TACCCTGATC TGCGGTGGTG GCTTTCACGG CAAATCACCC TCTTACAAGC CATACAAAGC GGAGTCTATC TGAAAGTACC TGGAGACGGC CGGGAATTTT GTGTGACCAG CAGCCAAGCG GTCAAAATTC GTGCGGAAGA TGGCCGTGCT GTCCAAGCGG TCGATATTTC TCCGTTCATC AACAATCTGC CGTTTGGTAA AGGTACATCT TGCTTTACCA CGTCGGATGC GAGCGGAAGC ACAAGCCAGG CAACAAATAT TGTGGAGGTA AGTGAGCAAA CGTGTGCTTA TGTCGGGAGA GAGACCATGC ACTGGCGAGT TGTATTTCCG CAGAGCAAAC TTCCATTGCT ACTGAGGAGC GTTTGACCGC CAAGAGAGAT ATCAGGACCC GACTATTCGT TGTACTGTGC CGCAATTTAT CATCGGTACA ACCGACATTT CGTCTTTTCC AACACATGCT TATGACTTCT TCTTCATAGT CAATCGAACT AGGTGCAGAC ACCCTTCTAG TAGACGAAGA TACGTGTGCA ACGAACTTCA TGGTTCGAGA CAATAAAATG ATGGAGCTTG TTGCTTCCGA CAAAGAACCA ATCACACCGT TTGTGCGTGT TATCAGATCC TTGTATGAGT CACAGGGGGT TTCTTCGGTT CTAGTTATTG GTGGACTTGG CGATTATTTC GACGTAGCTG ACCATGTGCT ACTAATGGAT TCGTATGGAT GCCAGGATGT CACAGCACAT GCGAAAGAGA TTGTTGCTCG AAGCGGATCA GATTCTGCTA AACTGCAGGT AAAATTTGGA AAAATTCGTC AACGATTCCC GGTACTAGAC ACATTTGCTG CTAATGGAAA GGTTAGAACC CCGGCAAGGG GGGTTATATC GTACGGCGAC GTTGA
|
Protein sequence | MRKPALLLLH SPASLNWFSG TFVSARGVRP ERVAPRYPIW TKSLHRHGMS SYSGGDRGGR GRGGGGRGAY YKNKYGGGGR NSGDARDRGP LGGNGNLGDN HRARTSTNGG TFQDLKQLLQ HIDGRQYPAY HDLETAPNTG WVHPEGFVLQ VGRAQADPFA PPTRCRVTLP PSVSRISNSF YTNATRRMAT GDFLLRRLYG NCKRVGADHS LRSSSGGKGG WSGPKGGDVQ VLEPTQNVIE QSAVQVDEQG NILCQITINL PAKGRSIMGH AAHEIMDAVL PQLISDSLMF TSMNLDSIRT HIESVEDQAW LQQQLDTAGL VSFVRNGAIL PRVSGVEDRP MAGSVVAFKS PPSLQKEFTL PISGVVVQGM GIRKGVTLIC GGGFHGKSPS YKPYKADQAV KIRAEDGRAV QAVDISPFIN NLPFGKGTSC FTTSDASGST SQATNIVESI ELGADTLLVD EDTCATNFMV RDNKMMELVA SDKEPITPFV RVIRSLYESQ GVSSVLVIGG LGDYFDVADH VLLMDSYGCQ DVTAHAKEIV ARSGSDSAKL QNPGKGGYIV RRR
|
| |