Gene Information |
Locus tag | PHATRDRAFT_50441 |
Symbol | |
ID | 7199304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 70709 |
End bp | 73130 |
Gene Length | 2422 bp |
Protein Length | 776 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185372 |
Protein GI | 219130438 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
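The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive; the record does not say so, but the assumption reproduces the listed gene length. It also assumes one stop codon follows the 776 codons of the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 70709
end_bp = 73130
protein_length_aa = 776

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Coding region: one codon per residue plus one stop codon (assumed).
cds_length = (protein_length_aa + 1) * 3

# Bases in the gene span that are not part of the coding region.
non_coding = gene_length - cds_length

print(gene_length, cds_length, non_coding)  # 2422 2331 91
```

Under these assumptions the 2422 bp gene span exceeds the 2331 bp coding region by 91 bp, which suggests the listed gene sequence includes a short stretch outside the coding region.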
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.712504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGCTTTCG TTACTCTCAC AGAGATACTC CATTCTCATT GTCATTGTCA CCCACACAGT CCATTCAATA GTACGTACGT AGCCAGCCCC CATGGACTTT TCGTCGATTC CTCGTGCCTT GGAATCTCTC CAACGCCGTC AGCCTCCGCT CGACGAGGAG ACGGTGTTGC GTCCCACACG AGCCCTCGTA GCGAACCCGT CCGCGGACGT CTTTTTATCC TCCATTACGT CACTCGTCTC GTCTACATCG TCTCGTTGGG AACCACTCGC GGTGGGTCTT TACGTGGCGA CGGAAGCCCT CACGCAGCAC GGAAAAGCCC TCGCCGCGTC GTCGTCGTCG TCGTCAACCC CCGACGCGGC ACCGTCCGTG TATCTGGAAG GTCCACGGGT CCCATCCACC CGCGACGAAG GAACACCTCT GGAATCGGTC GTACCCGTCC TCGACACAAA CCAGGCTTTG GTCCTTTGTC AGACCCTTCA CGACGTCGCT TTGCGACATT TGGAACACGA CGAGCCCCGG GTCCGTACTC TCGTCGCCAA GGCCGTGGGA GCCTACGCCA AACTTACCGT CGAACTCGAT GATGCACTCC CGCACAATCG TCAAGCTCTC CACGATCGAC TCGTGCAATC CATTCGAACA CATATTCAAC AAGGTAGAGA CGAACAACCT GATCCACAAA ACGATCCACA CACCAACACT CCCGACGATG GGGACGATGA TGGGGACCGA TCCCCAAAGT ACAGTAAATC CTCCACGGGA GCTCTGGACG ACACCACGGG ATGGCGTGCC TTGGAAACCA ATTGGCAGTG TCTCGCTTCC TTGATTCGGG CACTCGGTCC GGCTTACGTC GTACACTTTG GGGTCCCCCA AACCGTTCTG GACGATTGTC AGTACAGTTG TATCGAGCAC GTCAATCGCC ACGTCCGGGC GGCCGGGATT TCCGTGCTGG AACAGTGGCT TTACGCGGCA GCAGCCGGAA GCCCTGTGCA ACAGGCCCTC TTGACGGAGT CCGACGGAGT TCTGCGCAAA ACCTGCCGGG TTGTACTCAA ACACGGGCTC GCCGACAATT GGTCACAAGT CCGGATGGCC GCCAGTGTCC TCTGTCGCGT TTTGTTCACC ACCCTACAGG CTCTACAGGC ACCAGCGGAC GATTTGTATC CCGTACTCCT ACCCCGCATG TGTTTGAATC GATTCTATCT CGCGCAAGGG GTCAAGCTCT ACAGTCACGA AACGTGGAAA CTCGTCTTTG TCGATTCCGG CGTGTCCCTG GTGGCCGCTA ACTTGCCCGC CGTTTGCCGT TACTACGTTC AAATGTGCGA CGCCGATAAT CACGTCGTCC GGGAAGCCGC TTGCCAGGCC GTCGCCGAAC TGGCCATCCG CCTCGGGTCG GATCCGAACC ATCACGACGA GCTCCTGCCA CACATGGACC TACTGCTGCA GGCCTTGCTT ATGTGTTTCC ACGACGAGTC CTGGCCCGTG CGTGACGAAG CCTGTTTGGC GTGCGGCCTC CTCTGCAAAG CGTATCCCGA GTCCTGTCGT CCCGAACTCG GTAAACTTTG GGAACGCTGG ACGGGACAGC TCACGGACCA GATTTGGTCC GTCCGCGAAG ACGCCGCGGT AGCCTTGGGC GACGCCCTCG AAGCCTATGG TGCCGACTTC CTACAGGAAC TCCTAGCCCT CGTTGACAAG CTTCTTCCGT CGGCTCGCAG CCAATCCGCC ATGACGCCGA CCGAATACAA GGCCCGCCAA AACGACGCCG CTGCTCACAC CGACTCGCAA TTGTACAGTT GCGGGAGTTT 
GGCGCCGAAG CTGCGCAAGG GCGGTGCCGG CCGCATCGGG TGTTCCTCGT GCGACGTTAA TCGCGAAAAA TCGCCCTGGG AAGCGACAGA CGGCTGCGTC TACCTGATTC GCGAACTCGT GGTGCGGTGT GCCTCGCCGG AGAGTCCGAC ACCCCTTGCG GACGAAATCC TGCTTCCCAT GCTCCGGGAA CTAGCTGACG TGTGTCGGGT ACAGCACTTT CCGCAATCGG ACGATTTGCG CACCACGTTG TGGAGAAATC TTCCTGGAAT GGCGGAAGCG CTCGGCAAAC AGCGTTTCAA GCGGTTGTAC CTGGACGTCT TCCAGAATTT GTTGTTCTCG AGTTTGGATG CGCGGTCGTC CTCGCAATTG TCACAGCACG CGGCGGGACA GTGTGCGGAA GAACTGGCCG ACCTGGTAGG CCGGACTATT TTCCGGGCAC GTTTGGAAGA TGACCAGCGG GATACTTTGG ATCGCGTTTT GCGCGAACGC GCTGCCATAC CGGCGGGGCC GGCAGACGGG GCCGTCTTTT CGCCCTTTGG ACCGCCCGGA TTGCTCGATC ACATTCACAA GGGTACGGTG CATCCGGGTG TGGCGGGGAT GACGCGAGGA ACGGCGCCGT GA
|
Protein sequence | MDFSSIPRAL ESLQRRQPPL DEETVLRPTR ALVANPSADV FLSSITSLVS STSSRWEPLA VGLYVATEAL TQHGKALAAS SSSSSTPDAA PSVYLEGPRV PSTRDEGTPL ESVVPVLDTN QALVLCQTLH DVALRHLEHD EPRVRTLVAK AVGAYAKLTV ELDDALPHNR QALHDRLVQS IRTHIQQGRD EQPDPQNDPH TNTPDDGDDD GDRSPKYSKS STGALDDTTG WRALETNWQC LASLIRALGP AYVVHFGVPQ TVLDDCQYSC IEHVNRHVRA AGISVLEQWL YAAAAGSPVQ QALLTESDGV LRKTCRVVLK HGLADNWSQV RMAASVLCRV LFTTLQALQA PADDLYPVLL PRMCLNRFYL AQGVKLYSHE TWKLVFVDSG VSLVAANLPA VCRYYVQMCD ADNHVVREAA CQAVAELAIR LGSDPNHHDE LLPHMDLLLQ ALLMCFHDES WPVRDEACLA CGLLCKAYPE SCRPELGKLW ERWTGQLTDQ IWSVREDAAV ALGDALEAYG ADFLQELLAL VDKLLPSARS QSAMTPTEYK ARQNDAAAHT DSQLYSCGSL APKLRKGGAG RIGCSSCDVN REKSPWEATD GCVYLIRELV VRCASPESPT PLADEILLPM LRELADVCRV QHFPQSDDLR TTLWRNLPGM AEALGKQRFK RLYLDVFQNL LFSSLDARSS SQLSQHAAGQ CAEELADLVG RTIFRARLED DQRDTLDRVL RERAAIPAGP ADGAVFSPFG PPGLLDHIHK GTVHPGVAGM TRGTAP
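The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that step, using only the first 100 bases of the gene sequence copied from this record. The helper names `clean` and `gc_fraction` are hypothetical, and the GC value it computes applies to the excerpt only, not to the full gene.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 100 bases of the gene sequence, copied from the record.
excerpt = clean(
    "CCCGCTTTCG TTACTCTCAC AGAGATACTC CATTCTCATT GTCATTGTCA "
    "CCCACACAGT CCATTCAATA GTACGTACGT AGCCAGCCCC CATGGACTTT"
)

# 0-based index of the first ATG in the excerpt.
start = excerpt.find("ATG")

# Complete codons from that ATG to the end of the excerpt.
codons = [excerpt[i:i + 3] for i in range(start, len(excerpt) - 2, 3)]

print(len(excerpt), start, codons)  # 100 91 ['ATG', 'GAC', 'TTT']
print(round(gc_fraction(excerpt), 2))
```

Assuming the standard genetic code (the Translation table field is empty in this record), ATG, GAC and TTT encode M, D and F, which matches the first three residues of the listed protein sequence.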
|
| |