Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43572 |
Symbol | |
ID | 7197306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 844048 |
End bp | 845699 |
Gene Length | 1652 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178018 |
Protein GI | 219112533 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCGATACA TTTTTACACG ACGCGAGATC TTAGCCTGAA CGTATCCAGC GAACAGTTGC AACGTTCGTC TTTTTCTTCC AGTCACAACG CGGTGTGTAA AATGAAGGCC ACAGTGTTAG CGGCATCAAT ACACACGGTG CTATCGCTGC AAACACCATC GCAAGGACTC GTTTCATCGA CAAGACGCCC ACGGAACACA CGGTCGCTGC GACAGCCTAC ACGATTGCTT GCTGTGACCG AGGACTTGCC GAAGATTCTC AGTACGCCCG TGTCCACCCG AACCGAGTTC GCTTCGTGCG CACCGGAACC AGTTGGGCTC TACATACATA TCCCCTACTG TCAACGCCGA TGTCGTTACT GCAATTTTGC CATTGTCCCG ATTGGATCGG CAGCGGCAGC CCGAGCGCAA TCTGATGAGG GTGAACGTCC CATCAACGCA CAACTTTCGG GCTTTCTGGA AATGGATCAC ATGTACACAC AAACAATTTT AAAGGAACTG CAATGGACGC TTTCCAAGAT GCCGCAGGAT CAAAAGGTGT CCCTGACGTC AATATATTTC GGAGGCGGGA CGCCGAGTCT AGCACCGGTT GCAACGATTC GCACCATTCT CCACGCTATC CTGGCGGAAG ATACACCTTT TACACTGAAA GGTGGCGCCG AGATTACCAT GGAGATGGAT CCTTGCACCT TTTCCAAAGA TCAGCTCCAG GAGTTGAAGG AACTGGGTGT CAATCGTATC AGCTTGGGCG TCCAAGCGTT AGACGATGGA ATTTTGGAAT CGCTGGGCCG TATCCATCGT GTCCAGGATA TATATAAATC ACTTTCTATG ATGCAAGAGG TGTACGGAGA TGAGTTGAGC TACTCGTTGG ATCTGATATC CGGACTTCCC GGTCTGTCAT TAGCAGCGTG GACCGAAACG TTACAAAAAG TCGTCACGCT GGAACCGAAA CCTTGTCACT TGAGTCTGTA CGATTTGCAA GTAGAAAGTG GAACCGTCTT TGCTAAATGG TACGGCAATG GTGACGAAGA GTCCGGCTGG GATCGTGTCC GCGGAAACCT CCCCACACCG GCAGTGGCGC TACCCTCCGA CGCCGAGTCT GCCTTCATGT ACCAATACGC GGCGGGCTAT CTACGATCAC GAGGCTACGA ACACTACGAA GTGAGTTCGT ACGCATTACG GGACGAGACT CAGACTGGTC CTTCACCGTG GCGCAGTCGG CACAATCAAA TTTACTGGGC TACAAACAGT CAGTGGTACG CACTAGGTCT CGGGGCTACC AGTTTTGTCG CGAACGAACT GGTCGCGCGT CCTCGAACCT TGGTGGACTA CGCGGACTGG GTCAATCGTG TCCGTACACT ACCGGATGCG GGTGTGAGCG AAATTGTCGA TACCGAGCTG TTGCTGAATG TTGTATTGAA GCGCTTGCGC ACGAGCGAAG GACTTGATTT GGGGTTCGTT CACCAACGAT TCTCTCCAAA AGGAGACGCT TTCGTCGACG CGATTCAACG CGGGGCTGCC TTGGCGCTCG AGCTCGGTTT GGCGCAGCTC AATGACAACG TCCTCCGTTT AGTTGATCCC AAGGGATTCC TGTACTCCAA CACAATTATT GCCAGTATCT ATGCAGAATT GGAGGAGACT GCTAACTCAT AG
|
Protein sequence | MKATVLAASI HTVLSLQTPS QGLVSSTRRP RNTRSLRQPT RLLAVTEDLP KILSTPVSTR TEFASCAPEP VGLYIHIPYC QRRCRYCNFA IVPIGSAAAA RAQSDEGERP INAQLSGFLE MDHMYTQTIL KELQWTLSKM PQDQKVSLTS IYFGGGTPSL APVATIRTIL HAILAEDTPF TLKGGAEITM EMDPCTFSKD QLQELKELGV NRISLGVQAL DDGILESLGR IHRVQDIYKS LSMMQEVYGD ELSYSLDLIS GLPGLSLAAW TETLQKVVTL EPKPCHLSLY DLQVESGTVF AKWYGNGDEE SGWDRVRGNL PTPAVALPSD AESAFMYQYA AGYLRSRGYE HYEVSSYALR DETQTGPSPW RSRHNQIYWA TNSQWYALGL GATSFVANEL VARPRTLVDY ADWVNRVRTL PDAGVSEIVD TELLLNVVLK RLRTSEGLDL GFVHQRFSPK GDAFVDAIQR GAALALELGL AQLNDNVLRL VDPKGFLYSN TIIASIYAEL EETANS
|
| |