Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45942 |
Symbol | |
ID | 7201149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 730135 |
End bp | 731826 |
Gene Length | 1692 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180107 |
Protein GI | 219118678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.617552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTTTTGTT TGGTATTGTA CGTGTACGGG TGGATCTTGC TGGCTGCCGG TGCGTTGGTA AAACGAGAGT CTCTTAATAT TCATGATCCC AGCTATACAC TTTCCCCGTC GACACTCAGT ATTACTACTG CTATGTCATC CCTAGCCTTT AGTCGCGTAA CCAAAATAAG CGTCTTGGTA CCGCTTCTGG GTGGGGTGGT GTTGCTCGTT TCGATCCTCA GTCTTTCTAT TGCTTCCAAT GGTCTCTCGC ATCTCGAACA AAGCAACCCT ATCCATACTG CGCTTTCGCA ATTTCACAAC GAGAATTCTC TGTTGGAAGG ACAAGGAGAC GGGGCTCCGG TCGCGGACAC ACGCCCCGTT CTGGATCCCC GGAAACTCTA TCCTTTCGAA GACCCCGATC CCAATCCACC CCTGGCAGAC GGTCACGATA CTTTTTCCGC CTGTATGCTC GTCATGGACG ACAATCATCG TCTCGTGGAG TGGCTCGCCT ATCACTACCA CGTACTGCCG CTACGCCATT TAGTGGTTAC CGTGGATCCA CGCTCCCAGA CCTCCCCCAC ATGGCTCTTT AATCGCTGGC GCAAACAGGG CATGGTCGTG GAGCAGTGGG TCGATCGGGA TTTTTGGCGA GCCGATCTCC AGCTCCTGCC TCTCCCAGCC AATTCCACCC TCCAAATAAA ACGCGATCGA CACCGGGGAC GCCAAAAGTT CTTCTACCGA TCCTGTCTTA TTCATTTGAA GACCCTGAAT CGCACCTGGG TCGCTTTGCA TGATTCGGAC GAGTATCTCC TCTACAACCA CGCCGGCGGA GACCGCTACG AGGCCTGGGA AGCGCGGATG CAAAAGCGTC ACGATAACAG CGCTCAGCAC GCTACATCCG CGCGGATCAA ACCATCACAT CCACCTCCAC CCACGCCCGG AGAAGAAGGC GGAATGATTC GGTACATTCG TCAGGAACAG GCGGCGGGCG TGGAATACTA CCAATCGCCG TGTATAGGCG TGCCTCGTTT GAGCTTCGGG GCGGTCGAGA GTTCGCGCGC CGCCAGGGAA GCCGGTATGC CCCAATCATC CACCACGTTG GATGCCTTGC AGTTCGACAC CCTCCGGTGG CATCGGCACG CTCCCCGCAA CGATTTTGTC AAGAACGCAC TTGGCAAAGT CCTCATGGAT GTCTCTCGCA TCGATGTGGC CAAATCGCCG TATTTTATGA GTCTACATCG TCCCATTAAG AGTATATGCA CGCCTCCTTG GCATAACGAC TGGACGTCGG GATTACGAAT CAACCACTAC TTGGGATCCT GGGAGTCATA CGCCTTCCGA GATGATAGTC GACGGGGTGG TGAACGGTCA CGGGAACAGT GGGAGTACAA GGCCACCACA CATACCGATC AGACTGACGA CAACATACGT CCCTGGTTGC AAGGGTTTGT CGACGCCCAA GGGTTGTCGC AGGCGGAGCG GCTATTAGAC AAGGCTGGCT TACCGCAAGG CTACCGGGTC GCAAACGAAT CTCAATGGAA TTTGTTGCCA GAAAAGTTGG CCAAGATTCT GAGCAGCGAC GTGACGATTG CGAACGACAG CAAAATGGTG ATGTTCGATG CTTGGGTCCG GGCCAAGTAT TGGAACGAAA CATTTGATGT GCAGGAGACG TATCGAAAAG TACGGAATGC CGCTGCAGCG CCTGCCGTGT AA
|
Protein sequence | MSSLAFSRVT KISVLVPLLG GVVLLVSILS LSIASNGLSH LEQSNPIHTA LSQFHNENSL LEGQGDGAPV ADTRPVLDPR KLYPFEDPDP NPPLADGHDT FSACMLVMDD NHRLVEWLAY HYHVLPLRHL VVTVDPRSQT SPTWLFNRWR KQGMVVEQWV DRDFWRADLQ LLPLPANSTL QIKRDRHRGR QKFFYRSCLI HLKTLNRTWV ALHDSDEYLL YNHAGGDRYE AWEARMQKRH DNSAQHATSA RIKPSHPPPP TPGEEGGMIR YIRQEQAAGV EYYQSPCIGV PRLSFGAVES SRAAREAGMP QSSTTLDALQ FDTLRWHRHA PRNDFVKNAL GKVLMDVSRI DVAKSPYFMS LHRPIKSICT PPWHNDWTSG LRINHYLGSW ESYAFRDDSR RGGERSREQW EYKATTHTDQ TDDNIRPWLQ GFVDAQGLSQ AERLLDKAGL PQGYRVANES QWNLLPEKLA KILSSDVTIA NDSKMVMFDA WVRAKYWNET FDVQETYRKV RNAAAAPAV
|
| |