Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_30067 |
Symbol | |
ID | 7195291 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 388277 |
End bp | 389935 |
Gene Length | 1659 bp |
Protein Length | 403 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183738 |
Protein GI | 219127011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.657092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACCATCGC ACTCCAATCC ATATCAATGT CGATACATCA CAATTAGATA CACAGACAGA CGTATCTTTC TACTTAAAAT CGTCACGTTC CTTTCGCCGC TCGTGCAATT GTTCCCAACA AAACCCTTTG TGGGAGACGA CGAAGAACGT GGTACCCCAT GTCCACCGCC GTCGCCATGA ATACCTCGCA CGACGCTACT GGTTCGAGCA CCGGCAAGTA CTACGCTTCT AAAATTGCGG AATTGCGTGA GGTACGTGTG TCTGTAGACC GATAGACAGG TATACAGATA GATAGACAGG TACTTGGGAA TGTCGGTACT GCCACTTGAT TCACCGTCCC GTCCGCATTC GTCGACCGCA CCCCGTCCCG TGCGTACCAT TCCTCACCGG TCTTGTTGCT TTCTTTCGCA CACACACACA CACATATATA TATATTCACG ATTATATACA CCTAGACGGT ACAAGAACGG AGTGCCGACT TGTTGCGTCT CCAAGCGCGT CGTAACGAAA TCAACGCACA CGTCCGGATG CTCCGGGAGG AACTCTACCA TTTGCAGGAG CCGGGATCGT ACGTGGGGGA AGTCGTGAAA CCCATGGGAC TCAACAAGAT CCTCGTCAAG ATCAATCCGG AAGGCAAGTA CATTGTTGAT TTGGACAAGG ACATTGACAT ACAAAGCTGT CAGCCCAATA CGCGCGTCGC CTTGCGCAAC GACTCCTATA CCCTGCACAA GATTCTGCCC ACCAAGGTGG ATCCCCTCGT CTCGCTGATG AAAGTGGAAG CCGTGCCGGA TTCGACCTAC GATATGATTG GTGGCCTCGA AAAGCAAGTC ATGGAAATTA AGGAAGTCAT CGAGTTGCCG ATCAAGCATC CGGAGCTTTT TGAATCGCTC GGCGTGGCCC AACCGAAAGG CGTCCTGTTG TACGGCCCTC CCGGAACGGG CAAGACGCTG TTGGCCAGAG CCGTCGCGCA CCACACCGAC TGTACCTTTA TTCGCGTCAG TGGAGCCGAA CTCGTGCAAA AGTACATTGG AGAAGGCTCC CGGATGGTGC GTGAACTCTT CGTCATGGCC CGCGAAGCCG CACCCTCCAT CATTTTTATG GACGAAATCG ACAGCATCGG GCAATCGCGT GGGGGCAGCG GGGGCGGAGA TTCCGAAGTC CAACGAACCA TGCTGGAACT CCTCAACCAG TTGGACGGAT TCGAACCCGC CCAAAACATC AAAGTCATTA TGGCCACCAA CCGCATCGAT ATTCTCGACG CCGCACTGCT ACGCCCCGGC CGTATCGACC GCAAAATAGA ATTCCCCAAC CCCAATACGG AAAATCGCAT GGCGATTATC AAAATTCACT CGCGTAAAAT GAATCTCTTG CGGAATTTGG ATCTGTACAG CATTGCGGAT AAAATGGCCA ACGCGAGTGG CGCCGAGTGC AAGGCCGTCT GCACCGAAGC CGGGATGTTT GCCCTCCGTG AACGACGGGT GCACGTGACC CAGGAAGACT TTGAAATGGC CGTCGCCAAG GTTATGAAGA AAGACAGTGA TACCAATATT TCTCTACAGC GGTTGTGGAA GTAAACCCCG GCCGTCAATG AGCCAAGGCG GTGGACGGGA ACGTCCATTC GACTGTTTCT ACTCTATAGA ATACGTTCGT GAAGCCTTG
|
Protein sequence | MSTAVAMNTS HDATGSSTGK YYASKIAELR ETVQERSADL LRLQARRNEI NAHVRMLREE LYHLQEPGSY VGEVVKPMGL NKILVKINPE GKYIVDLDKD IDIQSCQPNT RVALRNDSYT LHKILPTKVD PLVSLMKVEA VPDSTYDMIG GLEKQVMEIK EVIELPIKHP ELFESLGVAQ PKGVLLYGPP GTGKTLLARA VAHHTDCTFI RVSGAELVQK YIGEGSRMVR ELFVMAREAA PSIIFMDEID SIGQSRGGSG GGDSEVQRTM LELLNQLDGF EPAQNIKVIM ATNRIDILDA ALLRPGRIDR KIEFPNPNTE NRMAIIKIHS RKMNLLRNLD LYSIADKMAN ASGAECKAVC TEAGMFALRE RRVHVTQEDF EMAVAKVMKK DSDTNISLQR LWK
|
| |