Gene Information |
Locus tag | PHATRDRAFT_31846 |
Symbol | |
ID | 7196149 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1075504 |
End bp | 1077251 |
Gene Length | 1748 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177216 |
Protein GI | 219110929 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
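The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The helper names `span_length` and `coding_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1075504
END_BP = 1077251
PROTEIN_LENGTH_AA = 552


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon not counted in the protein length."""
    return (protein_length_aa + 1) * 3


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

print(gene_bp)           # 1748, matching the listed Gene Length
print(cds_bp)            # 1659
print(gene_bp - cds_bp)  # 89
```

Under these assumptions, the 89 bp difference is consistent with the gene being marked as spliced: it is presumably non-coding (intronic) sequence inside the gene span.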
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCCA CGAGAAGGGA ACGTCTGCTT CCGCAGTTAC GAGATGCTCT TGCGTGGACT GCGGTGCTAC TACTACTGGG TTTACTACCC TTCGGCGCCG CGTCGACGAC GGACGATTGG ATGCGTTTGG ATTACGCCGT CTTTCCGCGA CGACACGTCG ACGAAGCGGC GGTGCTGGAA TGGGGACACT CGGCCATTAT TTCCGCACTC GACGCATCCG TCGCCTGGTT TGGTCCCCAA ACCTCACAGG CGGCTCTACT TGAAGTCGAA GCCCAACCCG TATTGGCCTC GCCCGTACAC GGCGTATCCA ATACGGTACA AGAAGCTTTG GATGCCCTCG AAGAACAAGC GAGCGACGTT GAGACGAACG AACAGGTCGC TAATTGGAGG AAAATCGTCC TGGATAATGC CGACGAAGTC CAAGGCAACG TCGTCGTCAT GACGAATACG GGAGACTTGT CGGGACCCCA AATGGCCATG CTCGCCCAAA ACAGTGGCGC GGCCGCCTTG TTGGTCGTCA ACGTTGACGA AGACCGTCCC GATGATATAT ATCGACTCGC GGCGGATCGG AACACAGTGA CGACGACGAC GAAAACAGCC TTCGAAACAA CAGCCACCAC CACGCCGCAC TCGCCGCACT TATTGACATT CCCACCGTCA TGATCTCCAT GAACGCCGCC AACGTACTTA CCACCGCCAC GGTGGATCCC AACGCTTCCT CGCGGCGACG CGTAGTCAAT CACGGAATGC CCGATCGAGT CCGTTTGTAC GCCGGCGGGG ATCGACCCTT CTTCGAAGAC GCCCAGGCCG AATCCCCCGC CGTGTATCTC ATTCACAATG TTTTGACTCG CGAGGAATGC AAAGCCCTCC AAACGCGGGC GTCCACTCGT TTGCAGCCAC TAGAGGCGAT TGCGAGTACG GCTGGTGGTA CACGCAGTCC ACTCCAGTAC ACTACCGCAT CTTCGTTGCG CGGAGACAAA AGTGGCGGTC CGTACTATGT CGGAGACGTA TCACGCGTCG TATTGTGGCA AGGATTGTGG CAAAGTCAGG CCGCCAAGGC CGTGGAAGAA CGCATCGAAC AAGTTACCGG ATTCCCGTCT ACTCATTACT CCGATTGGAT CGTGGATCGT TACGAAGCTG GTGCCTACGT CCGTCCCCAC GTGGACAATA TTTTGGCAGC CGACGGAACC GCCCCAATAG CCGTCCTCAC CGTCTTTTTG AACGATGATG GCGGCGACGC CGCCATTGTC TATCCGTCCG TACCCACCAA CGCAGCGGCT CAAAAACCAC TGAAAATTCG TCCCCAGCAA GGCCTGGCCG TCGTCCATCA CGTTACGGAC GATCATCATC GGATCGATAC CAACGCCGTG ACGGGAGTCC TTCCGGCGTC TACCGAGCAC GGTGATGCCT ATTACCTTGC ACGCAAGTAC ATTTATGCCA CTCCCGTCAG TACCGCCCGC CGTCTGGTGC TTCCAGCACT GTCGTTGGTA GCAGCCGGTG GGGGAAATCT GCCCAGCCTG GTTGTTCGAC TGCACGTCGC CATGCTGGAA CAGTTTGGCG TTCCGCAGGG CAACGCAAAT TTTGACAGGG TCTGTATCTT TGTCCCATTG TTGCTCGTGC TACTTCTAGT GCAGTACGTG GTCAATCGCC TTATGAACCA ACCCTCCAAG CCACCGAGCA AGTCAGCCTC CGGAACCGGG TCGAGCTCTA CAAGCGCAAA TAAGAAGCGG GACAAGAAGC AAAATTAG
|
Protein sequence | MSSTRRERLL PQLRDALAWT AVLLLLGLLP FGAASTTDDW MRLDYAVFPR RHVDEAAVLE WGHSAIISAL DASVAWFGPQ TSQAALLEVE AQPVLASPVH GVSNTVQEAL DALEEQASDV ETNEQVANWR KIVLDNADEV QGNVVVMTNT GDLSGPQMAM LAQNSGAAAF LRNNSHHHAA LAALIDIPTV MISMNAANVL TTATVDPNAS SRRRVVNHGM PDRVRLYAGG DRPFFEDAQA ESPAVYLIHN VLTREECKAL QTRASTRLQP LEAIASTAGG TRSPLQYTTA SSLRGDKSGG PYYVGDVSRV VLWQGLWQSQ AAKAVEERIE QVTGFPSTHY SDWIVDRYEA GAYVRPHVDN ILAADGTAPI AVLTVFLNDD GGDAAIVYPS VPTNAAAQKP LKIRPQQGLA VVHHVTDDHH RIDTNAVTGV LPASTEHGDA YYLARKYIYA TPVSTARRLV LPALSLVAAG GGNLPSLVVR LHVAMLEQFG VPQGNANFDR VCIFVPLLLV LLLVQYVVNR LMNQPSKPPS KSASGTGSSS TSANKKRDKK QN
|
| |
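The sequences above are printed in space-separated blocks of ten characters. Before measuring length or GC content, the spacing has to be removed. The following is a minimal sketch of that step, not the database's own method. The function names `clean_sequence` and `gc_content` are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence as displayed in the record.
fragment = clean_sequence("ATGTCGTCCA CGAGAAGGGA")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.55
```

Applying the same two functions to the full Gene sequence field would give values to compare against the listed Gene Length and GC content. A short fragment like this one is not expected to match the gene-wide figure.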