Gene Information |
Locus tag | PHATRDRAFT_43232 |
Symbol | |
ID | 7196953 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2397734 |
End bp | 2399790 |
Gene Length | 2057 bp |
Protein Length | 568 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177506 |
Protein GI | 219111509 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.332386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAC GACTCCAAGC TGTTCTCTTT TTTTGGGCTT CCAAGAGAGG TATGCCACTC ACTGTCATCT TCCCATTTCC TGTGTATCCG CTTCCGATCC GCCACCGACC GAACTCTTAC AAACAAAGAT TCATTGAACC TTTCAAAGGA CCTCGGTGCC GACATGGAAG AGGCTATGGA AACAGGCGAC ATTTTCCTCG GCACTGTCGT CGCTGTCTTG ACAATTGTCG GGATCAGTGT AACCATTAAC CTGTTGAAGT TCTTGTGGAG ATCGTTCGCC GCTCAGGTAG GTTCAACGAT CTATCCCTGA ATAGTCCGGT ACCGACGGCA CTTTCTCACA TCTTCTATTC CAATGAATTT TGTACAGCCC ACAACGGCCA AGATCCGTCA GGTCTTCCCG GGTGCCCAGA CGAACGATCA GCTCGTCGAA ACGATCAGAT CCAGTCTCGA AAAGTTCGGA TTCGGAGAAA ACTCCCTCAT TGCTACCTCC TTGTGTTGCG ACGAAGTCAA CCGTCCCCTC GATAAGGCTC TGTCGGAGAC CTACGGTAGC TACTTCTCCA TGGGAGGTCT CGCTGGCTTC CCCTTTGGAG GTCTGACCTC CTTCGGAGCC ATGGCTGCGC ACATCCCCGA CGGGGGCTCT TGCGTTGTGG TGTACGGGCC TCACGTTGGT GTGGACTCCA AGGGTAACGT GGGTACCGTT GAGCGCCGCG GACGTCAGAA GGGCGGATCT TGCTGTGGAT CCGGTGTTGC CGCGGCTGGC TTTGTGAAGT CTTGCCTTGC GGGTGACGCC AAGCCCCCCG GCGCCCCCTC GGACCCCCTG GACGCGCAGC AGACGTTCGT GAACTCTATG CTCCTTCCCC ATGGAGCCCG TCTGAACTCC GCAGAAGAGC CCATGGTCGA GCTTCCGTAC GCTTTGTTTG ACGCCCAGGA CGAGTTCATG CGCAAGATCA TCGAGAAAGG ATCCGGTAAC GTGGCAGGAA ACGGTCGCAT TGCTCTGTTG GGAGGAATCC AGATCAACAC CCCCGCCGAC CAGCCCGACT ACTTTTTACC ACTGCGCTTT GACGTCCTGT CGAACAAGGG CGAGACTATT GAGAAGATTA TTGATTCTCC CTCGCGCGTT ACCGCTACAA AGATCTCCAG TGTGTTCCCC AACGCGGTAC CGAACGAAAA GCTCCTCGCC AAGATCAACA GCACACTGGG CTGCTATGGG TACGGCAAGA ACTCTCTGGT TGCTACCTCG CTGTGCTGTG ACGAAGTCAA CCGTCCTTTG GAAGATGACC TCAAGGCCGC ATTCGGCGAA AACTTCAACA TGGGCGGACT CGCCGGCTTT GCGTTTGGAG GTGTCACCAG TTTCGGTGCC ATGGCAGCGC ATATTCCGGA CAGTGGCTCG TGTTTGGTGG TATACGGGCC GCACGTAGGT GTCGACTCGA ACGGCAAGGT GGGAACGGTC GAACGACGTG GACGGGCGAA GGGCGGGTCT TGCTGTGGAT CTGGTGTCGC CGCGTCAATG TACGTCAGAT CGGTGCGTAA TGGCGGGGAA GAAGCTGCTC CGCCTACGGA TCCACTCGAC GCGCAACAAA GCTATGTTGG CACTATGCTA CTCCCGTATG GTGAACGCTT GGAAAATGCG GAAGACCCTA TGGTGGAACT TCCATATGCT CTTTTTGACG CACAGGACGA GCTAATGCAG AAGATTGTTG CCAAAGGCTG CTCGAACGTT GCTGGCAACG GCAAGATTGC TCTTTTGGGA GGAATTCAAA TCAATACGCC TAAAGGCATG GCAGATTACT TTTTGCCCCT 
TCGTTTCGAT ATTCGCGACA ACCGCGACGT TACCATTGAA GATTTCCTGG TAGAGACTGG TACCTAGACC TCACATTTTC CTGGCTATGA CGCAGGCCAA TCGCTATGCA TGGAGATGCC ATTCTCCTCC ATCTTACCGG CATCGCGCTC TCCTTTGACG AAATGTTTTT TACCTCTTCA CGAGACCTTA CGAGGGTAAC CGTACTCTAT ACGCATTGTG GCAATATAAC TTTAACTAAG ACGCTTGCTG TTAGTGC
|
Protein sequence | MQQRLQAVLF FWASKRDSLN LSKDLGADME EAMETGDIFL GTVVAVLTIV GISVTINLLK FLWRSFAAQP TTAKIRQVFP GAQTNDQLVE TIRSSLEKFG FGENSLIATS LCCDEVNRPL DKALSETYGS YFSMGGLAGF PFGGLTSFGA MAAHIPDGGS CVVVYGPHVG VDSKGNVGTV ERRGRQKGGS CCGSGVAAAG FVKSCLAGDA KPPGAPSDPL DAQQTFVNSM LLPHGARLNS AEEPMVELPY ALFDAQDEFM RKIIEKGSGN VAGNGRIALL GGIQINTPAD QPDYFLPLRF DVLSNKGETI EKIIDSPSRV TATKISSVFP NAVPNEKLLA KINSTLGCYG YGKNSLVATS LCCDEVNRPL EDDLKAAFGE NFNMGGLAGF AFGGVTSFGA MAAHIPDSGS CLVVYGPHVG VDSNGKVGTV ERRGRAKGGS CCGSGVAASM YVRSVRNGGE EAAPPTDPLD AQQSYVGTML LPYGERLENA EDPMVELPYA LFDAQDELMQ KIVAKGCSNV AGNGKIALLG GIQINTPKGM ADYFLPLRFD IRDNRDVTIE DFLVETGT