Gene Information |
Locus tag | PHATRDRAFT_45672 |
Symbol | |
ID | 7200422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 893583 |
End bp | 895500 |
Gene Length | 1918 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179740 |
Protein GI | 219117909 |
COG category | |
COG ID | |
TIGRFAM ID | |
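The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region is one uninterrupted open reading frame followed by a single stop codon (the record lists the gene as unspliced). The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 893583
end_bp = 895500
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one contiguous reading frame, three bases per residue,
# plus three bases for the stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that the coding region does not account for
# (for example, untranslated leader sequence).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the computed gene length agrees with the listed 1918 bp, and the difference between the gene span and the coding region is the portion of the listed sequence that lies outside the reading frame.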
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CGTTTCACGA CTGACGATGA CCTGCGTCTA TTTCGGCGCG TCCGCATCCT CCTGCGGGAT GTTGCCGCTC CTCGCAATCC TGATATGCAG TGATAGAGTA ATTAACTGTA AGGCGCCGCT GCAACTGGAT TTTTTCTTTC ATTTGAGCGA AGGCAGTTCC TAAAGCTCGA CGATCCACAA AAAGTCATCT GAAAAACGCT TACAAGCTGT TTCACCTCTT CCGCTACGAC TAGCAATAAT CAGTATGCTC ACAGCGGACA AGCGGCCGAC GCAGCGCTAC GAACAAGTCG TCACTGAGAT CGATCACTGT TCAGACGCTG AGGAACAGCC GGTAGACCTT GACACAAATC ACAACATTAC GCATCTACCT TCTCCATTGC CTCTGGTAGA AAATGAAGAT CTGTTTTCAA CCGCCCCAAA TTTAAAGAGC CACAACGATA ACGACTTGGA ATACATTTCC ACTGATGTTG CTTTGGAGAG AATCGGTATG GGAGCATTTC AAATACGTAT TTTGATTGCT TCTGGACTTT GTTTTGCGGC TGATGCCATG CAGGTGATTT TACTTAGCTT CCTCTCGCTT GTCGTCCAAG ACCTTTGGTC GCTCAGCAAC GACTTGACAG CGATGATAAC CAGTTCATTG TTCGCCGGTA GCATGTTAGG TACACTGATT CTTGGTCCAT TGGCCGATTC CTGGGGACGT CGACCCGTGT TTATGCTGGC ATCTTCCATA ATTTCCTTCT TTGGTTTCGC CACCTCAGCA GCGACAAACT ATGGTATGTT GACTGCCACG ATTTTTGGGG TCGGGGTTGG TGTTGGCGGT TTAACCGTGC CGTTTGATAT TCTTGCAGAA TTTTTGCCCT CGTCGCAGCG AGGTACAAAT TTGCTTAAGA TTGAGTATTT TTGGACCATC GGCTGTCTTT TTGTAGTTGG TATTGCCTAC ATGACACTTC GTGGTGAAGT ACCCCATTGG AGATTATTTG TGGCGACTTG TTCGGTTCCT TGTCTTGTAT CCCTCGTTCT TGGATACTGC TGGGTACCCG AGTCAGCGCA ATGGTTATGT GCGGAAGGTC GTACAGATGA AGCCTTGGAG ATTTTACGAC ACGCTGCCGC TCTCAATGGC CTAGAAAAGG ATGAGGTGTT TCCCAGCAAC ACGAAGTTGT TGCAGGACGA AGACGAAAAA GACGCTTCTT TGGCCGATCT CTTTACTCCG AAGTGGAGGG AAACAACTTT GCGATTGTGG GGTGCCTGGG GTTCGTTCGC ATTTGGTTAC TACGGTACGC TCCTTGCCAT TACCAAGGTA TTTGCGGAAG CTGAAACTAT AAATCGAGTA GCGGTAGGTG ACGAAGAGCC TTATAGCTTC GATTACGGCG CCATTTTTGC TAGCAGTACA GCTGAGCTGG TTGGAACCAC AATGGTCATT TTTGCGGTCG ACAGAATTGG CCGTATCCCC TCCCAAGTGT TCAGTTATCT TATTGCGGGC CTCTCCGTTT GTGCGCTCTG TGTCTTCGCC TCGTGGGGAT TTCCTCGGTA CGCTTTGATC GGTCTGAGTT TTATAGCCCG AATCTTTGAA ATGGCCGCTA CGTGCGTTAC ATGGGTGAGT ACGGCGGAAA TTCTGACAAC CGAAGTTCGT TCCACTGGAC ATTCAACGGC CAACGCCATG GCACGCTTAG GAGCCATCTT TTGTCCCTAT CTGGTTCAAG GAAGTGCTTC ACTTACGCAA ATTGGAATCG TTATGCTGTT GGTACACTTC TTTACGGCCT TTTGTGTTTC AACATTACCG GAAACTAAAG GAAAAGGCAT 
GGGCGCTGTA TCGGAAGACC AGCCAGTGGA TTCGGCCATT AATAGGCTCA ACCTTTTGCC TCTAGAGACG AACGACTGCA ATGAAGACGA CCTTGTCATT CAAGGCGAAT TGAGTTAA
Protein sequence | MLTADKRPTQ RYEQVVTEID HCSDAEEQPV DLDTNHNITH LPSPLPLVEN EDLFSTAPNL KSHNDNDLEY ISTDVALERI GMGAFQIRIL IASGLCFAAD AMQVILLSFL SLVVQDLWSL SNDLTAMITS SLFAGSMLGT LILGPLADSW GRRPVFMLAS SIISFFGFAT SAATNYGMLT ATIFGVGVGV GGLTVPFDIL AEFLPSSQRG TNLLKIEYFW TIGCLFVVGI AYMTLRGEVP HWRLFVATCS VPCLVSLVLG YCWVPESAQW LCAEGRTDEA LEILRHAAAL NGLEKDEVFP SNTKLLQDED EKDASLADLF TPKWRETTLR LWGAWGSFAF GYYGTLLAIT KVFAEAETIN RVAVGDEEPY SFDYGAIFAS STAELVGTTM VIFAVDRIGR IPSQVFSYLI AGLSVCALCV FASWGFPRYA LIGLSFIARI FEMAATCVTW VSTAEILTTE VRSTGHSTAN AMARLGAIFC PYLVQGSASL TQIGIVMLLV HFFTAFCVST LPETKGKGMG AVSEDQPVDS AINRLNLLPL ETNDCNEDDL VIQGELS
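The relationship between the gene sequence and the protein sequence can be spot-checked by translating short in-frame stretches. The sketch below is illustrative only. The Translation table field in this record is blank, so the standard genetic code (NCBI translation table 1) is an assumption here. The two fragments are copied from the gene sequence above: the first begins at the first `ATG` whose translation matches the start of the listed protein, and the second is the final 15 bases. The function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 36 coding bases, copied from the gene sequence (spaces as in the record).
orf_start = "ATGCTC ACAGCGGACA AGCGGCCGAC GCAGCGCTAC"
# Last 15 bases of the gene sequence, ending in the stop codon.
orf_end = "GGCGAAT TGAGTTAA"

n_terminus = translate(orf_start)
c_terminus = translate(orf_end)
print(n_terminus, c_terminus)
```

The two translated fragments can be compared against the first twelve and last four residues of the protein sequence listed above.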