Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_27877 |
Symbol | |
ID | 7201627 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 106019 |
End bp | 108553 |
Gene Length | 2535 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180943 |
Protein GI | 219120408 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACCCGTAG AGTTGTTCTC TCCTCTGTGA CACACTCATC CTCTCAGCAT TCACCATGTC TTTTGATTTG GACGCATTCT GCACTGGCCT GACAGCCGCG TCGAGCTCTT CTGAGCAGGC TGTGTGCGCT CTGCAAACGG TAAGTCGCGG CAAAACTTCT GACATGCTCC TATCTTGGCA CTGCAATGAG AAAGATGGAC CTGGCTACTA TCCGATACCG TAATGCTACA TTCATGCAGT CTACTCTTTG CAGCAACCCT TCCAGTTATT GTCCTGAGCA TCCGCCTCGG ACTCCTAAGC GGAATAAGCG GGGTTGATCT ACACGTTGGT CGGTCTGTCT TTGACTGCGA TGCGTCGTCG ATAATTTTGA TCAAAGGCTT GTTGTTTAAA TTTGCTTGTG TGCCACCTAT ACTCACTTGG ATGCCTTTTT GTTTGCACTC TACTTCTGCA GATCGTGGCC GGAGTTTCTA AGACTGTCGG AGGTATTGAC GCGGAAGGGA TTACTGCAGG TGTTGATACT TTCTTCCTCA TTTTTGCGGT ACGTATTTTG AATTACCCAC TGGCATTTCG ACCCTGCAAA TGTATTTGTT CGTTCGATCT TTGAGGCCGC TTGTGAAGAT TTTCGTCTTC CATCATGATG AGAAAGAGTT TTTCTTTCAT TCGATGTGAG AAAAAGGATC CCACGTCGAC TATGTATTGG TACCTCGTGT ATAGTGAGAA GTTCGATCGT TCTTGACAAG TTCCGATCTA CTTTGCGGAA GTTTTGGGCT CATGTGACAT TGTTGTGTTC GACGGGCGTG GAAGGTATCG AGTTCGCGAT GTTACTTAAC AACTTGTGAC GCAACTCGTC TTTTTTGACA CAAAGTTCTT TAAGTCCGCA ATGTCTTGGC GTATGCCGTT ACGAACTTCC GTTTCTGACG CGATTGCCTT CTCGTCCAAT CACTACAGGG TGCCTTGGTC TTCATGATGC AGGCCGGGTT CGCCATGCTT TGTGCTGGAT CCGTCCGTCA AAAGAATGTA AAGAATATTA TGCTCAAGAA CTTGTTGGAT GCCTGTGGTG GTGCTATTGG CTTCTACACC GTTGGTTTCG GCTTCGCTTA TGGCGGTGAC GACACCACCG ACAAGACCTT CATTGGCAAC AGCTACTTCG CGCTCCGTGA TTACACAAAT TATGCAGGTT TCTTCTTCCA GTTTGCGTTT GCTGCCACTG CCGCCACGAT TGTTGCCGGT ACAGTTGCTG AGCGATGCAA GATGTCGGCA TACCTTTGCT ACTCTCTCTT TCTTACGGGT TTCGTCTATC CCGTCGTTGT ACGCTCTGTC TGGAGCTCCA ACGGGTTCTT GTCAGCCTTC AGTGCCGACC CCTTCCAAGG AGTTGGAACC GTTGACTTTG CCGGATCAGG TGTGGTGCAC ATGACTGGAG GACTCACCGC CTTGATTGCT GCCATTGTTC TTGGACCGCG TAAGGGTCGG TTCTACGATG AGGATGGCAA CCCTCTGGAG ACGCCCGCCA GCTTCCCAGC CCACTCTGTA GCCCTCCAGA TCCTCGGAAC TTTCATCTTG TGGTTCGGAT GGTACGGATT CAACCCTGGT TCAGCCCTGA AGATTGCTAA CGCCGATTCG GCCGCAACCG CCGCTTTGTG TGCCGTCACC ACCACTATGG CCGCCGCTGC CGGTTGTGTT TCCGCCATGT TCACTGACTC GATCATTGAC GGCATGGCGA CCGGTGAAAC TACGTACGAT CTGACCATGG CCATGAATGG ATGCCTTGCT GGTCTCGTTG CCGTCACTGC TGGTACATCT GTCGTCACCC CATGGGCCGC AATCATTATT GGAGTCGTTG GAGGTTGGGT CTACATTGGT ATGTCCAAGC TTTTGATCAA GCTCAAGATT GATGACGCTG TCGATGCCAT CCCTGTCCAT TTCGCCAATG GTTTCTGGGG TGTCCTAGCC ACCGGCCTTT TCGCCAACGG TGGATTGATG GCAACCGCTG GGTACAACTC GGAACACGAG GGCTGGTTCT ACGAATGGGG AAGTGGCTCC GGAGATGGAA GTCTTCTCAT CTGCCAGCTT GCTTGCCTCG CCTGGATTAT TGGATGGGTC ACCACCATTA TGACGCCCTT TTTTATCCTT TTGAACATGG CCGGTATGTT CCGTGTGGAC CCGCTTGAGG AAGAAGTTGG TCTTGATATT TCCCATCACC GTGGATCTGC TTACGATCTT TCGGGACCCA GCAAGGACCA TGTTGACGAG CTCATGGAAA TTCGTGCCTC GAAGCACGGC AAGGTTGAGG TTCCAAAGGA GGTTGCGCAG GCTGCTGATG ACGCCGCCGA AGAGACTGCT TAAACGGTTT AATTGCTGCT CGTGTGGGCT TTCAGCCCAT CTGATTCATG TTTATATCGT TTCTTTGGTG TAGCGGCTTT GAATTTTTGG TTTTGGGATA GACGATGGGA CAGTTTTAGT TTGTTTTTCC AGGTTTTTGA TCATATGTCT ATTTCTCCCC TAAACAGTAT TTAAAGTCCG GTTTGTCGAT ACCTT
|
Protein sequence | MSFDLDAFCT GLTAASSSSE QAVCALQTIV AGVSKTVGGI DAEGITAGVD TFFLIFAGAL VFMMQAGFAM LCAGSVRQKN VKNIMLKNLL DACGGAIGFY TVGFGFAYGG DDTTDKTFIG NSYFALRDYT NYAGFFFQFA FAATAATIVA GTVAERCKMS AYLCYSLFLT GFVYPVVVRS VWSSNGFLSA FSADPFQGVG TVDFAGSGVV HMTGGLTALI AAIVLGPRKG RFYDEDGNPL ETPASFPAHS VALQILGTFI LWFGWYGFNP GSALKIANAD SAATAALCAV TTTMAAAAGC VSAMFTDSII DGMATGETTY DLTMAMNGCL AGLVAVTAGT SVVTPWAAII IGVVGGWVYI GMSKLLIKLK IDDAVDAIPV HFANGFWGVL ATGLFANGGL MATAGYNSEH EGWFYEWGSG SGDGSLLICQ LACLAWIIGW VTTIMTPFFI LLNMAGMFRV DPLEEEVGLD ISHHRGSAYD LSGPSKDHVD ELMEIRASKH GKVEVPKEVA QAADDAAEET A
|
| |