Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47857 |
Symbol | |
ID | 7202987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 257480 |
End bp | 258688 |
Gene Length | 1209 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182357 |
Protein GI | 219124116 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTTGA TCATGAACGC GCAGAATCCC AGCAAGAACA AACCGACGAG GAAAGCTGTG CAGCCATCGC GAGGTAGTTA CAGTAAGCTG GCTTATTGTT TTTTGGGCGC GTGCATGGTC GGGGCTATGG TCCTAAATAT GTCCCACCTG CGGAACGTGC AGGAATCCTC GCATCCTCAA TCCGCACTAC CACGACGAAA CGCTAAAGTG GTTCTGAGGA GAGAGTCCGT AAAGAAAACT GTCGCTTCCT TGAAAAAGGT TGTGGCTCCT CAGAGTCCGG CCCTGTCCGA TAGGAAACCG TCGCAATCGT TTATTCCCAT TGCACCTCGA CCAACGGATC CTGCCAAAAA TCTCCTCCAG GCTGGGGACT ACATTTACTA CCAGGACCCC GCTATACCTC GTTGGGATGC GGCACCGATT GTGGTTGAAA GCCATAAGTT GCTTTTTTTC ACCACGCCCA AAGTCGGATG CACGGTTTGG AAACAGCTTT TCCGTCGTAT GATGGGCGCA AAAGACTGGA AAAGTCAGGA CGCACAATCG CTCCTTCCAC ATAATCCGGA AGTCAACGGT CTCAAATACC TTTACGACTA CCCCTTGGAA GAAGCCGACC GCATGATGAC CTCGCCTAAA TGGACTCGGG CAGTCATGGT TCGTGATCCC AAGCAGCGAT TTCTGTCGGC CTTCTTGGAC AAGGCCGTCA GCAACCGCCA CCAACACATT CAGCACCGCT GTTGCCCGGA CCAGGCGTGC ATAGCAGACG CCCAGACATT AGCAGGGTTT CTCCGGCTGT GTGAGCGTTG CGACGATGAG CATTGGCGAG CGCAGAATGC CCGCCTTGAT TCCAAATTTT GGCCATATAT GGACTTTGTT GGCCACGTAG AAAACTCGGC GGCCGACGCG CAGGCATTGC TGACTCGCGT TGGCGCTTGG GACGAATTTG GCGCCTCGGG GTGGGGAACC GACGGCACCA GTGCAATTTT TCAGTCCAAA GGATCCGGCG GTGCGGGTAC ACACGCAACC TGGTCTCAAT GGAAGGTGTG GCAATGGTAT ACCCCGGAAA TCGAGCAACA AGTGGAGGAT TTCTTCCGTG CCGATTTCGA AAATCCTTTG TTCAATTTTA CTCGAGGTGA ATGTTTGACC TGTCTCTCCG ATGAAGATAA AGCCAAACTT GCGGCTGAAC AAAAGAAGTA GAAAACGGTT TAGGGCGTA
|
Protein sequence | MVLIMNAQNP SKNKPTRKAV QPSRGSYSKL AYCFLGACMV GAMVLNMSHL RNVQESSHPQ SALPRRNAKV VLRRESVKKT VASLKKVVAP QSPALSDRKP SQSFIPIAPR PTDPAKNLLQ AGDYIYYQDP AIPRWDAAPI VVESHKLLFF TTPKVGCTVW KQLFRRMMGA KDWKSQDAQS LLPHNPEVNG LKYLYDYPLE EADRMMTSPK WTRAVMVRDP KQRFLSAFLD KAVSNRHQHI QHRCCPDQAC IADAQTLAGF LRLCERCDDE HWRAQNARLD SKFWPYMDFV GHVENSAADA QALLTRVGAW DEFGASGWGT DGTSAIFQSK GSGGAGTHAT WSQWKVWQWY TPEIEQQVED FFRADFENPL FNFTRGECLT CLSDEDKAKL AAEQKK
|
| |