Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_13093 |
Symbol | |
ID | 7201582 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 520594 |
End bp | 522381 |
Gene Length | 1788 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180847 |
Protein GI | 219120207 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCGAGTCGC AACAAGAAAA GGCGGTTGAG AAATTTCAAC GCGAGCTGCA AGATAACGTG TACAAAAACG GTGGCCAAGT CCGGGACTAC CAGGCAGAAG GCATATCATG GATGCTGTCG AATTATGTTA ATCAGCGATC TTCAATTCTG GCTGATGAGG TATGTAGATT TTTTTTTAAC AAGTACACTT CAATTGCTGC GCATTCGCTT ACTGCCTCTT TTCTCTCTTC AAAAGATGGG TCTCGGGAAG ACCCTCCAAA CTGTCGGGAC TGTCAACATT ATGGCGACAC GCCTTAACGG AAACGGTGTC TTTCTGGTGA TCGCTCCGCT GTCGACTCTT TCACATTGGG AGCGGGAATT CAAGAGGTGG ACCGATCTAA ACACGATTGT GTATCATGGA TCAGGTGACG ACCGGCGGCT AATAAGGGAG CTTGAATTCG CATACGAAGA TGACCGACCA AAGAATACGG TTGGTTTCAA TCAGCTGTAT TTGAAAAAGT GCAAGCCAAG AAAGCCAGGC AGTGGTGAAT CTCCTTGGAT GGTTCAAGTC GTTATAACGA CACCCGAAAT GATTGTGGCG GACGACTTCG TTGAACTGAC GGCTGTCGAC TGGGATGCCG TGATTGTCGA TGAAGCGCAC CGGTTAAAGA ACCACAACTC GAAGCTTGCT ATCAATCTTC GAGACAATCG CTTCAAGTTT GACCATATTA TTCTGCTTAC CGGCACACCG ATTCAAAATG ATGTCCAAGA ATTTTGGACG CTACTGAATT TTATCGATCC TAATGGATTT GACGACGTTG ACAAATTTAT GACGAAGTAC GGCGACATGA AAAGCAAAGA GCGCGTTGAT GAGTTGCACG AAGAAATAAG GCCATTCATT TTGCGGAGAC TGAAAGAAGA CGTCGAGAAG AGTGTGCCAC CCAAAGAAGA GACTTTGATT GAAGTTGAGC TAACATTGTC CCAGAAGCAA TACTACCGAG CATTGTATGA AAAGAATGTA AAATTTCTCC ACAAAAACAA CAAAAAGGCT CTTGACGGTC CCAGTCTCAA CAATCTTGCG ATGCAGCTAC GAAAGTGTTG CAATCATGTC TTTCTTCTCA AGGGTGTTGA AGAAGAGTTT AGGAATAAAG GAAGTCTTAC TCTTTCGGAA GCTGATTTTC TCGTTCAAGG ATCGGGAAAG CTGATCCTGC TAGACAAGCT ACTCCCTCGT CTCAAAAGCG AGGGACACCG AGTACTCGTG TTTTCACAGT TCAAGATTAT GCTCGATATT CTTGAGGATT ATTTTTCAAT GCGGGAAATG AAGTTTGAGC GCATTGACGG TTCCATCACT GGTAAACGGC GACAGCAAGC AATCGATAGA TTCCAAGCCC CAGAGATTGA TGGGAGAAAA CCACCGTTTA TTATGATGCT TTCTACAAGA GCCGGTGGTA AGTGTGTTTC AATTGTGTAG CGATGAGTAG CTCAAGAATC TGCTCACGCT CTACCTTTTA CTTCTAACCT AGGGGTCGGA ATCAACCTGA CTGCTGCCGA CACTTGGTAG GTTTTCCACC ACGATTATCC TCGATTGAAG ACGCAGCATT CTTATTCCTA ACATTTTGGT TTGGTCCTTT CAGCATTATT TTTGATTCAG ACTGGAACCC GCAGAATGAT TTGCAAGCCC AAGCGCGATG TCACCGAATT GGGCAGACGA AGGAAGTCAA AGTATATCGC CTCTTGACAA GAAAGACGTA CGAAATGCAA ATGTTTCATA TGAGCTCGAT GAAAATGGGA CTTGATCAGG CTGTCCTT
|
Protein sequence | SESQQEKAVE KFQRELQDNV YKNGGQVRDY QAEGISWMLS NYVNQRSSIL ADEMGLGKTL QTVGTVNIMA TRLNGNGVFL VIAPLSTLSH WEREFKRWTD LNTIVYHGSG DDRRLIRELE FAYEDDRPKN TVVVITTPEM IVADDFVELT AVDWDAVIVD EAHRLKNHNS KLAINLRDNR FKFDHIILLT GTPIQNDVQE FWTLLNFIDP NGFDDVDKFM TKYGDMKSKE RVDELHEEIR PFILRRLKED VEKSVPPKEE TLIEVELTLS QKQYYRALYE KNVKFLHKNN KKALDGPSLN NLAMQLRKCC NHVFLLKGVE EEFRNKGSLT LSEADFLVQG SGKLILLDKL LPRLKSEGHR VLVFSQFKIM LDILEDYFSM REMKFERIDG SITGKRRQQA IDRFQAPEID GRKPPFIMML STRAGGVGIN LTAADTCIIF DSDWNPQNDL QAQARCHRIG QTKEVKVYRL LTRKTYEMQM FHMSSMKMGL DQAVL
|
| |