Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49637 |
Symbol | |
ID | 7198293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 271719 |
End bp | 273360 |
Gene Length | 1642 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184334 |
Protein GI | 219128258 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCCTTC TTCTCTATCA AGTGGAGACC TGTGTATGTA AATGTTCAGT CTTTCCTTTA GATAAAAGAT CGAAAGGAAC GAGCGCGAAT GAGGAGCACT CACTCATTGG TGTCGAGGTG CAAACGCACG CTTTCTTTAA GTCGAGAATG CCAGCTCTTG CTGCTAGCTG TTGCAGTAGC TATGCCGCCT TCCTTTTTCC ATGACCGCAA TGTCTTCGTG GTTCACGCAG TCAACGTGAA GCCAACTTGC ATTCCAAATT CTTTGCGGGT TCCGGAAAAA GTCACGCACC GATCCCACAA GCTGATTGTG GCTCACCGTG GAGCCTCTTA TCATCTTCCG GAGCATTCTA TAGCAGCTTA TCGTCTGGCG TTGGAACTCG GGGCCGATTG GATCGAACCT GATATTATCG CAACCCGGGA TCGACAATTG ATCGCCATGC ATACGGTCGA TTTGGGTGTC ACGACGAACG TTGCAGACAT TTTCCCCGAA GATCGCCGAT GGTTCTCGCC GTGGGCCAAT GCATCGTCCT ACTGGGCCTT CAACTTTACC TACGATGAAA TATCGCGTTT GAGACTACGG CAGCGCTTAC CACAAGCGCG GACGACCGCA TTGGATAACA TGCTTGTCGT TCCGCATCTG AACGATGTCC TAGACATGTT GGTACAGTGG AACGAGGTCG ATCTAGCCCA AATCCTGAGA CAAAGCACTA CAACAAACAC GAGTAGCATT AGCGAGTACA ACAGCAGCAA AACGAGCAAA TATCCGACTG CTTTGAATCT GAAACAATCA GGTTTGTACA TTGAACTTAA AGATGCCGCG TGGATTCAAG CTGAGGCCGA CTTGGATTTG GTAGACTTGC TCTATGAGCA CTTTCGTGAG GAGCAGGATC GGTGGGACAA CATTTTGAGG TGCTGGAACG GCATTCGTTT TGACCAATAC ATTGTTCCCG GCTTAGTTAT TCAGTCCTTC GATGGTGACG TCCTTCGAGC CTTTCACAAT CGCTGGTCGT CTGTATTCAA TAATACAGCC GGTACCTCCA AGGCGGAACC AAACTATGTC TTACTGGCAT CGGAAGGTCA GTGCGCCGAC GAAATGTTCT GGTTGAACGT GGGAGATCAG TATCGCAGCT TTATCCACGG GATTGGTTTG GAAAAATCCT GCTTGTTAGA CTCAAGGTTC TTCGCTGAAG AGCTTTCTCC AGTGGTGCGT GCAGAGGAAT TTAACCTTGC CTTGCATCCG TGGACATCTC GGCCAGAAAT TTCAGAAGTG AACATACAAT TCAACTCCGC TTTTGAGGAA ACACAGTACC TTTTCTGTAA GGAAGGCGTC CATGGGATCT TTTCTGAATC CGTGGCCTCC GCGGTACTGG CTGCCCGGTT GGGCTGTAAC AACAATGATG GCGCGTGGCA GCCACCACCG GTGCCTTCAC CAACAGCGAA TCCCAAGTCC GACAATTTGT GCTACAATGA CCCCAGCGAC TCCATTTTTT ACGTTGGGGT CGCATGCTTT GTAGTGGGTG GAATTGTGGC AGCGCTTTTG TTTTTCGCAG CATTGCGTAG CTTTCCGAGC CGTCGTGCTG GGAGTGGCAG GCGTGCAATT CCGACTACGG AAGACACGCT TTACGATCTG GAATTGACAT GA
|
Protein sequence | MRSTHSLVSR CKRTLSLSRE CQLLLLAVAV AMPPSFFHDR NVFVVHAVNV KPTCIPNSLR VPEKVTHRSH KLIVAHRGAS YHLPEHSIAA YRLALELGAD WIEPDIIATR DRQLIAMHTV DLGVTTNVAD IFPEDRRWFS PWANASSYWA FNFTYDEISR LRLRQRLPQA RTTALDNMLV VPHLNDVLDM LVQWNEVDLA QILRQSTTTN TSSISEYNSS KTSKYPTALN LKQSGLYIEL KDAAWIQAEA DLDLVDLLYE HFREEQDRWD NILRCWNGIR FDQYIVPGLV IQSFDGDVLR AFHNRWSSVF NNTAGTSKAE PNYVLLASEG QCADEMFWLN VGDQYRSFIH GIGLEKSCLL DSRFFAEELS PVVRAEEFNL ALHPWTSRPE ISEVNIQFNS AFEETQYLFC KEGVHGIFSE SVASAVLAAR LGCNNNDGAW QPPPVPSPTA NPKSDNLCYN DPSDSIFYVG VACFVVGGIV AALLFFAALR SFPSRRAGSG RRAIPTTEDT LYDLELT
|
| |