Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37038 |
Symbol | |
ID | 7202073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 20667 |
End bp | 21773 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181118 |
Protein GI | 219121531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.313227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCCC TTGCTGTTTC TGCTGCCCGA TCCATTCGAA AAGAAGTCGT AAATCCACGT TGGTGGAGTG GGGCCTACGA TGTGATTGAC ACTAGTCGTT TCATTATTGA TGGTAAAGGG AAAGTCTGCA AGAGTGATCT ACTGCCTACT TCCTGGCTTG GACAAGAGAT TCAGGAGAAA TATGACGACG TGGGTGTGCT TCATCTTCAG AATACTGGGC TTGTGGACAT GGCCGATCAG CGGACACTAG CTCGAATCAT CATGGGGGAA GAAACCGAGT ATGAAGGTGG CGCAAACCCT CGAGGAAGAG CCGAAGGCCT AGCGAACGTC TACGATATTG GTGCCCCTCT AATGGCAGAT CTTCACTACC ATCATGAAAT GACGTACAAG TCCCATTCGG TTACAAGCTT AGGCTTCTTA TGCAAGCACG CCGTAACGAC CAGACCAGGC GTTGGTTGGA GTTTTGTTTC GGATAGTGTC CAGGCTCATG ACTATATCAT GCAAACAGAG CTGGGCCAGA AGCTGAAGGA GAAAGGCCTC TGTTTCTTGA GGCGCATGAC AGATGCGGAG GACAAGCACA TGTTGGACCG AAACAAGCAA GGCTCAGTTT ATAACCACTG GCAACAGTCT TGGATGACCA GTTGTCCTCA AGAAGCTGAG GCTCGAGCTA ACGCACAGGG TTTGCAAGTG GAGTGGCTTG ACGATAAAGA AGATGGTCGA ATTATGCAGA CCCGATACTA CAAGTCTGCT TTTGAGTATA TTTCGTTCCT CGACCGCAAT ATCATGGTCA CTTCTATTGC TGATGATGGA GAATGGTTCG ATTCTTGGCC CGGAATTATG GATATTCCCC AGGAAAAGCG TCCTCTGGAG ATGCTTTTTG GGGACAACGA ACCATTTACT TTGGAAGAGA AACAGCTCTG GACAGATATC TATGGCATGT TTGGTATTCC AATCACCTGG AAACCAGGAG ATGTCGCTGT GGTTTGCAAC ATGCGTTTTG CCCATGGTCG TCCAGGCATA GAATTGCTCC CTGGCGAGAA GCGTGAGCTT GGAGTCATGC TCGGACCATT TTACGAGCGT ATGGAGACTA GAGAGGACAA GTGGTAA
|
Protein sequence | MAALAVSAAR SIRKEVVNPR WWSGAYDVID TSRFIIDGKG KVCKSDLLPT SWLGQEIQEK YDDVGVLHLQ NTGLVDMADQ RTLARIIMGE ETEYEGGANP RGRAEGLANV YDIGAPLMAD LHYHHEMTYK SHSVTSLGFL CKHAVTTRPG VGWSFVSDSV QAHDYIMQTE LGQKLKEKGL CFLRRMTDAE DKHMLDRNKQ GSVYNHWQQS WMTSCPQEAE ARANAQGLQV EWLDDKEDGR IMQTRYYKSA FEYISFLDRN IMVTSIADDG EWFDSWPGIM DIPQEKRPLE MLFGDNEPFT LEEKQLWTDI YGMFGIPITW KPGDVAVVCN MRFAHGRPGI ELLPGEKREL GVMLGPFYER METREDKW
|
| |