Gene Information |
Locus tag | PHATRDRAFT_50149 |
Symbol | |
ID | 7198850 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 196188 |
End bp | 197874 |
Gene Length | 1687 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185065 |
Protein GI | 219129793 |
COG category | |
COG ID | |
TIGRFAM ID | |
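
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 196188
end_bp = 197874
protein_length_aa = 535


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def coding_length(protein_aa, include_stop=True):
    """Bases needed to encode the protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)
noncoding_bp = gene_length - cds_length

print(gene_length, cds_length, noncoding_bp)  # 1687 1608 79
```

Under these assumptions the 1687 bp span exceeds the 1608 bp needed to encode 535 residues plus a stop codon by 79 bp, which is consistent with the record marking the gene as spliced.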
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACTA CAACAAAAAA GCAACTCGAA ATCGATGAGG TCCAGGAACT AACAGAGGAA GAAATTAAAG CTCCAGGTTG TGGAATTCTG GGGCGTTACC CAGTTCTTTC CGTCCTTATT TTTGCATCGG CTGGCATTGG AATTGGTCTT GGCCTCAGTT TTTGGGAACC AGATGACGAT GACGATACAA AGGATAAAGT CATCAAGTGG CTCGGTCTCG TTGGCGACTT ATTTATTCGC TCACTAAAAT GCGTCGTTTT ACCATTGGTA TTTATCAACG TGATTATATC TGTGGTGGAT ATGATGAATG TGGGACGAGC TGGATCTATT GGTTGGAAAA CAAGTAAGTA CTGAGTCCAG ATATCATAAG AGTAGAATAA TATTACGCTT TCGCTTGCTG ACTCAGATGT TGATTTTGAC AGTTGTGCTT TATCTGCTGA CTACCGTCAT CGCGTCGATC CTTGGCATTA TCTCGATTGT CAGTTTCAAA GGCCTTTTCG AGGAAGGAGA ATTCGAGGAA GCCGTTCCTG CATCAGTCAA GCTTGGGTGT AACCAAGATG GAGAATTTCT CACTGAAAAT GCCAGCGGCG CTATTTCGTG TGCGGCAGAC TCGGGGGAAA GCTCTGAATT CTTTATCACA GATGTATCCA TGAGCTTTGT CCGTGCCTCC GGTAGTGTTC GTGATGACAT TTCTTTGAGC GATACGGTAT ATGATGGCGT TTTTACGAAA CTAGTCACAG CTAACATTTT TGAGTCGTTT GTTGAAGCCA ATTTTGCTGC TGTCGTCTTC TTCGCCATCG CTTTTGGGGT GGCAATCAGT CGCGTCTTTG ATCAGGGTGG TGGTCCCGAC AAGAGTTTCA TTCTACCGTT TCTCAAGGAA CTGGACGGCG TATTCCTTAC GATTATCAAC TGGATCATTA TGATTACTCC GTTTGCAGTG CTTTCTCTAA TTTCCTCGGC GATTGGAAAG CAGGAAAATC TTGCGGACTC CTTTTCCAAT GTGGGATATC TCGTGGTTGC CACAATGATT GCGATGTTCT TTCAATTTTT GGTCGTTCAC TGCCTTCTTT TCTTTATTGT GACGCGCACT AACCCCTTCG AGTACTTAAA GCATCTGATT CCGGCGCAAA CAATGGCATT TGCATGTGCC AGTAGCGCAG CGACAATTCC AATGACTCTC AAGTGTGTGC GCCAAACGGA GCGGGTACCC GAGCCCGTGG CTCGTTTCGT TATTCCTCTT GGGGCGACAG TCAACATGGA CGGTGGAGCA ATTTATTTCC CATGTGCGTG TATATGGCTT GCTGTGCTGA ACGGTATCCA ACCAGATGCT GCTTCCTACC TTCTATTGGT TATTATTTCA ACGATCGGCA GTGCAGGCAC AGCGCCAGTG CCTTCGGCCA GCCTCGTGCT TATTATCACG GCTTACAATA CTGTCTTTAA CACCACCGGA GTTCCTGAGG GGTTTTCTTT CATTTTGGCG ATCGACTGGT TCATGGATCG CCTACGCACT GTCGTGAATG TGACTGGCGA TGGCGTTGTG GCTGGAATGG TGTCACACCT TTGCCCGGTG GACGACGACA CTGGGAATGT GCTTTACGTG GACAAAACTG AACAACACGA AGCTGGAGCT GGCTCTTCTA CAGATAGTGA TATCAATCTA AATGCGGTGG AAGTCACGCG AAACTGA
|
Protein sequence | MTTTTKKQLE IDEVQELTEE EIKAPGCGIL GRYPVLSVLI FASAGIGIGL GLSFWEPDDD DDTKDKVIKW LGLVGDLFIR SLKCVVLPLV FINVIISVVD MMNVGRAGSI GWKTIVLYLL TTVIASILGI ISIVSFKGLF EEGEFEEAVP ASVKLGCNQD GEFLTENASG AISCAADSGE SSEFFITDVS MSFVRASGSV RDDISLSDTV YDGVFTKLVT ANIFESFVEA NFAAVVFFAI AFGVAISRVF DQGGGPDKSF ILPFLKELDG VFLTIINWII MITPFAVLSL ISSAIGKQEN LADSFSNVGY LVVATMIAMF FQFLVVHCLL FFIVTRTNPF EYLKHLIPAQ TMAFACASSA ATIPMTLKCV RQTERVPEPV ARFVIPLGAT VNMDGGAIYF PCACIWLAVL NGIQPDAASY LLLVIISTIG SAGTAPVPSA SLVLIITAYN TVFNTTGVPE GFSFILAIDW FMDRLRTVVN VTGDGVVAGM VSHLCPVDDD TGNVLYVDKT EQHEAGAGSS TDSDINLNAV EVTRN
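
The sequences above are displayed in space-separated blocks of ten residues. A minimal sketch of how such a string might be cleaned up and its GC content computed is given below. The helper names are hypothetical, and the percentage is computed over the characters supplied, which may not match how the database derives its GC content field. Only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(text):
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First three blocks of the gene sequence from the record.
example = "ATGACTACTA CAACAAAAAA GCAACTCGAA"
seq = clean_sequence(example)

print(len(seq), seq[:3])
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence string to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field in the table.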