Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45677 |
Symbol | |
ID | 7200424 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 903090 |
End bp | 905307 |
Gene Length | 2218 bp |
Protein Length | 482 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179937 |
Protein GI | 219118320 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCGCATATT GTTCCCCTTT CCTTGCATTC TTTGCCAAGA AATAAACGTA AACGAGTTCC CTCAGATTCC AGATTGACTT TAGGCATGTA TATATAAGAG GCTAGCAAGT ATGAAGACGC AGGACATCAG CCATAGATCA AGAAAACCGC CCGCCGGCCT AGATCCAATA GACAAAGCCG CTCGACCACA GGCTACGACG GCACCACTGG CAGGTCGAAA ACATCGAAAG GGCAACCGCT ATCGGCACCA TTGCGGCGCA TCGAGATGGA TACTCTTAAT GACCGGAATG AGTGTTTGTT ATACTATTCT TGCAGCGGTG GGGTCTCCAG TCAGGCTTTC TTCGTTTTTG CTCAGTATGC TGCCTCGGCT CGTCAGTCCT GAACGCAAAG TGCCTCGAGT AGTGCGCTTG TCAAACAACG ATAACGACAG CGACCACCGC ATTTTTCTAA AGAAATTCAA CCCCAAAGAA AAAATTGCAA ATTCTGTACG ATACATAAAG AGGCCCATGG CAAATAGCGA TGACTTGTTA CGGCTCAAAG ACTCGGATGA GTTTGAGCAC GGGATGGCCG ACCGTTTTGA GACATCAAAG TGCAAGGCTC AATACGACTG GCAAAAAACT TCTTTTGTGA ACTGCAACAC AGTTCACGAA ATTGATATGG TAAAGCCAAT TTCGCTGGAT ATGAAGATTG GCGAAGCTCG CCATGTGGGT AATGGCTTCT GGCGCGACGT ATGGTCTATT CCAGAGGAGA TTACCAATCA GTATCGCGTT CTCAAAACAA TTCGCCTAGA GCATGAGATG ACTCCCCGAA ATTTTGAGCG GCACCGTCGA GATGCCATGG CTATGGAACG TCTATCATCG TCTCCATTAG TGGTGGATAT CTACGCCTTC TGCGGAAATT CAGGAGTTTT TGAGTTTGCA GACGGCGGCG ATCTTTCGGA TACTCTATGG CCAAAGCAAC AAGCGAAGAG GGTAACCAAT AGTCTCAGCA AAAAGAAGAA GCTACGAATT GGTAAGCATT GCTCTACTCT TGAAGCAAAC CTCTTGGGTC CTGCGCTTAT TGTGTTGCTC TTTGCACTTC AGCTACACAA ATTACGTCAG CTCTTGCTGC TGTGCACAAT TTCGATAGGG AAGGAAGAGC ATCAGTGGCA CACGCAGACA TCTCACTGAA TCAATTTATA AAAATCAATG GAAGATTTAA GCTCAATGAT TTTAATCGAG CGCGATTCCT CAGATGGAAT TTAAAACATG ACCGAATGTG CACTTTTCAC GTTGGGAATA ACCCTGGCAA GAACCGATCG CCCGAAGAGT ACAGATACGA AGGACAAACC GAAAAGGTAT GCATGGTATA AAAAATGGAT GCTATGGATT TCTCAAACCC TTTCCCATCT AACTTTGCCG ATCAGATTGA TGTTTATTCA CTCGGCAACT TGCTGTACAG TTTGCTACAG GACGAAATGC CCTTTCATGG AATTGATGAT GACGAAACAC AGGAACTAGT AAAGAAAGGG GAGAGGCCCA GCGTCTATGC TGATTTGTGG AATAGTACTG ATCCCGTCGA TCAAGTTCTA AAACAGACCA TGGTAATGTG CCATGAAGAA GAACAAGACA AACGAGCCAG CGCTAGGGAG GTTGAGGCTC TTTTGAAGGA TGCACTTCGG ATGGTGGATC CACAGTGGAA GGTCACAGAT CCTATCTAAT TGTATCCTTT CGTTATACTG ACATGAAAGT TTTTCAATGC CACAAACTCA AGCAGCTTGC TTTCAGTGAG TCCCGGAGCA CAATCCTTTC AATTGTCACA TACCTCTCGG GTCAAGTGGA CTGCAAATCA GCTTTTGACT CACTTCCTTA TCATGGAGCT TCCTCTCAGC GTTTCCAATT CCTTACGGAC AGCAGCTAGG CGAGTACATA TTTCTTGGGA AATGATGAAA TAGAAAACGT CTATTGGATG CGGGAGGACA AGGCTCTTGC CAGTTGGATA TGGTCGTTGC TGCACTATAA AGATGACGAC GAGGTAATCG GAAACACTTA AAGCATTGCA CTGATTTGAC AGTGAATACA TCTTTTTATG TCAATTCTGC TCACTCATCG CTTTCAAGAT ATTGGGGACC TTAGTTTTTT CTCTCTCGAT AGCTAACTGT AAGTTCCTTC GTCCCACAGT TTTGGGCCGT TTTACGGATC GAACCTGAGG AACAAATTTA ACCGTTTCAA TTAACAGT
|
Protein sequence | MKTQDISHRS RKPPAGLDPI DKAARPQATT APLAGRKHRK GNRYRHHCGA SRWILLMTGM SVCYTILAAV GSPVRLSSFL LSMLPRLVSP ERKVPRVVRL SNNDNDSDHR IFLKKFNPKE KIANSVRYIK RPMANSDDLL RLKDSDEFEH GMADRFETSK CKAQYDWQKT SFVNCNTVHE IDMVKPISLD MKIGEARHVG NGFWRDVWSI PEEITNQYRV LKTIRLEHEM TPRNFERHRR DAMAMERLSS SPLVVDIYAF CGNSGVFEFA DGGDLSDTLW PKQQAKRVTN SLSKKKKLRI ATQITSALAA VHNFDREGRA SVAHADISLN QFIKINGRFK LNDFNRARFL RWNLKHDRMC TFHVGNNPGK NRSPEEYRYE GQTEKIDVYS LGNLLYSLLQ DEMPFHGIDD DETQELVKKG ERPSVYADLW NSTDPVDQVL KQTMVMCHEE EQDKRASARE VEALLKDALR MVDPQWKVTD PI
|
| |