Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50634 |
Symbol | |
ID | 7199465 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 30551 |
End bp | 32183 |
Gene Length | 1633 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185598 |
Protein GI | 219130916 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.324652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTTCC AAATCTACGA CGGGAATAAC TCTCAAAAGC GCAAAAACGC TGCCGTAATG AAGGATGGCT CTTCCTCCGT CTTTCAGATT CGCCTTTGGG CGGTGTTTCT TTTCACAGCT TTACTTGGAT CGGTATCGAT TCTACTAACG CTCTCCAGCG CTTTCACTAA TCAGATTAAT TTCTGCTCAG AGTCTGCTCG GCAATATGAA CCAGGGCCAG TTTCTTGGAG ACAAAGCGCC ATGGCATTTC AAGCAGTCAA AGCCCATCAA AAATTCACGG AAAAGAGTAT TCAGCGGGAA ATGCCAGTCA CCCTTGCGCA GGCCCATCCC ACTTCATCAA TGATACTTAT TCGCGCTCTG GGCAACGCCT TACCGCCGAG ACATAGTACG CAGCAAACTT TGGAAAATCT TGACTTCATT CTGGCGCACG AAGAGTCGTT TCCCAACACA ACACGACACT GGTTTGTCAA TCGCTTCGTC GATCCCGAGG TGGAACGACT AGTCCTGGAC CGACTTCAGA AGGCTAACGA ATCCTACACT GTCATTCCAT TTGACTTGCA GGTCTACGAC AAAATTGAAT ACGCATATGA TCGGATCCCC AAGGACCAGA TTCATCTCCC TCCTGAAACA ATAGAAGGGG AGGGACTCAC GGAGAAGGAA ATACTTCTTA CCGAAGAGCA GATACAGCAT GACAAGATTC TATATGTCAT CAATATTAAT GGGGTCAGGA ATGCCATGCT AGACTACGGC CGCAAACATT CCAGTGCTGA ATACATTCTT CCTTGGGACG GTAACTGTTT CATGACGCGA AAGGCGTGGT CTTCGATCCA GTCGTCTTTG GCCGAAAATC CGCTTGCCAG GTACTTCACA ACGCCCATGG ATCGACTCCA AGAGCCCAAC GAAGCTCTAC TATCGGATAT GTACGTGCAC AATGCGGTGG AAGAGCCGCA GATCATTTTT CATCGATCGG CTCGGTCCGA ATTTAATGAA AAACTTCGGT ATGGACGAAG AAACAAAGTT GAGCTGCTTT TGAGGCTTGG TGTGTCGGGG CCTTGGGACA AATGGCCGTG GTTGGATAGT GAGAAGGCAA TTCTGGATCC AGCCCACGCT TCTGACGCTG TTGGCGATGT ACCAGTAGCT GGTTGGATAA CGCGGCTGTA TTCAGGAACC AAAGCTGCCG AAGTATCTGG TACAATTCGT TTTCGAGGCA TACTTCGAAG CCATGCCGTC ACATCGCTTC TGGAACGGCT TGACCTTCGA GCTGCACAAG ACATACACGG CTTGACTTCT TCAACATTAC TTTTCTTCAA CGAAAAGCAA TTGATGGGGG AGCGCAACCT CTGGAAGGCA GGTGAGCAGA AAGAAGTCTT TCGAGAGCTA GTGCAACTGG CTGACCAAGC GCTACTGTTT GGACCTTGGT CGGTTATGGA TAAGCAGCCT TACGGCTGTG GTGTATCGGG AAATTGTCAT GAATATTTTC ACCCGTCGCC GTACAGGTGG CCACAGAGGA ATGAATCTGG ACACATCGAC TGGTCGAAAC CGTTTGAGCG GCATGACGGT ATGCGTGCGC CCGGTACATC CCTCTTTAGC GCCGGAAGTG AGCAGTATGA TCGATCTGGT TTGGCTGCAA TGA
|
Protein sequence | MVFQIYDGNN SQKRKNAAVM KDGSSSVFQI RLWAVFLFTA LLGSVSILLT LSSAFTNQIN FCSESARQYE PGPVSWRQSA MAFQAVKAHQ KFTEKSIQRE MPVTLAQAHP TSSMILIRAL GNALPPRHST QQTLENLDFI LAHEESFPNT TRHWFVNRFV DPEVERLVLD RLQKANESYT VIPFDLQVYD KIEYAYDRIP KDQIHLPPET IEGEGLTEKE ILLTEEQIQH DKILYVININ GVRNAMLDYG RKHSSAEYIL PWDGNCFMTR KAWSSIQSSL AENPLARYFT TPMDRLQEPN EALLSDMYVH NAVEEPQIIF HRSARSEFNE KLRYGRRNKV ELLLRLGVSG PWDKWPWLDS EKAILDPAHA SDAVGDVPVA GWITRLYSGT KAAEVSGTIR FRGILRSHAV TSLLERLDLR AAQDIHGLTS STLLFFNEKQ LMGERNLWKA GEQKEVFREL VQLADQALLF GPWSVMDKQP YGCGGHRGMN LDTSTGRNRL SGMTVCVRPV HPSLAPEVSS MIDLVWLQ
|
| |