Gene Information |
Locus tag | PHATRDRAFT_45174 |
Symbol | |
ID | 7200214 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 404080 |
End bp | 405633 |
Gene Length | 1554 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179201 |
Protein GI | 219116813 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGCATTTTG TTGGTGAGAT TTCGCTCCCT TTCCCGTCAG AGAGACACGG CTATACACGG CTATACACAC ACATACACAC ACATACATCA CACAATCACC ATGAGCCGTG CGCCAACGTC CGCCGCGACG AAACCGAGTC GTCTACTGTT CGCTTCCTGC AACAGTCAGC ACCGTCCCCA AACACTGTGG CCGGCTATTC GTAGCCGCAA CGCCACAGCC TTTGTGTGGG CCGGTGATGC CGTGTACGCC GACGACTTTG ATACCGTCTA CACTTTTGGT TGGCCCCGCA TCGCCCGACT CCAATCGCGG GACGCCACAC CATCCGAACT TGCCGGACTC CTACACGACC AATTGACCCA CGACGGTTAC CAGTCGTTGG TGAAGGACCC CACCGTGACG ATTCTGGGAG CCTTGGATGA TCACGATTAC GGTGCCAATA ACGGGGACCT CACCTACGCA CACAAGGCCG CCAACGGCGC CGCGTACGTG GACTTTTTGA CCGCACAGAA ACCCTACCGC CTGGATGCGA TGCGTCAAAG AGCCGTTCGG GGACAAGGCG TCTACGGAGT GCAAGTCTTC GACTTTAGTC GACCCGTGGG TCACGAACTT GTTGATGAAG CGGAGGCCGG CATCGAACCG GATCATCACG AACACGAATA TAACGACAAC CACAACGGCA ACAAGGAGTC GTCACCATCA GTGCTTTCTA ATCAATCCGT AGCCGTCTTT CTCCTGGATG TGCGCTCCAA TAAAACTCCC TGGAAACAGC AGGGATTGGG AAAGTACCGG GTGGATTACG CTGCGGATTT TTTGGGTCCC GACCAGTGGG TCTGGTTCGA AACGGCCCTC CGGAGATCGA CCGCGACCGT CAATGTTATT GTACAAGGAC TACAAGTACA CGCTGACCGG TACATTGACG GGAACGTGGT TGAAGACTGG AGCCGCTTCC CCGCGGCCCA GCACCGTCTC TACCAAACCA TACTGCAATC CAACGTCCAA GCTCCCATTC TCGTATCGGG TGACGTGCAC ATGGCGGAAC TACTGCGCAA GGACTGTCAA CCAGTGGGTC GTAGGCAAAC TGATGGGGAG GCCGACGGTG ATCACCGCGA CGCCAATCGC ATGCTCCTGG AAGTTACCAC AAGTGGCATG ACGCATTCCT GGGGATCGCA CATATGCGCG CGGCCGGAGT CGAGTTGGTC CTGTCGCAAC GCCTACGTCG ATTGGAGTAT GAGTATGGGG ATGCACGTGG CTCATCACAA CGGTGCCTGG ACGGATCTGG TGGACCTAGA ATCAGCGGAA GAAGGCGCCA AAGCAGGAAT ACAATACAAT TTAGATCTAA ATTTTGGAGA ATTCGAGTTT GACTGGGAAG CAAAAGAGGT GAAAATTCGG ATTCACGGTA ACAATGTAAA CGCAACTGCG CCTTATTTGA GTACGCGGTG GGATTTTGAT ACATTGTCGG GTAAGACCCC GGCACCCCCG ACGGGGCTCG TGCGTGATCG TGATTTTGAC GACCTGCGAC GTGACCTCCA GCATCACGGT GTCGAGATTA CTGA
|
Protein sequence | MSRAPTSAAT KPSRLLFASC NSQHRPQTLW PAIRSRNATA FVWAGDAVYA DDFDTVYTFG WPRIARLQSR DATPSELAGL LHDQLTHDGY QSLVKDPTVT ILGALDDHDY GANNGDLTYA HKAANGAAYV DFLTAQKPYR LDAMRQRAVR GQGVYGVQVF DFSRPVGHEL VDEAEAGIEP DHHEHEYNDN HNGNKESSPS VLSNQSVAVF LLDVRSNKTP WKQQGLGKYR VDYAADFLGP DQWVWFETAL RRSTATVNVI VQGLQVHADR YIDGNVVEDW SRFPAAQHRL YQTILQSNVQ APILVSGDVH MAELLRKDCQ PVGRRQTDGE ADGDHRDANR MLLEVTTSGM THSWGSHICA RPESSWSCRN AYVDWSMSMG MHVAHHNGAW TDLVDLESAE EGAKAGIQYN LDLNFGEFEF DWEAKEVKIR IHGNNVNATA PYLSTRWDFD TLSGKTPAPP TGLVRDPSRC RDY
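The derived fields in this record (Gene Length from Start bp and End bp, and GC content from the sequence) can be recomputed as a sanity check. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, which is consistent with the values shown (405633 - 404080 + 1 = 1554). The function names `gene_length` and `gc_percent` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G/C bases in a sequence.

    Whitespace is ignored so the 10-base groups used in the
    'Gene sequence' field can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(404080, 405633))
    # Toy sequence for illustration only, not from the record.
    print(gc_percent("GGCC AATT"))
```

To check the record's GC content, the full Gene sequence field can be passed to `gc_percent` and the result rounded to a whole percent for comparison with the reported value.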