Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41350 |
Symbol | |
ID | 7199209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 248013 |
End bp | 249665 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185296 |
Protein GI | 219130279 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCGC AACAGGACGA TCCTGGGTTC ACCTTTCCTC AGGAGGTGGC GTACACCGTC GCCCCGATCA TTTCAGGTCT CTTGTCGACC TTGGGATCAA CGGCGATTAT CTGGATGATC TTGACGGATT GGAATCGAAA AATTCGTCGT GTCAAGTACC GAATATTGTT GGGCTTGAGT TTGTCGGACG CCCTGAGTTC GATAGTACAG ATGTTCTGGG GAATCATGCT GCCCAAGGGG ACACCGGGTT CATGGGGGGC TATTGGGAAC AAGGCAACCT GCAGTGCGCA CGGATTTATT CTGCAGTTTG GCCTTTCGGG GAGCTTTTAC AATACGGCGT TGTCAGCCTA TTTCTACAAA TCCATCTGCT TCGGCATGAC CGACGCAACG TTCGCCGCCA AGTACGAACT CTGGATTCAT CTGACGTCCG TTCTCTTTCC GTTGGCAACC GCCGTAGGCG CATTGATTCT GGACATTTAC AGTGTCACCG GAGGAGGGTG TTACATTGCT CCGGACCCGC TACGATGTCA CCGCCGCGAC GACGTAGAGT GCCTGCGGGG AGAAAACGCC TACAAATACT CCTGGGCAGT CGCCGGAGTA CCCGTGGTCA TTTTCCTCCT CTATATTACC TACACGATGT TCCGGATTTA TCAAAAAGTT CGCCAAGTCT CCCGACGCTC GGAACGATTC GAGTTTCGCT CAACGCGCGT GTCCTACGAA ATGCCCGATT CGGAAAATCC GTCGGAAGAA CAACGGCGAC AGTCCGTGAA CGGCCAAGAT AGCAACGACA ACTTTTCGCT CCGCGAATTG GTTGAGCAGC ATGGGGATAC CACGGAACTG CAGCGGCCTC CCGCGGTTTC ATTCCAGCTC ACTGGTGGCA ATACTGGCAC GTCATCCGGA ACCGCTTCAC CTCCTCCGAC TCACCCTCTT CCTTCGGCTT CGTCGCGGGG AAGGACTTCT TTCAACAGCA GAAGATCCAT CCAGGATTCC AACCGTGTCA GCCGTATCCG AGAGACAGCC ATCCAAGCTT TCCTATACGT CGTTGCTTTC TTTGGAACCC ACTTTCCGTC CTTTATTCTC AACAATTTGG AAATGTTTGG GGGCACCAGT CCCTTTTATT TGGTCTTCTT GGCTTCATTC GCCTGGCCCT TGCAAGGTTT CTTCAACCTC TTTGTGTTTT TGCGACCGAG AATACGTAGC TGCCGTCGAC AAGAGCCATC CTTGTCCTAC TGCAAGGCCG CCTATCTGGC GTTGTTCCAC TACGACGAAG CTCGCGGTCG TGTCAACGAG TCGCAACTCA CGGACGCCAC CCCGGATGCG GCGAAGTTTC CTAGTGGCTC GGACTCGTGC AACGGCAGCC AAGCTCTGCA AATGATACGT GTATCACGTC TCGAATCCAT TGACGACAAT GACTATCGAG AAGAAAGCTT GTCTTCTGCG TCCGCCTACG AAGACGAAAC CAACAACGGG CCCTCCACGC TCGATACCGT TGAAAGCTAC CCAGCGAGTC AAGTCCCCGA CACGGCAACG ATGGCGGACG AAGAAAGCTA CTCGCCGGAC TCTTTCGAAG CTTCCAAGGA AATCAAGCAC GTTGTTTCTC TTTTACGGAC AAAAGAGCAC CCGGAAGAGA ATCAGGACCA TGACCGAAAT TAA
|
Protein sequence | MASQQDDPGF TFPQEVAYTV APIISGLLST LGSTAIIWMI LTDWNRKIRR VKYRILLGLS LSDALSSIVQ MFWGIMLPKG TPGSWGAIGN KATCSAHGFI LQFGLSGSFY NTALSAYFYK SICFGMTDAT FAAKYELWIH LTSVLFPLAT AVGALILDIY SVTGGGCYIA PDPLRCHRRD DVECLRGENA YKYSWAVAGV PVVIFLLYIT YTMFRIYQKV RQVSRRSERF EFRSTRVSYE MPDSENPSEE QRRQSVNGQD SNDNFSLREL VEQHGDTTEL QRPPAVSFQL TGGNTGTSSG TASPPPTHPL PSASSRGRTS FNSRRSIQDS NRVSRIRETA IQAFLYVVAF FGTHFPSFIL NNLEMFGGTS PFYLVFLASF AWPLQGFFNL FVFLRPRIRS CRRQEPSLSY CKAAYLALFH YDEARGRVNE SQLTDATPDA AKFPSGSDSC NGSQALQMIR VSRLESIDDN DYREESLSSA SAYEDETNNG PSTLDTVESY PASQVPDTAT MADEESYSPD SFEASKEIKH VVSLLRTKEH PEENQDHDRN
|
| |