Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41170 |
Symbol | |
ID | 7199027 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 126615 |
End bp | 128295 |
Gene Length | 1681 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185130 |
Protein GI | 219129931 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0989203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTGC TTGATATGAT GGTAGCTTGT CAGCTGTTCT TTCAAATAGC CACTATCTCG GCATTCTCTT TTCGCAGGGT ATCTTTCGGA ACTATTTCAT CGCGCCGTAC CCAACACCAA TCTCAGCTCT TCTCAAAGAC CGAGAGCAGT CGCAGCGCTC GTACTGAAGC AGCCACACGA TCTCTTGAAA ATTTGCGAGA GAGGCAAATG GAAGAACTGG CAGAAACCGA TCGTCTTTTG CAGCAAATCC GGCAAGTCGA GGTTAGTAGT CACAGCCCTA CCAATATATC GAGCACAAAC AAAGCGGCTG CCTCCATTCT AGCCGGAGTC GATTACGGAT TTCAAAGTCG GAGTGAAGGC GCAAGTTTTT CCGACCTCAA TGGCGGATCG CCTGCATTTG AAGGCTACGG ACCACCCTCG AATTTGTGGA AACTGGGGAC GCAGCAGTTT ATGCGTAATC TCAACGCCAT GAAAGGTGAG TACGCGGACG AGACCGACTT CGCTCTAACG GACTCGCAGA AAGAATTGCA CGCTCAGCTA GACGCCTTGA CGCTCAACGC CACCGGAATT TGGGACAGAG AAATGCAGAA TGGGCCAATT GAGGCCCCAT TCGTGATCAA GATACCGTAC TTTGGGCTCT GCTATATGCT TGATGAGGTA TTTGATGGCA AGTACATTCC ATCGCGTTTC TTTTTATTGG AAACAGTTGC GCGCATGCCT TATTTTTCGT ACATCACAAT GTTACACTTG TACGAAACTC TAGGTTTCTG GAGACGATCA GCGGGCATGA AGAGGATACA TTTTGCGGAA GAACTCAACG AATTTCACCA CCTGCTTATA ATGGAAAGTT TGGGGGGCGA CCAAGCTTGG TGGGTACGGT TTTTGGCTCA GCATTCAGCA ATCGTATATT ACGTCGCACT ATGCCTTTTA TGGGGTATAT CACCATCACT TTCATATCGC TTTTCGGAGC TGCTCGAAAC CCATGCCGTG AGCACCTACG GACAATTCTT GGACGAAAAC GAGGAAGCTC TCAAAAAGCT GCCACCGCCA CTTCCCGCTA TCGAATACTA CGCATTTGGT TCCTCCGATC CATTCTACGC GGAATTCCAA ACTACCGCCA TGTCCCAGGG TCAACCGGTA AGGTCCGACT TTCATGACAA TCGCCTGCCA GGATTATACT TGCGTACTAG GGCTAACTGA GCTTCGTCTT GTGCTTTCAT AGCTGCGGCG GCCTGGTGAG TCCATGCTGA GCTTGTACGA GGTGTTCCAA GCCATTAAGG CAGATGAGCT GGATCACGTC AGCACCATGG AAGCATGTCT CGATCCCGAA GCCAACACCC GATCCCCCTC CGTTGAAAAA CGCATCTTGC TCGGCCTAGC GTTGATTTCA ATAGTTGGAT TTACGGCATC AAATCTAGGC GGGGAGGCTT CCTTGATAGA TTCATTACCA GCAGATGTTG TCGGCGAAAC TTCAACGGGG GGAGCGGTTG ATGCCGTCGT GGCTAGTATT AGTGCGGCAG CGGCGAAATT TTCACTCGAC GAAACTTCCC AAGGTGGCTT GGGCAAAGCA GCGGTAGAGT TGGAAGAGCT AGGAGCCACC GGAGCTTTGC TTGAGGGGAG TCGCCGTGCC GTAATTGGGG CGTTGCAAGC TGTCCTTCGG TTCATAGGTA TTCTTCTGTA A
|
Protein sequence | MKLLDMMVAC QLFFQIATIS AFSFRRVSFG TISSRRTQHQ SQLFSKTESS RSARTEAATR SLENLRERQM EELAETDRLL QQIRQVEVSS HSPTNISSTN KAAASILAGV DYGFQSRSEG ASFSDLNGGS PAFEGYGPPS NLWKLGTQQF MRNLNAMKGE YADETDFALT DSQKELHAQL DALTLNATGI WDREMQNGPI EAPFVIKIPY FGLCYMLDEV FDGKYIPSRF FLLETVARMP YFSYITMLHL YETLGFWRRS AGMKRIHFAE ELNEFHHLLI MESLGGDQAW WVRFLAQHSA IVYYVALCLL WGISPSLSYR FSELLETHAV STYGQFLDEN EEALKKLPPP LPAIEYYAFG SSDPFYAEFQ TTAMSQGQPL RRPGESMLSL YEVFQAIKAD ELDHVSTMEA CLDPEANTRS PSVEKRILLG LALISIVGFT ASNLGGEASL IDSLPADVVG ETSTGGAVDA VVASISAAAA KFSLDETSQG GLGKAAVELE ELGATGALLE GSRRAVIGAL QAVLRFIGIL L
|
| |