Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49341 |
Symbol | |
ID | 7195501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 633391 |
End bp | 635702 |
Gene Length | 2312 bp |
Protein Length | 661 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184037 |
Protein GI | 219127634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTATTGTCAA CATGGGGTTT CTGTTCTGGC TAGCGACGTC CGTGATGTTG ATGACTTGGA TCATCCAAAT CACATCGTTT CCAACAAAAG TAAATCTTAG AATAGAGAAA TCTGGAGTCG TTTGCCTGAG AGGTTCTTTA GACTACGAGT ATGTCCCACC AAACGAAGGT TTGTTGCAGT TTGAAACCAA TTTAGCGAGC AGTTATCCGG AGAACACACC CGCTTCCTTG CGCGGGGAAG CTGTTCGCAG TGCCTTACGT TCCGGACAGT GTATTGGATG GAAATTGGAC GAAACGCCTC TAAAGTTTGG CGTTGTGGGT GTCAATGGCG ATGGCTGTTT GAGTTTTCTC AACAACAAAT TAACCCAAAC GTTTTCCGCC TCTGGTCTCG TAGATTCCCT TGGATCTTTC TACCAGGCTT GTTTGCTGAA CGGAAAAGGG CGCCTTGTAG ACCGATTATC CGTCGCGATC ACGAGTGTCG ACTCTGGATA CATGTTGACG TCGCCCGGTC ACGCAGGAGA AAGTCTTTTT CGCAAGTTGG ATCCGTATAT TTTTCCCCTC GACCAAGTCG AGCTCATTGA TAGCACCAGC TCCTCTTGCA TTTTTGACCT CGTTTCAACC AATATCGAAG ATGTTCGAAC AGTTTTCAAC GAGCAAATTT TACCGCGAAT TGACCTTAAG AACAACAACT GTTTTCAGCT GCCTTCGGCT CGACAATGTA CGATGATTCC TCTACAGGGT GGTGTTGATC CTTCTCTACT CATTGTACCC TCAACAAGCG TCAATTCGTG TGCGGCCGTC GGCTACACCT TTTGCTTCTT AAATGATCAC GAAAAGATAG GAACACATGT TTGGGACTTT CTGATTTCGG ATGGTAACAC GAAAGGACCC GTCCAGATCG GTGCTCTTGA GTTTGAAAGC CTGCGCATCG AATCGGGGTC CGTGGGGTTT GAACGTGAAA TGCTGGCCAA CCAAAAGGAC TCATTTCTTG CACCACCCAC GCCGCTAGAA TTGCATTTAG ATTATACAAT CAATATGGAA AAAGGATGTT ACCTTGGTCA GGAAGGGATC GCGTCCGTCG TCAAGAATCC AAGAGGGCCA CCTCGAATGC TGTATCAGGT TGTTTTCGAT GATGATTTCA ATGTTTACGA TTACCAGTCA GCTGGTGATC GAAGCATCGT GGAAAACTTG ACAAGAGTTC CGAGAGCAGG AGACAAAGTG TTTGTGCTGG GCAGCAATGC AGAAATTGAA GTTGGAGTGT TGACGTCAGT TGCCGAGCCA GGGTCCACGG GAGAGCCTGT AACTGTCGGA TTAGCGCTGG TCCGACGAGC AGACTCGATC ATCAAAAAAA TGAAAGCCAA AAACTTGGAA ATCAATCGTC GCATTGAGGT AGATACGCCA ATGGAGGGTG GTTCTGGAAT GATACCTCCG CCTGCTCTTG ATCCTTTGGA CGGGTTGGAA GTAATCATTG GTGGCGGTTC GGCCGTTGGT TCTTTGCGCG GGGTACCGTC GCGACGATTT CGGAACGGTC GCAATATGTT TGACAATATT CCAGACTTTG TGAACCAATT ATCCCAAGAG CAAGATGGTG AATTTCTCAT GGCTAATCGT AACAGCGATG GCACTCGGTT CCAACCGGAT GCCCCTGCCG CAAAAACTGC TTTTCTGCCA ACTGAAGACG GCGCAGCCAA CCTTGGCGAC GATGAGGACG ACCTGAAAAC CTTACAAAAG ACTGCAGCGA AAGCAAGAAA AGAAGCCGCA GCAGCGGCGG ATGAAGCGAG GCGGAAAGCT GAAAAAATGG ATTCGCTGCA AAAGCGTGCG GAAGAGGCGA TGGCCCGCCG CAAGGAAAAG GTAAGGGTAG AAACTGCGGC TGAGAGTACC AATGAGGACG AGGCCGATTC AGAAGCAAAG CGCAAGGCTG AAAAGATGGA ATTGCTCAAA AAGCGTGCGG AGGAGGCCGT TGGTCGTCGG GCACAAAAGA ACCAAGGAGG CGGATAAAGT CACAAAAGAC TATAATTTCT AGGAGATCCT GAGAAGATTG GAATGAAGAT CAGCTTGTTT CTTCGCATTT TGCAGCGGGA ATTTTGACTG CCTGGGGTCC CTAAAGAAGT AGCTGTCCGT TTGGACTGGA CTACTATGGC AAGGCAACGA TGAACTTCAC TTTTCGATGC TGTCAGAACA AATCTTTCAC GACGAGGTCG AATAATGTAC TGTTGTACAG GAGGAAGCCA CCAAGCACGC GTCCCTGCGA TGATCTATTA ATTAGCATTC ATCTGAGATT AGCTCAATTT ACATTAAACC TCTCGTTCCA TT
|
Protein sequence | MGFLFWLATS VMLMTWIIQI TSFPTKVNLR IEKSGVVCLR GSLDYEYVPP NEGLLQFETN LASSYPENTP ASLRGEAVRS ALRSGQCIGW KLDETPLKFG VVGVNGDGCL SFLNNKLTQT FSASGLVDSL GSFYQACLLN GKGRLVDRLS VAITSVDSGY MLTSPGHAGE SLFRKLDPYI FPLDQVELID STSSSCIFDL VSTNIEDVRT VFNEQILPRI DLKNNNCFQL PSARQCTMIP LQGGVDPSLL IVPSTSVNSC AAVGYTFCFL NDHEKIGTHV WDFLISDGNT KGPVQIGALE FESLRIESGS VGFEREMLAN QKDSFLAPPT PLELHLDYTI NMEKGCYLGQ EGIASVVKNP RGPPRMLYQV VFDDDFNVYD YQSAGDRSIV ENLTRVPRAG DKVFVLGSNA EIEVGVLTSV AEPGSTGEPV TVGLALVRRA DSIIKKMKAK NLEINRRIEV DTPMEGGSGM IPPPALDPLD GLEVIIGGGS AVGSLRGVPS RRFRNGRNMF DNIPDFVNQL SQEQDGEFLM ANRNSDGTRF QPDAPAAKTA FLPTEDGAAN LGDDEDDLKT LQKTAAKARK EAAAAADEAR RKAEKMDSLQ KRAEEAMARR KEKVRVETAA ESTNEDEADS EAKRKAEKME LLKKRAEEAV GRRAQKNQGG G
|
| |