Gene Information |
Locus tag | PHATRDRAFT_21354 |
Symbol | |
ID | 7202165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 366371 |
End bp | 367997 |
Gene Length | 1627 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181364 |
Protein GI | 219122044 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.875011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGTGAAGCC GTCATCACGT CACGTTCACA AGACTTGATG TACAGAGAGA CCAGAGGCTC GCACAAAGCA GTCGAAATCT CGCTGGTCGA TTCCAATCGA TCCGAAAACG TACTTTCGCT CCCACAACGC GTTAGTACGC GCGCATGTCC GCTTCAGTGG CGTCGGCACA AGCCATTCTC CGCAAAGGAT GGCCGCTCGT GAGTCAAGCG GCATCGAATC CGTACGCGAA CACGGGCAAA TGGATATTGG GAACCGCTGG TCTCGTCGTA GGCATGATCC ACGTGGGAGG TGTGACGCGG TTGACGCAAT CGGGCTTGTC CATGACGGAC TGGAGTCCAC TAGGATCGCT CCCACCGATT AGTAAAAAAG ACTGGGAGAA GGAGTTTGAT CGATACAAAC TTTTTCCAGA ATGGCAACAA CGCAAGTCAA TGACACTGTC CGATTTCCAG TTCATTTACG CCTGGGAATA TGGACACCGT ATGCTCGGTC GTTTCGTTGG TGTCGCGTTT GCCATGCCTT GGATGTACTT TACATTCAAA GGTAGGATAC CGAAGGGGTA CCAAAAGCGC ATGGTGGGAC TACTCGCCAT GGGTGGCACT CAAGGTATGG TTGGGTGGTG GATGGTAAAA TCCGGTCTAG GGGACGATCG GCGTGACGAA AAGCGTGAAA TTCGTGTTCG TCCCGTCCGT TTAACTTCCC ACTTATCCAT GGCCCTGGCA ACCTACGGTG CCCTGCTCTG GACGGGATTT GATATACTCG GTATTCCACA AGAAGCTAAT ATAAAGGATC AAGTCAAAAA GCTTAGCAAA GACGCGCTGC GTCACGCGCA GAAGCTTCGA TCTACCAGCC TCGTCTTGGC GGGCTTGACG TTTTGCACGG CTGCTTCAGG CGGTCTTGTG GCCGGAAATG ATGCAGGCCG TGCGTACAAT ACTTGGCCAA CTATGGGGGA TGAATGGATT CCTTCGGAAA TTATGGATTT GGTGCCTTGG CAACGTAATC TCACCGAAAA TACCGCCACG GTACAGTTCA ATCACCGCAT TCTAGGGACA ACGACCGCGT TTACCGCTCT CTATTTGGTA GGCGCGGGCC TGTCGCGAAA CCGTGGCATG CTTTTGACTC CCCAGGCTCG CAACGGTCTT TATGCGGTCG GAATAGCAGC GACCGGACAG TTCGCCCTCG GAGTCACAAC TTTGCTAACG TACGTACCTT TTTCGTTGGC GGCAGCCCAT CAGCTCGGGA GTGTTGTGGT TTTTACCAGC GGTCTGTATT TGGCGCACAG CTTGCGATAC GCACGTCCAG CATTGGTCCG GGCGGTAGTG ACATCCTCCA TATCGACCTC TGCCAGTACG ACGTCTCGCG GTGTTACAGC CACGGTCGCA CAAGTTGCTG CTTCGGGTGC CAAGGCTGTG TAACGATACT GTTAGGTTTT TTTTGTTGCA CGCGCAAGTC GGAACGAACA AAACCAATAT TGATTCCAGC ATGTTTAAAC GTACTTGGCC TACTCTATGT CTCGTAGGAA TATAGCAAGA AATGAGACTG CATGGTGCCA GGCTCATCCA GTTTAACGGA GGCTATTGAC CGTGCTACCC ACATTCGATG ACCGTGGTGT AGCCAGGCAG AAGCCGG
Protein sequence | MSASVASAQA ILRKGWPLVS QAASNPYANT GKWILGTAGL VVGMIHVGGV TRLTQSGLSM TDWSPLGSLP PISKKDWEKE FDRYKLFPEW QQRKSMTLSD FQFIYAWEYG HRMLGRFVGV AFAMPWMYFT FKGRIPKGYQ KRMVGLLAMG GTQGMVGWWM VKSGLGDDRR DEKREIRVRP VRLTSHLSMA LATYGALLWT GFDILGIPQE ANIKDQVKKL SKDALRHAQK LRSTSLVLAG LTFCTAASGG LVAGNDAGRA YNTWPTMGDE WIPSEIMDLV PWQRNLTENT ATVQFNHRIL GTTTAFTALY LVGAGLSRNR GMLLTPQARN GLYAVGIAAT GQFALGVTTL LTYVPFSLAA AHQLGSVVVF TSGLYLAHSL RYARPALVRA VVTSSISTSA STTSRGVTAT VAQVAASGAK AV
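
The coordinate and sequence fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database: it assumes Start bp and End bp are 1-based, inclusive coordinates (which is consistent with the Gene Length listed above) and that the spaces in the displayed sequences only separate 10-letter blocks. The helper names `span_length`, `ungroup`, and `gc_fraction` are hypothetical and chosen for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)
```

With the values from this record, `span_length(366371, 367997)` returns 1627, matching the Gene Length field. Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared against the GC content field.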