Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14067 |
Symbol | |
ID | 7202448 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 561002 |
End bp | 562102 |
Gene Length | 1101 bp |
Protein Length | 271 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181583 |
Protein GI | 219122503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.952518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCCTA GTAGCCTCGT GCGACTAGTG AAGCCGTATC TATATACCTT TCGGAGTAAC GCCAAATTTC GCTGGTTTGG ACGAACAATT CTGGACGTTT ACGTCAGCGA GTTTGGGAGT TACCCAGAGT CGTACTACCG TACGGCAATT CAACAAGGTC GTATACGGGT CGGAAACGAA AAGGTGGATG TTGCATATAC TATACGATCC AACGACGTGC TAACCCATAC TGTACACCGT CACGAACCGG CCGTAGCGGT GTCCCAACCG CAGGCGCCAT TTGTGAAAGT TGTTGCCAAT TCGGAGACTT GGCTAGTGGT GGATAAACCT GGGACAATGC CGGTGCATCC TAGTGGCGCC TACCACTTAA ATTCGCTGCT ACCAATTTTG GAAAATACCT ACGGAAAGTT GTATCCCATC CATCGGCTCG ATCGTCTAAC AAGTGGCTTG GTTATCTTGG GGAAAACCCC TGAAGCTGCG AGGCAATTGG GGAAGGCAAT CAAGGAAAGA GACTCTTGCA CAAAACTGTA CATAGCTCGA GTTCGTGGTC GTTTTCCTTT CAACTGTGCA TCACACGTTC CGAACTTGTC CAGCCATAAG TCATACCCTC CACGGTATGG AGAATGGTCT GTGCTCCAAG ACATGGATGG CAAAAAAGAT AGTACCGGTA AGATTCGCAG TCGAAACTGT CATGGCTATA TGTTTGAGGA TATAAAGGGA ACGGTTCGAA ATGATTTGAC GTTGCAAACA TTTGGTAGCA AAACTGGAGG TAGATTGGAG GACTGGTTGC AAGCTCTTGA ATGCGAGGAC ATTTGTCAAA CCAATTTCTC GAGCAATAGC TTTGTGTGGA TGCGTCTCTG TTGCCCTGTA CGAGTAGAGG AACCGAAAAA CGGGATTTGC AAAGCTGGAA TATTTGACGA ACTCGACGAT AAAACTTACC ATGAAACAGT GAAAGCTGCC GAGACGTCTT TTGCCTTGCT TAAGTTTGAT GCCAAGTCTG ACTCGAGTGT GGTATTGTGC CGACCGGCAA CTGGTCGCAC CCATCAAATT CGGTTACACC TACAATACTT GGGCCATCCA ATCGCAAACG ATCCGAATTA T
|
Protein sequence | MAPSSLVRLV KPYLYTFRSN AKFRWFGRTI LDVYVSEFGS YPESYYRTAI QQGRIRVGNE KVDVAYTIRS NDVLTHTVHR HEPAVAVSQP QAPFVKVVAN SETWLVVDKP GTMPVHPSGA YHLNSLLPIL ENTYGKLYPI HRLDRLTSGL VILGKTPEAA RQLGKAIKER DSCTKLYIAR VRGRFPFNCA SHVPNLSSHK SYPPRYGEWS VLQDMDGKKD MKAAETSFAL LKFDAKSDSS VVLCRPATGR THQIRLHLQY LGHPIANDPN Y
|
| |