Gene Information |
Locus tag | PHATRDRAFT_10800 |
Symbol | |
ID | 7197764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 719444 |
End bp | 720559 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178288 |
Protein GI | 219114985 |
COG category | |
COG ID | |
TIGRFAM ID | |
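The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 719444
end_bp = 720559

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1116 0 371
```

Under these assumptions the result agrees with the listed Gene Length (1116 bp) and Protein Length (371 aa).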
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.929663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTCG AACGAAAGAC AACGATTGGA AAAGCAATAT CGGCTCCACT ATCAACCATG GCCCTGGCGT TGACCGTGGC GAATTTGGGC GTCATACCTT TCTCGTCAAG CGTTTATAGT ATGATCAACC AATACTTGGT GCCGCTTGCT GTTCCGATGT TGCTTTATGA TAGTGATATT CGTCGCGTCA TTCGAGACAC CGGGACTCTC TTGCTAGCAT TTGGTGTCGG TGCAATTGCC ACCGTTGTTG GTACGCTGGT TTCCTTTCCC ATTCTACCAA TGACATCGCT GGGTGATGAT GGATGGAGAG TAGCCTGTGC TCTGGCGGCG CGCCACATTG GAGGAGCCAT CAACTTCGTG GCAGTTGCGG AAACGCTGCA GATTTCGGGC ACAGTCGTCT CGGCCGCGAT TGCAGCCGAC AATGTCGTTG TGGCTCTATA TTTCGCATTT CTATTCGCAA TTTCGAATGC TGATCAGGTG GATGGACCCT CATCAGATTC CGGAACAAGC GACGCTTTGG AACTCGATGC AAGCGGAAGC GAAGTCGAGT ATTCGGAGGA TAGTATTTCC TTGTCATCAT TGGGTATTGC CCTGTCTGTA GCGTCAGGAC TAGTAACGAC CGGAAGAATC TTGACAAACT CTGTTTTGCC ATTGGGGACC TCCGCATTGC CACTGACGTC CGTGCTTACG GTAGCTGCGG CCACCATCTT CCCGAAGTTT TTCGTCAATA TACGTGCCGC TGGGAGTGCG CTTGGTATCT TGTGCATCCA AATGTTCTTT GCAGCGTCGG GTGCAGCTGG CTCGATTAGC CTCGTTATGC AGAAAGCCCC ATCGTTGTTT GCGTTTTCAG CGTTGCAAAT TGGTGTCCAT TTTGGCGTCC TCATGTCGGT CGGCCGCGGT ATCTTTCGGA TCCCGAGTAA GGAGCTCTAC TTAGCATCCA ATGCGAATGT CGGGGGGCCT ACGACTGCCG CCGCAATGGC AAAAGCCAAA GATTGGAAAT CATTGGTACT GCCAGCTCTG CTAGTGGGAA TTCTTGGCTA CGCATCTGCA ACCGCAATTG CCCTAGCACT CGGGCCAATT CTTGTCCGGC TTCCTCTTAT TGGCGAAAAA TCCTAA
Protein sequence | MSLERKTTIG KAISAPLSTM ALALTVANLG VIPFSSSVYS MINQYLVPLA VPMLLYDSDI RRVIRDTGTL LLAFGVGAIA TVVGTLVSFP ILPMTSLGDD GWRVACALAA RHIGGAINFV AVAETLQISG TVVSAAIAAD NVVVALYFAF LFAISNADQV DGPSSDSGTS DALELDASGS EVEYSEDSIS LSSLGIALSV ASGLVTTGRI LTNSVLPLGT SALPLTSVLT VAAATIFPKF FVNIRAAGSA LGILCIQMFF AASGAAGSIS LVMQKAPSLF AFSALQIGVH FGVLMSVGRG IFRIPSKELY LASNANVGGP TTAAAMAKAK DWKSLVLPAL LVGILGYASA TAIALALGPI LVRLPLIGEK S
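The relationship between the two sequences above can be spot-checked by translating pieces of the gene sequence. The sketch below assumes the standard genetic code, because the Translation table field in this record is blank; the `translate` helper and `CODON_TABLE` name are hypothetical and not part of any database tooling.

```python
from itertools import product

# Assumption: standard genetic code (the record leaves "Translation table" empty).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCACTCG AACGAAAGAC AACGATTGGA"))  # MSLERKTTIG
# Last 15 bases of the gene sequence (in frame, ending in the TAA stop).
print(translate("GGCGAAAAA TCCTAA"))  # GEKS
```

Both results match the start (MSLERKTTIG) and end (GEKS) of the listed protein sequence. Passing the full gene sequence to the same function would allow a complete comparison.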