Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_13833 |
Symbol | |
ID | 7202046 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 600074 |
End bp | 601504 |
Gene Length | 1431 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181407 |
Protein GI | 219122133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0243831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTACCAATG AGCGTCCTGT CTCAAAACTC AGACCGTCCA AGCCAATCAC TTCTCGAATT GATGATACCA TATTACGGGT AAGTCAAACG TTGTCGAGCA AGAGGGGGGC CGCGTCGCTC GTAGTGTCTA CGGACGGAAG CCTGGCCGGT ATTATGACTG ATACCGACAT CACACGCAGA GTGGTTGCAA AGCATATTGA TACGTCAGCA ACCTCTGTGA GCGAAGTTAT GACTCCAAAT CCAACGTGTG TTGCCATGAG CGATTCGGCA ATGGACGCAT TGACTACGAT GGTCGAGAAT CATTTCAGGC ATCTCCCTGT TGTAGACGAT CAAGGGTCAG TGGTTGGCCT GTTGGATATA GCGAAATGCT TAAACGATGC AATCAGCAAG TTGGAGCGCA CCAGTGAAAA GACGAATAGT GCTGCGGAGG ACGCGGTGAA GCAGATGGTG GCGCAGCAAG GAGCCGGTGG AGCCCAAGCT GCTGCACTGA AGGCACTCCT CGGCAATCTC ATGTCGCAGG CGTTCGGAGG GAAGCAGATG CCGACGCTTC GTAGTCTCTT GGCAGGAAAG CCAGGGACAC TTGTTGATCC ATCTACGAGC ATTCGTAACT GCGGTCTACG GATGGCAGAT AGTCGCAAAG CGGCTCTGGT AGTCGACGAC GGCGAACTTG TGGGTGTGTT TACTTTCAAG GATATGATGT CGCGAGCCGT GGCTAAGGAG CTAGATCTCG ACGTGACTCC GGTGTCACAG GTGATGACTC CCAGTCCGGA GTTTGTGTCG CCGGATATGA CTGTCTTGGA GGCCTTACAA TCGATGCACG ACAATAAGTT TTTGACTCTT CCGGTTTGTG AGAGCGATGG TCGGGTCGTT GGTCTGGTTG ATGTAATGGA CGTCATACAT GGTTGTGGTG GAGCTGAAGG CTGGAAGTCC ATCTTTAGCA ATGTAATGGA ATTGGACGAT ATCTCTGACG TGCATTCTCT CTCAAAATCA CGCGTATCAG GTCGAAGACT TGGATCACCA TCTCATATTA AAGCTCCACC AGAGACCCCT TACGTAACAA AGCTTCCTGG AAATATTCCG GCGACTTTGG AATTCGAAGA GCCTGATGAC CACGCTTCCT TCAATGGGAG TACAATCGGT GACGAAAGGG GCGTGTCCAA GCTGCTTAGT CCCGACGAAG GAAGTCTTGC CGCGGTTGTC GGTGTTTTTA AAGTCACCGA ACCAAACGGA AGGACCCATC GCATTCGATG CGAAACTTTG GTCACGGAGC TTCTAGAGGC GGTTGCGGAG AAGGTCGATA TTCCCCGAAG CCGTTTGCAG ATCCAGTATG TCGACGACGA AGGGGACACG GTCGTAATAA CGACTGATCA CGATGTCACG GAATCGTGGT CCTTTGCTCG CAAAGCTAAC CAAAAAGTCG CTAAATTGAA T
|
Protein sequence | STNERPVSKL RPSKPITSRI DDTILRVSQT LSSKRGAASL VVSTDGSLAG IMTDTDITRR VVAKHIDTSA TSVSEVMTPN PTCVAMSDSA MDALTTMVEN HFRHLPVVDD QGSVVGLLDI AKCLNDAISK LERTSEKTNS AAEDAVKQMV AQQGAGGAQA AALKALLGNL MSQAFGGKQM PTLRSLLAGK PGTLVDPSTS IRNCGLRMAD SRKAALVVDD GELVGVFTFK DMMSRAVAKE LDLDVTPVSQ VMTPSPEFVS PDMTVLEALQ SMHDNKFLTL PVCESDGRVV GLVDVMDVIH GCGGAEGWKS IFSNTPYVTK LPGNIPATLE FEEPDDHASF NGSTIGDERG VSKLLSPDEG SLAAVVGVFK VTEPNGRTHR IRCETLVTEL LEAVAEKVDI PRSRLQIQYV DDEGDTVVIT TDHDVTESWS FARKANQKVA KLN
|
| |